[Genome] Blat 35-mer perfect matches
Ann Zweig
ann at soe.ucsc.edu
Thu Aug 16 09:49:40 PDT 2007
Hello Derek,
The -minMatch=N sets the number of tile matches. The default is 2 for
nucleotide, 1 for protein. This is the number of contiguous tile hits necessary
to seed an alignment attempt. Setting it higher is probably not good for
sensitivity.
Perhaps you should try the -minScore parameter, as well as setting the -stepSize=5.
After BLATting, try using the pslCDnaFilter program to filter the PSLs (start
with these parameters: pslCDnaFilter -minAlnSize-35 -minID=1 -minCover=1).
This should give you a place to start -- you'll have to tweak the parameters to
get the exact output you're looking for.
Regards,
----------
Ann Zweig
UCSC Genome Bioinformatics Group
http://genome.ucsc.edu
Derek Chiang wrote:
> What parameter settings on the standalone blat would you recommend to
> find perfect sequence matches of 35 bases in the human genome? I've
> tried the parameter settings of -minMatch=3 -minIdentity=100 but noticed
> that some query sequences lacked any hits. I assume that tiles with low
> complexity are filtered out, thus failing the -minMatch requirement.
>
> Thanks,
> Derek
> _______________________________________________
> Genome maillist - Genome at soe.ucsc.edu
> http://www.soe.ucsc.edu/mailman/listinfo/genome
More information about the Genome
mailing list