[Genome] Not all identities returned by blat

Brooke Rhead rhead at soe.ucsc.edu
Fri Dec 8 17:53:15 PST 2006


Hello Johanne,

I tried BLATing your example sequence against sacCer1 using different 
parameters.  I used -minIdentity=100 to get only matches with 100% 
identity.  Strangely, using the default parameters, I got 19 full-length 
matches, one more than you found.  I am using Standalone BLAT v. 33x5.

I got more 100% identity results by lowering the tileSize.  With 
-tileSize=6 I found 22 full-length, 100% hits.  With tileSize=7, I found 
20, and with tileSize of 8 to 11, I found 19 hits.

I should also point out that with web-based blat on our site (at 
http://genome.ucsc.edu/cgi-bin/hgBlat? ), your example query gives 20 
100% identity, full-length matches.  Of course, some of the results have 
a span greater than 46 (the length of the query).  We have advice on 
replicating web-based BLAT results here:

http://genome.ucsc.edu/FAQ/FAQblat#blat5

(actually, this title is somewhat misleading . . . these are 
instructions on replicating web-based BLAT using gfClient and gfServer, 
not blat.  You still may find them useful.)

Also, if by "perfect matches" you mean something other than the way I 
interpreted your meaning, please let us know, and we will try to advise 
you on how to alter the parameters.

-- 
Brooke Rhead
UCSC Genome Bioinformatics Group


Duhaime Johanne wrote:
> Hello
>  
> I am trying to find all perfect matches of 45-55 oligos on the
> Saccharomyces cerevisiae.
>  
> I would like to use blat instead of megablast because of the speed since
> I  have a lot of oligos.
>  
> A blast with a certain sequence will return 20 matches 100% identity on
> the full oligo lenght but with blat I get 18. And this happens for quite
> a lot of sequences. I always get less matches with blat. 
>  
> I am struggling with the parameters but I cannot get the same results.
>  
> Ex: TAGTCGCACTAGTCCTGACGTTGATGCTGGCAGTGGTAGTAGCACT  blast=20 blat=18
>  
> I am using the following : 
>  
> /apps/programs/bin/blat x11.seq ../../../sgd/all.fsa -out=blast8
> x11.blat
>  
> I have tried to change tileSize an stepSIze without success. 
> The matches for the above sequence that do not appear in the blat result
> are in the neighborhood of another match. 
>  
> Database, blast and  blat are installed locally. 
> The megablast command used: 
> /apps/programs/sources/blast-2.2.15/bin/megablast -d ../../../sgd/all -i
> x11.seq -o x11.blast -m 8 -F f -p 0
>  
> Is there a parameter that will conciliate the result or say differently
> how can I have all the perfect matches with blat?
>  
> Thank you very much in advance
>  
>  
> Johanne Duhaime
>  
> _______________________________________________
> Genome maillist  -  Genome at soe.ucsc.edu
> http://www.soe.ucsc.edu/mailman/listinfo/genome


More information about the Genome mailing list