[Genome] BLAT match, score, percentage - how to select best alignment

Anton Kratz anton.kratz at googlemail.com
Wed Feb 28 21:13:36 PST 2007


Hi,

I am aligning some 10.000s of sequences against the human genome with BLAT
gfServer/gfClient. One sequence = one psl outputfile. Often I get several
alignments for each sequence. I would like to know how to best interpret
(parse) the psl-output; I only want to select one single (the best)
alignment. I am unsure how to determine which alignment is most probably the
"real" position of my sequence.



   - Can I just take the alignment that has the highest number in the
   "match"-column and take this entry as the "best" alignment?



   - The order in which the alignments appear in a psl-file, has this
   something to do with the quality of the alignment?



   - Why does the UCSC BLAT-website produce a score and percentage value
   (unlike the standalone BLAT), and what is the idea behind these values? I
   tried to understand what score and percentage value mean from looking at the
   source-code in the FAQ-Entry (how to reproduce those values), but do not
   understand the actual meaning of the values.


Thanks a lot,
Anton


More information about the Genome mailing list