[Genome] BLAT gives wrong tStarts for translated searches.

Kevin M. Carr carrk at msu.edu
Fri May 4 14:53:34 PDT 2007


Hello all,

I am attempting to align RefSeq mRNA sequences from related species to an
in-house draft genome assembly.  I am doing the alignment in protein space
with the -t=dnax and -q=rnax.  While the results for alignments to the +
strand appear normal I am getting incorrect (or at least confusing) output
for alignments to the - strand of the target.  More specifically the
individual tStarts locations do not fall within the T start and T end
positions given.  Here is and example:

psLayout version 3

match   mis-    rep.    N's     Q gap   Q gap   T gap   T gap   strand  Q
Q       Q       Q       T               T       T       T       block
blockSizes       qStarts  tStarts
        match   match           count   bases   count   bases           name
size    start   end     name            size    start   end     count
----------------------------------------------------------------------------
----------------------------------------------------------------------------
-------
442     122     0       0       3       951     3       846     +-
SINFRUT00000182842      1599    78      1593    stig_26 225935  221426
222836  4       180,228,36,120,  78,288,552,1473,
3099,3330,3585,4389,

As can be seen, the T start is 221426 and T end is 222836 but the tStarts
are given as 3099,3330,3585,4389

It seems that every alignment to the - strand is incorrect while every one
to the + strand seems fine.  I have run this using command line BLAT (32X1)
and gfServer/gfClient (33X7) (gfServer started with -trans and gfClient run
with -q=rnax) and the results are the same either way.

Am I missing something?

Insights and suggestions greatly appreciated.
 
Kevin M. Carr

**************************
Bioinformatics Specialist
Research Technology
  Support Facility
202-D Biochemistry Bldg.
Michigan State University
East Lansing, MI  48824

Ph: (517) 353-6794
Fax:(517) 353-8638
**************************




More information about the Genome mailing list