[Genome] a bug?
Ann Zweig
ann at soe.ucsc.edu
Fri Nov 17 08:27:08 PST 2006
Hello Silvia,
This is an interesting question and I spent some time looking into it. And
now I have an explanation for you.
When I used Blast on the NCBI site to Blast the sequence from BC088267
against rat, I got a very good hit, as you did. The reason for this is because
it is Blasting against the entire genBank library of mRNAs. It does align well
to the mRNA, but I'm pretty sure that it's not really aligning to the actual rat
genome.
Now, when I use the UCSC browser to BLAT the sequence onto the rat assembly
(rn4), it BLATs very poorly, as you noticed. In fact this mRNA does not meet
our requirements for alignment to the rat genome. It does, however, meet our
less-stringent requirements for aligning to the human genome, which are: "Only
those alignments having a base identity level within 1% of the best and at least
25% base identity with the genomic sequence".
The reason that you cannot simply find a hit for BC088267 when you search
for it in the rat assembly is because it does not meet minimum alignment
criteria on this assembly. If you search instead for 'distrobrevin', you will
get a hit in the rat assembly, but the details page will explain that the mRNA
did not align well enough to the assembly.
If you view the Rat Chain track in the human assembly, at the location of
your mRNA, chr6:15,373,603-15,991,950, you will see that it is in the middle of
a large gap in a chain to the rat assembly on chr17. So, there is a large chain
from the human chr6 to the rat chr17, but this mRNA falls in a part of the human
sequence that does not align well at all to rat. If you click through from the
top (pink) section of the Rat Chain track to the details page, then click on
"View details of parts of chain within browser window." you see the gaps in the
rat assembly if you turn on the Gap track. The conclusion is, that the rat
assembly does not yet contain the DNA to match this mRNA.
I hope this helps explain the anomalies you noticed. Please be sure to
write back if you have more questions.
Regards,
----------
Ann Zweig
UCSC Genome Bioinformatics Group
http://genome.ucsc.edu
Silvia Jimenez wrote:
> Dear Sir
>
> Is there a problem with 'BLAT'?
>
> 1) I have the following GenBank/EMBL sequence BC088267; and I wanted to
> 'Blat' it but I retrieve no results (very bad matches on the genome). On
> the other hand if I take this sequence and perform a blast on the Rat
> genome at the NCBI the sequence perfectly matches the genome.(this
> happened only with Rat Blat).
>
> 2) There was another problem that seems to be resolved, after "blating"
> when one went to details and when one wanted to click on together, there
> was an error but it seems to be OK now.
>
> Many thanks for your useful tool, there is a life before and after Blat.
>
> Silvia
>
More information about the Genome
mailing list