[Genome] question about UCSC 'Human ESTs Including Unspliced' track

Kayla Smith kayla at soe.ucsc.edu
Thu Oct 11 18:12:06 PDT 2007


Hello Xu Jerry,

Thank you for your close reading of the EST description page.

There are many genes with introns longer than 200kb.  In fact it's 
possible to use the Table Browser to make a Custom Track of just the 
introns of the UCSC Genes track, if you're interested in looking at 
introns.

Unspliced ESTs in the browser are not "pre-mRNAs".  They are ESTs where 
we don't find any canonical splice sites when we align the EST.  These 
could be single exon ESTs that will never be spliced, DNA contamination, 
or possibly even "pre-mRNAs" that got caught before splicing that are 
included the genbank data set.

Meanwhile, if you do find any data on our website that seems 
questionable to you, feel free to send us specific examples and we can 
look into them.  More often than not it's a case of us simply displaying 
what is available from genbank (we don't curate here!), but sometimes it 
is a bad alignment or filter.

I hope this information is helpful to you.  Please don't hesitate to 
contact us again if you require further assistance.

Kayla Smith
UCSC Genome Bioinformatics Group

xu_jianzhen wrote:
> Dear Genome Help,
>   I've a question about UCSC 'Human ESTs Including Unspliced' track.The methods of this track said,"To generate this track, human ESTs from GenBank were aligned against the genome using blat. Note that the maximum intron length allowed by blat is 750,000 bases, which may eliminate some ESTs with very long introns that might otherwise align".However,to my knowledge of biology,i do not think there are too much such long pre-mRNA that is more than 750kb, so my question is when I find a unspliced EST more than 200kb,does it mean this is a false positive?
>  
>  
> Thank you all!
>  
>  
>  
> Xu Jerry
> GIBH,CAS 
> 
> _______________________________________________
> Genome maillist  -  Genome at soe.ucsc.edu
> http://www.soe.ucsc.edu/mailman/listinfo/genome



More information about the Genome mailing list