[Genome] EST sequences
Brooke Rhead
rhead at soe.ucsc.edu
Thu Sep 13 18:08:08 PDT 2007
Hello Vidar,
To display ESTs from a particular library in the Genome Browser, first
turn on the "Human ESTs" track (in the "mRNA and EST Tracks" section).
Then go to the track control page, either by hitting the small button on
the far left-hand side of the track, or by clicking the blue "Human
ESTs" link right above the track control. The track control page
contains several options for filtering the data displayed in the EST
track, one of which is "library". Enter the library name of interest in
the box. You may need to add wildcard characters to the library names,
as we store longer names than your example in our database. For
instance, the library name "2NbHMSP" used as a filter term yields no
results, but the name "*2NbHMSP" matches these two libraries:
Soares 2NbHMSP
Soares_multiple_sclerosis_2NbHMSP
If you click on the EST in the Genome Browser, you will be taken to a
details page, where there is a link to the EST sequence.
To get a lot of EST sequences at once, you will need to download the
entire file of EST sequnces that we get from GenBank. (The Table
Browser can be used to retrieve genomic sequence from the areas where
ESTs align, but the EST sequence itself is not stored in a table and is
not available via the Table Browser.) The file is located here:
http://hgdownload.cse.ucsc.edu/goldenPath/hg18/bigZips/est.fa.gz
Be aware that this is a very large file (~1.4G). The file is in FASTA
format.
The Table Browser *could* be used to get a list of the EST accession
numbers in a particular library, but due to the huge size of the all_est
table and the complexity of the query, it is very time-consuming to use
it for this purpose. It is much easier to use our public MySQL
database to obtain the information:
http://genome.ucsc.edu/FAQ/FAQdownloads#download29
Here is an example MySQL query that will get you a list of accessions in
the second library listed above:
mysql> SELECT gbCdnaInfo.acc FROM gbCdnaInfo, library WHERE
gbCdnaInfo.type='est' AND gbCdnaInfo.library=library.id AND
library.name='Soares_multiple_sclerosis_2NbHMSP';
I hope this information is helpful. If you have further questions,
please feel free to contact this mailing list again.
--
Brooke Rhead
UCSC Genome Bioinformatics Group
Vidar Blikstad wrote:
> Hello!
>
> I´m a user of the table browser - can you help me to obtain EST
> sequences from a UniGene library (i.e.2NbHMSP). How to display these
> at the genome browser?
>
> Sincerely
>
> Vidar Blikstad
> Uppsala university
>
>
> _______________________________________________
> Genome maillist - Genome at soe.ucsc.edu
> http://www.soe.ucsc.edu/mailman/listinfo/genome
More information about the Genome
mailing list