[Genome] retrieving genomic information by NP_ accessions
Archana Thakkapallayil
archanat at soe.ucsc.edu
Mon Dec 4 12:37:35 PST 2006
Hello Yael,
This task can be accomplished using the Table Browser. The 'protAcc'
field is located in the 'refLink' table, but you can't get the 5' UTR
sequence directly using this table. To obtain this information, first
you need to find out the refGene ID's corresponding to your protein
accessions and then get the 5' UTR sequence for your refGenes.
To do this press the "Tables" link in the blue navigation bar across the
top of the browser window. Then make the following selections in the
Table Browser:
clade: vertebrate
genome: human
assembly: Mar. 2006
group: Genes and Gene Prediction Tracks
track: RefSeq Genes
table: refLink
click on "filter: create" button and then paste a white-space separated
list of your protAcc into the textbox "protAcc does match" and then
click "submit".
Back on the main page, set "output format: selected fields from primary
and related tables" and hit "get output" button.
On this page check the box for "mrnaAcc" from the refLink table and then
hit "get output". This gives you the list of refGene id's corresponding
to your protein accessions.
Now back on the Table Browser, select table "refGene" and region:
"genome" and then paste the list of your mrna accessions using the
"paste/upload" list buttons.
choose "output format: sequence" and hit "get output".
select "genomic" and click "submit".
On this page under 'Sequence Retrieval Region Option', check the box for
"5' UTR Exons" and then click "get sequence".
This gives you the 5' UTR sequence for the refGene ID's corresponding to
your protein accessions.
I hope this information is helpful to you. Please be sure to write back
if you need further instruction.
Regards,
Archana
UCSC Genome Bioinformatics Group
Yael Altuvia wrote:
> Hi,
>
> What would be the best way to get genomic sequences (e.g. 5' utrs) for a
> batch query
> Where the query entries are accessions of protAcc either NP_ or XP_
>
> thanks
> yael
>
>
>
> _______________________________________________
> Genome maillist - Genome at soe.ucsc.edu
> http://www.soe.ucsc.edu/mailman/listinfo/genome
>
More information about the Genome
mailing list