[Genome] how to get human protein mapped by tBlastn?
Pauline Fujita
pauline.fujita at gmail.com
Fri May 30 18:57:34 PDT 2008
Hello Xianjun,
The human proteins track uses the table you mentioned, blastHg18KG, to
look up the identifiers of human proteins that align to a given region
of the zebrafish genome. These identifiers are listed in the column
"qName" of the blastHg18KG table. In your example, the identifiers
associated with the region chr14:4,399,990-4,421,902 are:
M19154
NM_003238
BC096236
The browser retrieves the geneIds of these identifiers from the table
"blastKGRef04" in the human, Mar 2006 (hg18) assembly. Normally, when
you click on the "describe table schema" button, in addition to
describing the table, all the related tables are listed. Unfortunately,
we do not list related tables from assemblies in *other* organisms, and
so you will not see this second table listed in the usual place.
To retrieve the IDs you are looking for, select in the table browser:
clade: vertebrate
genome: zebrafish
assembly: July 2007
group: All Tables
database: danRer5
table: blastHg18KG
region: click "position" and paste in the region you are interested
in searching
output format: selected fields from primary and related tables
then click on "get output". This will take you to a menu where you can
select which columns you want to appear in the output. Select "qName"
and click "get output".
This should output all the identifiers as a text list. Copy this list,
then return to the main table browser menu. Now select:
clade: vertebrate
genome: human
assembly: Mar 2006
group: All Tables
database: hg18
table: blastKGRef04
region: click "genome"
output format: selected fields from primary and related tables
identifiers: click on "paste list" then paste the list of identifiers
from zebrafish into the textbox and click "submit"
Then click on "get output" and in the "select fields from
hg18.blastKGRef04" section, click on "geneId". Finally, click on "get
output" and this should give the list of protein names you are looking for.
Hopefully this information was helpful and answers your question. If you
have further questions or require clarification feel free to contact the
mailing list at genome at soe.ucsc.edu <mailto:genome at soe.ucsc.edu>.
Regards,
Pauline Fujita
UCSC Genome Bioinformatics Group
http://genome.ucsc.edu
Xianjun Dong wrote:
> hi,
>
> I want to get the human protein name mapped by tBlastn in another genome
> browser? For example, when looking chr14:4,399,990-4,421,902 in
> zebrafish, it's mapped to human protein TGFB2 in the track of "Human
> Protein Mapped by Chained tBlastn". How can I get the name of 'TGFB2' by
> query some table?
>
> I looked into the schema of table blastHg18KG, which only has info of
> human mRNA mapped. From mRNA to protein info displaying on the track,
> where did you get the data?
>
> Regards,
>
>
More information about the Genome
mailing list