[Genome] help with a gene list

Ann Zweig ann at soe.ucsc.edu
Fri Dec 1 11:54:37 PST 2006


Hello Omid,

	You can obtain such a tab-delimited file by using the Table Browser on our 
website.  Press the "Tables" link in the blue navigation bar across the top of 
the browser window.  I will give you step-by-step instructions for the first
500,000 bases of chrX; you can extrapolate from there.  We have several gene 
annotation tracks on the browser.  I will explain how to use the Known Genes 
track, but if you want to use another track, you can change the 'track' and 
'table' selections in the instructions.

Configure the Table Browser like so:

genome: Human
assembly: Mar. 2006
group: Genes and Gene Prediction Tracks
track: Known Genes
table: knownGene
position: chrX:1-500000
output format: selected fields from primary and related tables

	Press the "get output" button.  On the page that appears, choose the fields 
from the knownGene table that you would like to view.  In your case: name, chrom, 
strand, txStart and/or cdsStart, and txEnd and/or cdsEnd.  This will provide all 
of the information you asked for except "other aliases".

	To include other aliases, scroll down this page and check the box next to 
the table named "kgXref".  This table includes other gene names such as 
SWISS-PROT, RefSeq, etc.  After checking the kgXref box, scroll to the bottom of 
the page and press the "Allow Selection From Checked Tables" button.  Now, in 
the hg18.kgXref section, select any other names you would like to see in your 
output.

	Depending on your selections, your output will look something like this:


#hg18.knownGene.name	hg18.knownGene.chrom	hg18.knownGene.strand	hg18.knownGene.txStart	hg18.knownGene.txEnd	hg18.kgXref.mRNA	hg18.kgXref.spID	hg18.kgXref.spDisplayID	hg18.kgXref.geneSymbol	hg18.kgXref.refseq
NM_018390	chrX	+	132991	160020	NM_018390	Q9NUJ7	Q9NUJ7_HUMAN	PLCXD1	NM_018390
NM_199326	chrX	-	214971	222590	NM_199326	Q96H01	Q96H01_HUMAN	PPP2R3B	NM_199326
NM_013239	chrX	-	214971	267627	BC063429,NM_013239,	Q9Y5P8,Q9Y5P8,	2ACC_HUMAN,2ACC_HUMAN,	PPP2R3B,PPP2R3B,	NM_013239,NM_013239,
BC063429	chrX	-	214975	267445	BC063429	Q96FD8	Q96FD8_HUMAN	PPP2R3B	NM_013239
BC063429	chrX	-	214975	267445	BC063429,NM_013239,	Q9Y5P8,Q9Y5P8,	2ACC_HUMAN,2ACC_HUMAN,	PPP2R3B,PPP2R3B,	NM_013239,NM_013239,
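
	If you later want to read this output in a script, the following is a 
minimal Python sketch, not an official UCSC tool.  It assumes the header line 
has the "#database.table.field" form shown above and that fields are separated 
by tabs.  The function names (parse_table_browser_output, split_multi) and the 
SAMPLE text are made up for illustration.  Some kgXref fields hold 
comma-separated lists with a trailing comma and repeated values, so the second 
helper splits and de-duplicates them.

```python
def parse_table_browser_output(text):
    """Parse tab-delimited Table Browser output into a list of dicts.

    Assumes the first non-blank line is a header such as
    "#hg18.knownGene.name<TAB>hg18.knownGene.chrom<TAB>...".
    """
    # Drop blank lines, which may separate records in a pasted copy.
    lines = [line for line in text.splitlines() if line.strip()]
    if not lines:
        return []
    # "#hg18.knownGene.name" -> "knownGene.name" (drop "#" and the database).
    fields = [
        name.strip().lstrip("#").split(".", 1)[-1]
        for name in lines[0].split("\t")
    ]
    records = []
    for line in lines[1:]:
        values = [value.strip() for value in line.split("\t")]
        records.append(dict(zip(fields, values)))
    return records


def split_multi(value):
    """Split a value like "Q9Y5P8,Q9Y5P8," into unique items, in order."""
    items = []
    for item in value.split(","):
        item = item.strip()
        if item and item not in items:
            items.append(item)
    return items


# Illustrative sample with a subset of the columns shown above.
SAMPLE = (
    "#hg18.knownGene.name\thg18.knownGene.chrom\thg18.knownGene.strand\t"
    "hg18.knownGene.txStart\thg18.knownGene.txEnd\thg18.kgXref.geneSymbol\t"
    "hg18.kgXref.spID\n"
    "NM_018390\tchrX\t+\t132991\t160020\tPLCXD1\tQ9NUJ7\n"
    "NM_013239\tchrX\t-\t214971\t267627\tPPP2R3B,PPP2R3B,\tQ9Y5P8,Q9Y5P8,\n"
)

for record in parse_table_browser_output(SAMPLE):
    print(record["knownGene.name"], split_multi(record["kgXref.geneSymbol"]))
```

To use it on a saved file, read the file's contents into a string and pass 
that string to parse_table_browser_output.  The keys keep the table name 
(for example "knownGene.name") so that fields with the same name in two tables 
do not overwrite each other.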


	I hope this helps you get started using the UCSC Genome Browser.  Please don't 
hesitate to write back if you need more guidance.

Regards,

----------
Ann Zweig
UCSC Genome Bioinformatics Group
http://genome.ucsc.edu


omid gulban wrote:
> Hello All,
> 
> I am a new user of the UCSC genome browser system.
> 
> I would like to obtain a tab-delimited file containing the following information from the most recent Human genome.
> 
> gene name
> other aliases
> chromosome
> start
> end
> strand
> 
> Thank You
> Omid
> _______________________________________________
> Genome maillist  -  Genome at soe.ucsc.edu
> http://www.soe.ucsc.edu/mailman/listinfo/genome



