[Genome] question about ucsc genome browser?
Brooke Rhead
rhead at soe.ucsc.edu
Wed Dec 19 14:24:53 PST 2007
Hello Zd,
You can find the number of non-overlapping RefSeq Genes using the Table
Browser and the tools at Galaxy (which is run by Penn State University).
I will walk you through the steps.
In the Table Browser (click on "Tables" in the blue bar at the top of
the page), select the clade, genome, and assembly you wish to work with.
Then choose:
group: Genes and Gene Prediction Tracks
track: RefSeq Genes
table: refGene
region: genome
output format: all fields from selected table
Check the box next to "Send output to Galaxy", then click "get output"
and "send query to Galaxy". You will be taken to the Galaxy web site,
where you can use their tools to manipulate the data. You should see
the data from the RefSeq Genes table listed on the right side of the
page, and a list of tools on the left side.
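If you would rather inspect the same output outside Galaxy, here is a
minimal Python sketch for reading it. It assumes the "all fields" output
was saved as tab-separated text whose first line is a header beginning
with "#" and naming the chrom, txStart, and txEnd columns. The function
name read_intervals and the two sample rows (NM_TOY1, NM_TOY2) are made
up for illustration and are not real refGene records.

```python
import csv
import io


def read_intervals(handle):
    """Yield (chrom, start, end) tuples from tab-separated text.

    Assumes the first line is a header row prefixed with '#', and that
    it names the columns 'chrom', 'txStart', and 'txEnd'.
    """
    reader = csv.reader(handle, delimiter="\t")
    header = next(reader)
    header[0] = header[0].lstrip("#")  # '#bin' -> 'bin'
    col = {name: i for i, name in enumerate(header)}
    for row in reader:
        if not row:  # skip blank lines
            continue
        yield row[col["chrom"]], int(row[col["txStart"]]), int(row[col["txEnd"]])


# Hypothetical sample data, not taken from the real refGene table.
sample = (
    "#bin\tname\tchrom\tstrand\ttxStart\ttxEnd\n"
    "0\tNM_TOY1\tchr1\t+\t100\t500\n"
    "0\tNM_TOY2\tchr1\t-\t400\t900\n"
)
intervals = list(read_intervals(io.StringIO(sample)))
print(intervals)
```

Looking columns up by name rather than by position means the sketch
does not depend on the exact order of fields in the table.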
Now click on the "Operate on Genomic Intervals" group in the tools
section and choose "Merge the overlapping intervals of a query".
Select the refGene table in the drop-down menu and click "execute".
Once the job has run, you can expand the item on the right-hand side and
see a count of the items. For the human (hg18) database, the total
number of genes in the refGene table before merging is 25,931, and after
merging it is 17,960.
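To illustrate what the merge step does, here is a rough Python sketch
of merging overlapping intervals per chromosome and counting the
results. It is not Galaxy's implementation. It assumes half-open
(start, end) coordinates, ignores strand, and leaves intervals that
merely touch unmerged. Galaxy's tool may differ on any of these
points. The function name merge_intervals and the toy data are
hypothetical.

```python
from collections import defaultdict


def merge_intervals(intervals):
    """Merge overlapping (chrom, start, end) intervals on each chromosome.

    Assumes half-open coordinates, so two intervals overlap only when
    the next start is strictly less than the current end.
    """
    by_chrom = defaultdict(list)
    for chrom, start, end in intervals:
        by_chrom[chrom].append((start, end))

    merged = []
    for chrom in sorted(by_chrom):
        spans = sorted(by_chrom[chrom])
        cur_start, cur_end = spans[0]
        for start, end in spans[1:]:
            if start < cur_end:
                # Overlaps the current merged span: extend it.
                cur_end = max(cur_end, end)
            else:
                # No overlap: emit the current span and start a new one.
                merged.append((chrom, cur_start, cur_end))
                cur_start, cur_end = start, end
        merged.append((chrom, cur_start, cur_end))
    return merged


# Toy data for illustration only.
toy = [
    ("chr1", 100, 500),
    ("chr1", 400, 900),
    ("chr1", 1000, 1200),
    ("chr2", 50, 60),
]
result = merge_intervals(toy)
print(len(toy), "intervals before merging,", len(result), "after")
```

The count after merging is the number of distinct regions covered by
at least one interval, which is why it is smaller than the number of
rows in the original table.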
If you would like to see the merged data as a custom track in the Genome
Browser, click on the Galaxy link to "display at UCSC main".
I hope this helps. If you have further questions, please feel free to
write back to us at this list.
--
Brooke Rhead
UCSC Genome Bioinformatics Group
zxu wrote:
> What is the total number of unique RefSeq genes (no overlapping exons) in the
> genome? How can I get this number?
>
> zd