[Genome] Detail about table knownGene
Ann Zweig
ann at soe.ucsc.edu
Wed Apr 2 10:43:06 PDT 2008
Hello Samuel,
To create the Known Gene track for the May 2004 assembly (hg17), we used a
process we call KG II. You can read about the process by pressing on the
'mini-button' to the left of the actual track display, or by clicking on the
hyperlinked track name in the track controls (below the display). To cluster
together the genes in this track, we used a program from our source code called
hgClusterGenes. Then among the genes that overlap, the longest gene
is chosen to be the canonical gene to represent the cluster.
Since then, we have changed the way we create the Known Gene track. On the
next human assembly (hg18), we use the KG III process. You may want to take a
look at this track as well for comparison sake.
I hope this information is helpful to you. Please don't hesitate to contact
the mail list again if you require further assistance.
Regards,
----------
Ann Zweig
UCSC Genome Bioinformatics Group
http://genome.ucsc.edu
Please feel free to search the Genome mailing list archives by visiting our home
page, clicking on "Contact Us", then typing a word or phrase into the search
box. On that same page
(http://genome.ucsc.edu/contacts.html), you can subscribe to the Genome mailing
list.
Samuel GRANJEAUD - IR/IFR137 wrote:
> Hello!
>
> I am using May2004 assembly. I was wondering what is the rational behind
> cluster that links together entries in the knownGene table.
>
> Best regards.
>
> _______________________________________________
> Genome maillist - Genome at soe.ucsc.edu
> http://www.soe.ucsc.edu/mailman/listinfo/genome
More information about the Genome
mailing list