[Genome] Coding exons of genes in the human genome

Brooke Rhead rhead at soe.ucsc.edu
Wed Jan 9 13:34:09 PST 2008


Hi Siddarth,

Are you using the hg18 database?  I do not see any entries in the 
hg18.kgXref table like '1-Mar' or 'C10orf10'.

Is the kgXref table you are using identical to the one located here?:
http://hgdownload.cse.ucsc.edu/goldenPath/hg18/database/kgXref.txt.gz

--
Brooke Rhead
UCSC Genome Bioinformatics Group


SiddarthGautham Selvaraj wrote:
> Hi,
> 
> 
> I am a graduate student from Arizona State University. I am trying to
> extract the list of canonical genes from KnownCanonical table. To do that, I
> first need to organize genes based on gene symbols.
> 
> The values contained as gene symbol in kgXref table are not very precise. It
> has entries like
> 
>  1-Mar  2-Mar  3-Mar  4-Mar  5-Mar  6-Mar  7-Mar  8-Mar  9-Mar  1-Sep  2-Sep
> 3-Sep  4-Sep  5-Sep  6-Sep  7-Sep  8-Sep  9-Sep  10-Sep  11-Sep  12-Sep
> 15-Sep  1-Dec and
> 
>  C10orf10  C10orf104  C10orf107  C10orf108  C10orf11  C10orf110  C10orf111
> C10orf113  C10orf114  C10orf116  C10orf118  C10orf119  C10orf12  C10orf120
> C10orf122  C10orf125  C10orf129  C10orf130  C10orf132  C10orf137  C10orf18
> C10orf22  C10orf25  C10orf26  C10orf27  C10orf28  C10orf30  C10orf32
> C10orf33  C10orf35  C10orf38  C10orf4  C10orf46  C10orf47  C10orf49
> C10orf53  C10orf54  C10orf55  C10orf56  C10orf57  C10orf58  C10orf59
> C10orf6  C10orf61  C10orf62  C10orf63  C10orf64  C10orf65  C10orf67
> C10orf68  C10orf71  C10orf72  C10orf73  C10orf76  C10orf78
> which I guess are junk entries. Thus I would require HGNC symbols from some
> other table so that I can compare these values and get only the proper gene
> symbols and hence proper genes.
> 
> Please let me know which table has the HGNC symbols.
> 
> Please help me in this.
> 
> Thanks
> siddarth
> 
> On 08/01/2008, Ann Zweig <ann at soe.ucsc.edu> wrote:
>> Hello Siddarth,
>>
>>         If you have successfully extracted the sequence for all Known
>> Genes, but you
>> only want the sequence for the canonical gene in each gene cluster, then
>> my
>> suggestion would be to use the output from the knownCanonical table to
>> filter
>> out those genes from your list that you are not interested in.
>>
>>         If this does not address your issue, please feel free to write
>> back to the
>> genome list.
>>
>>
>> Regards,
>>
>> ----------
>> Ann Zweig
>> UCSC Genome Bioinformatics Group
>> http://genome.ucsc.edu
>>
>> Please feel free to search the Genome mailing list archives by visiting
>> our home
>> page, clicking on "Contact Us", then typing a word or phrase into the
>> search
>> box.  On that same page
>> (http://genome.ucsc.edu/contacts.html), you can subscribe to the Genome
>> mailing
>> list.
>>
>>
>>
>>
>> SiddarthGautham Selvaraj wrote:
>>> Hi:
>>>
>>> I am siddarth, a graduate student in Arizona State University. I have
>> been
>>> trying to get the coding exon sequences of all genes in the human
>> genome.
>>> I define genes as an unit of the genome which codes for something. I do
>> not
>>> need gene variants. That is a gene may get alternatively spliced to
>> produce
>>> different products. I am not bothered on the variants a gene can produce
>>> through alternate splicing.
>>>
>>> I tried KnownCanonical Table but I guess it does not focus on all exons.
>>>
>>> Thus where can I get information on coding exons of all genes in the
>> human
>>> genome.
>>>
>>> Awaiting Your reply
>>>
>>> Thanks
>>> siddarth
>>> _______________________________________________
>>> Genome maillist  -  Genome at soe.ucsc.edu
>>> http://www.soe.ucsc.edu/mailman/listinfo/genome
> _______________________________________________
> Genome maillist  -  Genome at soe.ucsc.edu
> http://www.soe.ucsc.edu/mailman/listinfo/genome


More information about the Genome mailing list