[Genome] How to get snps locate between the start and end chromosomal location of each gene with Entrez GeneID

Gao, Shouguo sgao at mcw.edu
Thu Jan 3 06:34:40 PST 2008


It is really useful to connect to mySQL server. Now I have a large snp list, and I am looking for the snps related to a gene list (Entrez Geneid), which locate between 1000 upstream and 500 downstream of genes. The most important thing is to find the start and end chromosomal location of each gene with Entrez GeneID. I used knowngene and knownToLocusLink tables, but got several locations for one Entrez GeneID (Locuslink). 

 

name    locuslink           chro     strand   start      end

uc001aab.1      375690            chr1     -           4558    14764

uc001aac.1      375690            chr1     -           4558    19346

uc001aad.1      375690            chr1     -           4558    7231

uc001aae.1      375690            chr1     -           4558    9622

uc001aaf.1       375690            chr1     -           4832    19672

uc001aag.1      375690            chr1     -           5658    7231

uc001aah.1      375690            chr1     -           6720    19346

uc001aai.1       375690            chr1     -           6720    9622

 

The reason should be that the same Entrez Gene ID matches multiple UCSC known genes (transcript).

 

In NCBI one Entrez GeneID only has one start and end location. Could you please tell me how to solve this issue? Is there any other table for that?

 

Thanks

 

Shouguo




More information about the Genome mailing list