[Genome] update of the UCSC data

Brooke Rhead rhead at soe.ucsc.edu
Tue Apr 8 12:34:02 PDT 2008


Hello Juliette,

You are correct: we do update our refGene and refLink tables nightly.
We get information for our tables from the NCBI records, searchable 
here: http://www.ncbi.nlm.nih.gov/sites/entrez?db=nucleotide .

In our refGene and refLink tables, we have three entries for the
INF2/C14orf151 genes (each with its own RefSeq "NM_" identifier):

INF2
RefSeq: NM_001031714.3
http://www.ncbi.nlm.nih.gov/entrez/viewer.fcgi?db=nuccore&id=149999377

INF2
RefSeq: NM_022489.3
http://www.ncbi.nlm.nih.gov/entrez/viewer.fcgi?db=nuccore&id=149999379

C14orf151
RefSeq: NM_032714.1
http://www.ncbi.nlm.nih.gov/entrez/viewer.fcgi?db=nuccore&id=14249315

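The third RefSeq Gene, NM_032714.1, is still listed at NCBI under the 
name "C14orf151" (look for the line: /gene="C14orf151" in the NCBI 
record). Because our tables are built from those records, we show 
C14orf151 as a separate gene name until that record changes at NCBI.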
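
To illustrate why two gene names show up, here is a minimal Python sketch 
that groups the three records listed above by gene name. The accessions 
and names are the ones quoted in this message; the function name 
group_by_gene_name is hypothetical, and this is not how our tables are 
actually built, only a way to see the effect.

```python
from collections import defaultdict

# The three records quoted in this message: (gene name, RefSeq accession).
RECORDS = [
    ("INF2", "NM_001031714.3"),
    ("INF2", "NM_022489.3"),
    ("C14orf151", "NM_032714.1"),
]


def group_by_gene_name(rows):
    """Map each gene name to the list of RefSeq accessions that carry it."""
    groups = defaultdict(list)
    for gene_name, accession in rows:
        groups[gene_name].append(accession)
    return dict(groups)


by_name = group_by_gene_name(RECORDS)

# Print one line per gene name with its accessions.
for gene_name, accessions in by_name.items():
    print(gene_name, ", ".join(accessions))
```

As long as one record carries the older name, any grouping by gene name 
yields two groups rather than one.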

I hope this explanation helps.  If you have further questions, please 
feel free to write back to the mailing list.

--
Brooke Rhead
UCSC Genome Bioinformatics Group


Juliette Aury Landas wrote:
> Good morning,
> 
> I have some questions about the update of the UCSC data.
> 
> You wrote me that you use the most recent RefSeq entry at NCBI, and you 
> update your refGene and refLink tables nightly for every assembly for 
> which you have a browser.
> 
> In the refFlat and refLink files I still find that C14orf151 and INF2 
> are 2 distinct genes with 2 GeneIDs. In other databanks such as NCBI, 
> C14orf151 is an alias (previous name) of INF2. I first thought that 
> the 2 databases (UCSC and NCBI) were not synchronized. That is why I 
> waited 10 days before comparing them again. But they still differ. Do 
> you know why?
> 
> Thanks in advance,
> Juliette
> 


