[Genome] a question about dbSNP126 in genome browser and dbSNP127 at NCBI

Brooke Rhead rhead at soe.ucsc.edu
Fri Aug 24 16:11:48 PDT 2007


Hello Wenzhong,

We actually did not do any additional annotation on the function of the 
SNPs -- these annotations are from NCBI.  (We did do some additional 
annotations, but not of the 'function' field.)  If you look at the 
snp126 details page, our NCBI data sources are listed.  (To get to the 
details page, click on the track name, "SNPs (126)," just above the 
track control in the Genome Browser).  Scroll down to the "Data Sources" 
section to see which files from NCBI we used for the various fields in 
the snp126 table.  Here is the pertinent detail from that page for the 
functional classification information:

Functional classification was obtained from 
b126_SNPContigLocusId_36_1.bcp.gz.

The UCSC annotations are also described on the track details page.  The 
table that corresponds to our annotations is called 'snp126Exceptions'. 
  There is another small hg18 table called 'snp126ExceptionDesc' that 
describes what each of the exceptions in the first table means.

I hope this information is helpful.  Please write back to this list if 
you have any further questions.

--
Brooke Rhead
UCSC Genome Bioinformatics Group



wenzhong wrote:
> Hello , 
> 
>  
> 
> Can I please ask a question about dbSNP126 included in UCSC genome browser?
> 
>  
> 
> We did a brief comparison of the validated coding SNPs  in the dbSNP126 of
> UCSC and the dbSNP127 at NCBI: the # of common SNPs - 83046 IDs; unique in
> db126 of UCSC - 10838 IDs, and unique in db127 of NCBI- 16993 IDs. We looked
> at the 10838 IDs unique in db126 of UCSC and apparently most (if not all of
> them) do not have an annotation on 'Function' at NCBI, so your team
> apparently has done additional annotations on the SNPs. Can I please ask if
> there is a reference on these annotations, and is it based on RefSeq or UCSC
> KG? 
> 
>  
> 
> Thanks very much for the help!
> 
>  
> 
> --------------------------------------------------------------------------
> 
> Here is some additional details,
> 
>  
> 
> validated coding SNPs in dbSNP127 at NCBI:
> 
> synon 28675
> 
> nonsynon 28051
> 
> utr   43281
> 
> total 100039 (synon and nonsyn has 168 overlaps)
> 
>  
> 
> validates coding SNPs in dbSNP126 in Genome Browser:
> 
> synon 25677 (25294 unique IDs: 24927 IDs have unique genome mapping, 367 IDs
> have been mapped to multiple genomic location)
> 
> nonsynon 25505 (24878 unqiue IDs: 24281 IDs have unique genome mapping, 597
> IDs have been mapped to multiple genomic location)
> 
> utr  44297 (43894 unique IDs: 43511 IDs have unique genome mapping, 383 IDs
> have been mapped to multiple genomic location) 
> 
> total 93884 (synon and nonsyn has 182 overlaps)
> 
>  
> 
> common in two: 83046 IDs
> 
> db126.ni.db127 10838
> 
> db127.ni.db126 16993
> 
> ------------------------------------
> 
>  
> 
> Best regards,
> 
>  
> 
> Wenzhong
> 
> _______________________________________________
> Genome maillist  -  Genome at soe.ucsc.edu
> http://www.soe.ucsc.edu/mailman/listinfo/genome


More information about the Genome mailing list