[Genome] Questions on update, dumps and SNPs of hg18 data subsets ---knownGene*, refGene* and SNP

Fangcheng Gong Fangcheng.Gong at sial.com
Wed Feb 14 11:26:37 PST 2007


Hi Heather,

Thank you very much for your effort.  Yes, you're right, I'm looking at 
single base substitutions. The coordinates should be relative to the start 
of each specific transcript instead of gene as some genes have multiple 
transcripts, and the coordinates of the same SNPs on individual 
transcripts might be different due to the splicing. 

Regards,
Fangcheng




Heather Trumbower <heather at soe.ucsc.edu> 
02/14/2007 12:55 PM

To
Fangcheng Gong <Fangcheng.Gong at sial.com>
cc
genome at soe.ucsc.edu
Subject
Re: [Genome] Questions on update, dumps and SNPs of hg18 data subsets 
---knownGene*, refGene* and SNP






Thanks.

What do you mean by type of SNP?  Functional assignment?
For example: coding-synon, coding-nonsynon, untranslated, intron.

I am assuming you are refering to single base substitutions only.

Let's look at a specific example.

chr22:29,807,250-29,807,500 in hg18 has 2 SNPs in the initial 
part of the 5' UTR of SMTN.   They are single base substitutions 
at rs9609221 at position 29807313 and rs11089487 at position 29807360.

Do you want the SNP specified in absolute coordinates, or relative to the 
start of the gene annotation?

Heather



On Wed, 14 Feb 2007, Fangcheng Gong wrote:

> Hi Heather,
>
> Transcript SNP data is cSNP.  I consider the SNPs, mapped onto the 
RefSeq
> or other type of transcripts, as transcript SNP.  The transcript SNP
> should consists of the transcript accession, SNP coordinates on the
> transcript, and type of SNPs.
>
> Thanks,
> Fangcheng
>
>
>
>
> Heather Trumbower <heather at soe.ucsc.edu>
> 02/14/2007 12:00 PM
>
> To
> Fangcheng Gong <Fangcheng.Gong at sial.com>
> cc
> genome at soe.ucsc.edu
> Subject
> Re: [Genome] Questions on update, dumps and SNPs of hg18 data subsets
> ---knownGene*, refGene* and SNP
>
>
>
>
>
>
> Fangcheng:
>
> Could you clarify what you mean by transcript SNP data?
>
> Thanks.
>
> Heather Trumbower
> UCSC Genome Bioinformatics Group
>
>
> On Wed, 14 Feb 2007, Fangcheng Gong wrote:
>
>> Dear UCSC colleagues,
>>
>> 1) The following data files haven't updated for long time.  Could you
>> update them at your earliest convenience?
>>
>> Directory: ftp://hgdownload.cse.ucsc.edu/goldenPath/hg18/database/
>>
>> knownGene.txt.gz        (the last update 04/13/2006)
>> knownGeneMrna.txt.gz    (the last update 04/13/2006)
>>
>> 2) Could you upload the dumps for the following files at the same
>> directory?
>>
>> refGeneMrna.sql
>> refGeneMrna.txt.gz
>>
>> 3) Do you have transcript SNP data for knownGene and refGene?  If yes,
> is
>> it possible to upload their dump at this FTP site?
>>
>>
>> Thank you very much for your time and effort.  I'm eager to hear your
>> reply.
>>
>> Regards
>> Fangcheng Gong
>>
>> _______________________________________
>> Fangcheng Gong, Ph.D.
>> Principal R&D Scientist, Bioinformatics
>>
>> Sigma-Aldrich Corporation
>> 2909 Laclede Ave.
>> St. Louis, MO 63103
>>
>> Phone:  314-289-8496 x 4464
>>        877-472-2192 x 4464
>> Fax:    314-286-7645
>> Email:  Fangcheng.Gong at sial.com
>> _______________________________________
>> _______________________________________________
>> Genome maillist  -  Genome at soe.ucsc.edu
>> http://www.soe.ucsc.edu/mailman/listinfo/genome
>>
>
>



More information about the Genome mailing list