[Genome] CpG islands and TSS

Brooke Rhead rhead at soe.ucsc.edu
Mon Dec 10 18:24:22 PST 2007


Hello Ilya,

Information on CpG islands is in the table 'cpgIslandExt', which is 
available from the Table Browser (hit the "Tables" link in the blue bar 
at the top of the page and look under the Expression and Regulation 
group for the CpG Islands track), or from the downloads page:
http://hgdownload.cse.ucsc.edu/downloads.html (go to the appropriate 
assembly, and then click the "annotation database" link and look for the 
  file 'cpgIslandExt.txt.gz').

The island length is in the 'length' field of the table, the field 
'perCg' contains the percentage of the island that is C or G (and the 
field 'numCg' contains the count of C and G in the island), and the 
observed/expected ratio is in the field 'obsExp'.

Finding the distance to the nearest transcription start site will be 
more difficult.  Depending on the assembly you are using, there may or 
may not be a TSS track available.  If there is, you will need to use 
your own tools to find the nearest TSS to a CpG Island and to calculate 
the distance between them.  There might be a tool at the Galaxy web site 
(http://main.g2.bx.psu.edu/ ; this site is run by Penn State) that might 
be useful for this.

I hope this information helps.  If you have further questions, please 
feel free to contact us again at this mailing list address.

--
Brooke Rhead
UCSC Genome Bioinformatics Group


Ioschikhes, Ilya wrote:
> Hello,
>  
> Please let me know how could I get following information for known CpG
> islands:
>  
> Length;    C,G content;   Observed/Expected CpG ratio;   Distance from
> nearest TSS.
>  
> Thanks,
>  
> 
> Ilya Ioshikhes, Ph.D.
> 
> Assistant Professor
> 
> Department of Biomedical Informatics and
> 
> Department of Molecular & Cellular Biochemistry,
> 
> Associate Investigator 
> 
> Davis Heart and Lung Research Institute,
> Ohio State University
> 3172c Graves Hall
> 333 W. 10th Ave.
> Columbus, OH 43210
> TEL: +1 (614) 292-8929
> Fax: +1 (614) 688-6600
> E-mail: Ilya.Ioschikhes at osumc.edu <mailto:Ilya.Ioschikhes at osumc.edu> 
> 
> _______________________________________________
> Genome maillist  -  Genome at soe.ucsc.edu
> http://www.soe.ucsc.edu/mailman/listinfo/genome


More information about the Genome mailing list