[Genome] non-redundant set of refseq or knownGenes exons

Archana Thakkapallayil archanat at soe.ucsc.edu
Tue Aug 7 16:15:54 PDT 2007


Hello Irina,

One of my colleagues pointed out to me that you could get this 
information by first making a Custom Track using the Table Browser of 
all the KnownGene exons. You could then output your custom track in a 
BED file and then do some PERL programming to get the unique exons.

1. Creating a custom track of exons.

Set the following options in the Table Browser:
   clade: Vertebrate
   genome: Human
   assembly: Mar 2006
   group: Genes and Gene Prediction Tracks
   track:  UCSC Genes
   table: knownGene
   region: genome
   output format: custom track
   Click "get output". On the next page, select the radio button for 
"Coding Exons" and press "get custom track in table     browser". You 
now have a custom track of the exons of the Known Genes.

2. Back on the Table Browser main page, now choose "Custom Tracks" as 
the group and the corresponding track and table. Then choose 'output 
format: BED' and hit 'get output'. On this page select the radio button 
for 'Create one BED record per: Whole Gene" and press "get BED". This 
gives you the coordinates of all the coding exons in the BED format. 
More information on using the Table Browser is here:
http://genome.ucsc.edu/goldenPath/help/hgTablesHelp.html

I hope that this helps you. Please let us know if you have further 
questions.

Regards,

Archana
UCSC Genome Bioinformatics Group


Khrebtukova, Irina wrote:
> Is there an easy way to get non-redundant set of all refseq and/or
> knownGenes exons? I mean if the same exon is shared by two or more
> transcripts it is redundant.
>
> I think this or similar question was already asked. Just can not find
> the answer.
>
> thanks!
>
> Irina Khrebtukova, PhD
> Sr. Staff Bioinformatics Scientist
> Illumina Inc.
> 25861 Industrial Blvd.,
> Hayward, CA 94545
> ph: 510-723-9219
> ikhrebtukova at illumina.com
>  
>
> _______________________________________________
> Genome maillist  -  Genome at soe.ucsc.edu
> http://www.soe.ucsc.edu/mailman/listinfo/genome
>   



More information about the Genome mailing list