[Genome] non-redundant set of refseq or knownGenes exons

Khrebtukova, Irina ikhrebtukova at illumina.com
Tue Aug 7 16:44:10 PDT 2007


Hi Archana,

my problem is that I do know now how to make a custom track and how to
get a sequence from custom track. I was wondering (and I guess it's now
a FAQ for you guys) how to get NON_REDUNDANT set of exons? Preferably of
course dealing with overlapping exons too. Like selecting largest exon
of all overlapping.

ok, I guess it's still not that easy solution. But my prediction would
be that more and more people would ask this question...

thanks! and of course I just LOVE your browser! really it's the best
despite of my minor desires of making even better.

Irina

-----Original Message-----
From: Archana Thakkapallayil [mailto:archanat at soe.ucsc.edu] 
Sent: Tuesday, August 07, 2007 4:16 PM
To: Khrebtukova, Irina
Cc: genome at soe.ucsc.edu
Subject: Re: [Genome] non-redundant set of refseq or knownGenes exons

Hello Irina,

One of my colleagues pointed out to me that you could get this
information by first making a Custom Track using the Table Browser of
all the KnownGene exons. You could then output your custom track in a
BED file and then do some PERL programming to get the unique exons.

1. Creating a custom track of exons.

Set the following options in the Table Browser:
   clade: Vertebrate
   genome: Human
   assembly: Mar 2006
   group: Genes and Gene Prediction Tracks
   track:  UCSC Genes
   table: knownGene
   region: genome
   output format: custom track
   Click "get output". On the next page, select the radio button for 
"Coding Exons" and press "get custom track in table     browser". You 
now have a custom track of the exons of the Known Genes.

2. Back on the Table Browser main page, now choose "Custom Tracks" as
the group and the corresponding track and table. Then choose 'output
format: BED' and hit 'get output'. On this page select the radio button
for 'Create one BED record per: Whole Gene" and press "get BED". This
gives you the coordinates of all the coding exons in the BED format. 
More information on using the Table Browser is here:
http://genome.ucsc.edu/goldenPath/help/hgTablesHelp.html

I hope that this helps you. Please let us know if you have further
questions.

Regards,

Archana
UCSC Genome Bioinformatics Group


Khrebtukova, Irina wrote:
> Is there an easy way to get non-redundant set of all refseq and/or 
> knownGenes exons? I mean if the same exon is shared by two or more 
> transcripts it is redundant.
>
> I think this or similar question was already asked. Just can not find 
> the answer.
>
> thanks!
>
> Irina Khrebtukova, PhD
> Sr. Staff Bioinformatics Scientist
> Illumina Inc.
> 25861 Industrial Blvd.,
> Hayward, CA 94545
> ph: 510-723-9219
> ikhrebtukova at illumina.com
>  
>
> _______________________________________________
> Genome maillist  -  Genome at soe.ucsc.edu 
> http://www.soe.ucsc.edu/mailman/listinfo/genome
>   




More information about the Genome mailing list