[Genome] Finding alternative terminal exons .......

Rileen rileen at gmail.com
Sun Sep 9 12:48:36 PDT 2007


Hi,
       Thanks for that, I now have the list of all the exons in
UCSC genes with altFinish entries.

I just need to put this together with a table giving the info on gene
names/symbols, so that I know all the different transcripts of a
given gene, and can check for different terminal exons accordingly,
i.e something similar to the "refFlat" table for RefSeq data.

Thanks once again,
                                 Yours,
                                               Rileen

On 07/09/2007, Ann Zweig <ann at soe.ucsc.edu> wrote:
> Hello Rileen,
>
>         Thanks for the compliments on the browser.
>
>         You are correct that you are not going to find what you want directly
> from the altFinish items in the Alt Events track.  However, you can use
> that track as a starting point to get what you need.  Below I outline
> the steps you will need to take to find the ending exon for each transcript.
>
> 1. Make a custom track of the altFinish items from the Alt Events track.
>
> Navigate to the Table Browser ('Tables' from the top blue navigation
> bar) and create a custom track from the Alt Events track.  Be sure to
> configure the filter to include on altFinish items from the table:
> "name does match altFinish"
>
> Read more about creating custom tracks using the table browser here:
> http://genome.ucsc.edu/goldenPath/help/hgTablesHelp.html#CustomTrack
>
>
> 2. Intersect that custom track with the UCSC Known Gene track (to make a
> new CT).
>
> Again, using the Table Browser, intersect the UCSC Genes track with your
> custom track from step 1.  Create a new custom track.  This will be a
> track containing all UCSC Genes that have overlap with altFinish items.
>
> Read more about doing intersections with the Table Browser here:
> http://genome.ucsc.edu/goldenPath/help/hgTablesHelp.html#Intersection
>
>
> 3. Get the exons from the resulting track.
> Choose the resulting custom track from step 2 in the Table Browser.
> Choose BED as the output type.  From the next page, choose only the
> "Exons" button.  Press 'get BED'.  This will give you a list (in BED
> format) of each exon in each UCSC Gene from the custom track in step 2.
>
>
> 4. Mine the output for the location of the "last" exon.
> The BED output will have the following columns:
> chromosome
> chromStart
> chromEnd
> name
> score
> strand
>
> Here's an example of the 9 exons from one gene in the track:
>
> chr1    41748270        41749524        uc001cgz.1_exon_0_0_chr1_41748271_r     0       -
> chr1    41751073        41752008        uc001cgz.1_exon_1_0_chr1_41751074_r     0       -
> chr1    41756659        41756746        uc001cgz.1_exon_2_0_chr1_41756660_r     0       -
> chr1    41762992        41763168        uc001cgz.1_exon_3_0_chr1_41762993_r     0       -
> chr1    41813801        41813947        uc001cgz.1_exon_4_0_chr1_41813802_r     0       -
> chr1    41817994        41823576        uc001cgz.1_exon_5_0_chr1_41817995_r     0       -
> chr1    41867006        41867205        uc001cgz.1_exon_6_0_chr1_41867007_r     0       -
> chr1    41939173        41939253        uc001cgz.1_exon_7_0_chr1_41939174_r     0       -
> chr1    42156670        42156782        uc001cgz.1_exon_8_0_chr1_42156671_r     0       -
>
>         Here's how to interpret the name field:
>
> name: uc001cgz.1_exon_0_0_chr1_41748271_r
>
> uc001cgz.1      gene name
> exon            part of gene (all exon in your case)
> 0               exon number
> 0               score
> chr1            chromosome
> 41748271        chromStart
> r               strand (r = reverse, f = forward)
>
>         So, in the example above, there are 9 exons for gene uc001cgz.1.
> Because it is on the reverse strand, you want exon 0.  Conversely, if it
> were on the forward strand, you'd be looking for the exon with the
> highest number.
>
>         So, for this transcript, the location of the ending exon is:
> chr1:41748270-41749524
>
>
>         This should get you well on your way.  Be sure to write back to the
> list if you get stuck or need more direction.
>
>
> Regards,
>
> ----------
> Ann Zweig
> UCSC Genome Bioinformatics Group
> http://genome.ucsc.edu
>
>
>
> Rileen wrote:
> > Hi,
> >       Thanks once again for the great resource you provide :-)
> >
> > I've played around with the "knownAlt" track a bit, and was wondering
> > whether there's any simple way of deriving a list of alternative terminal
> > exons (ATEs) from it, or any other table/track. By this I mean instances
> > where the gene has transcripts ending in different exons, not merely
> > different positions in the same exon.
> >
> > For all the other events in the knownAlt track, the two positions seem
> > to provide useful information, but for the altFinish category, they always
> > seem to differ by 1.
> >
> > Why is this so?
> >
> > I was hoping that given the two positions, one cold check whether they
> > were in two
> > different exons to derive the list of ATEs as a subset of altFinish,
> > but that seems
> > wrong.
> >
> > Looking forward to your reply,
> >                                               Yours,
> >                                                              Rileen
> >
>


--
******************************************************************
"I know nothing, but i _know_ that."

Rileen Sinha                 rileen at yahoo.com
Personal Phone : (0049)3641412276   (cheaper to call)
                         (0049)17624078373
******************************************************************


-- 
******************************************************************
"I know nothing, but i _know_ that."

Rileen Sinha                 rileen at yahoo.com
Personal Phone : (0049)3641412276   (cheaper to call)
                         (0049)17624078373
******************************************************************


More information about the Genome mailing list