[Genome] downloaded exons
Angie Hinrichs
angie at soe.ucsc.edu
Thu Mar 6 15:54:47 PST 2008
Hi Chris,
Here's a small addition to Kayla's explanation. You might be
wondering about the extra _0 following the exon number:
> 2. exon_0_0 is the number of the exon. You'll note that the next exon
> is labeled "exon_1_0".
-- I was! I had to look it up in the code. It is the number of extra
bases appended to each end of the exon. In your case (and in most
cases I think), 0 bases were appended so you're getting just the
exons.
It comes from the Table Browser's "Output track as BED" options page:
Create one BED record per:
...
(*) Exons plus [0] bases at each end
Hope that helps,
Angie
On Thu, 6 Mar 2008, Kayla Smith wrote:
>
> Hello Chris,
>
> This is best described with an example. I shall choose the "CITED2"
> gene on the hg18 assembly. For your reference, this gene is at this
> position: chr6:139,735,090-139,737,478. Please note that this gene is
> on the reverse strand, which you can see from the <<< arrows on the
> display of the gene.
>
> Once I have this gene in view in the Genome Browser, I click on "Tables"
> to go to the Table Browser. I use the following settings:
>
> clade: Vertebrate; genome: Human; assembly: Mar. 2006
> group: Genes and Gene Prediction Tracks; track: UCSC Genes;
> table: knownGene; position: chr6:139,735,090-139,737,478
> output format: BED.
>
> Click "Get output" and on the next page select the radio button next to
> "Exons", and click "Get BED".
>
> Here are the 2 lines of output:
>
> chr6 139735089 139736782 uc003qip.1_exon_0_0_chr6_139735090_r
> 0 -
> chr6 139737242 139737478 uc003qip.1_exon_1_0_chr6_139737243_r
> 0 -
>
>
> Looking at: uc003qip.1_exon_0_0_chr6_139735090_r
>
> Here we have:
>
> 1. uc003qip.1 is the name of the UCSC gene
> 2. exon_0_0 is the number of the exon. You'll note that the next exon
> is labeled "exon_1_0".
> 3. chr6_139735090 is the starting position of that exon
> 4. r means "reverse strand"
>
> I hope this information is helpful to you. Please don't hesitate to
> contact us again if you require further assistance.
>
> Kayla Smith
> UCSC Genome Bioinformatics Group
>
>
>
> Chris Zaleski wrote:
> > Greetings,
> >
> > I have a question about downloaded BED files from the table browser. I've
> > chosen a 'Gene' track, BED output format, and then Exons (from the 2nd
> > page). An example 'name' field looks like the following:
> >
> > NM_152486_exon_0_0_chr1_850984_f
> >
> > I understand most, but not all, of the tokens in this string. Could you
> > please explain what all the items represent?
> >
> > Thanks very much,
> > Chris Zaleski
> > Bioinformatics Team Lead
> > Tenenbaum Lab - SUNY Albany
> > _______________________________________________
> > Genome maillist - Genome at soe.ucsc.edu
> > http://www.soe.ucsc.edu/mailman/listinfo/genome
>
> _______________________________________________
> Genome maillist - Genome at soe.ucsc.edu
> http://www.soe.ucsc.edu/mailman/listinfo/genome
>
--
angie at soe.ucsc.edu
Software Developer, UCSC CBSE / Genome Bioinformatics Group
More information about the Genome
mailing list