[Genome] Custom Annotations locally

Brooke Rhead rhead at soe.ucsc.edu
Thu Jun 7 12:49:34 PDT 2007


Hi Vinayak,

I think I understand your question better now.  We don't actually have 
any XMs annotated in the Genome Browser.  One of our developers suggests 
using the N-SCAN or AUGUSTUS tracks (AUGUSTUS is only available on hg17) 
for high-quality gene predictions.  With those you can use the procedure 
I described.

If you specifically want the XMs, however, you can BLAT them (as you 
suggested in your original question).  You can then turn your BLAT 
results into a custom track in order to view the transcription starts 
and ends and fetch sequence.  To do this, set the BLAT output type to 
'psl no header', then copy and paste the row(s) of interest into the 
custom track input.  (Hit the 'add custom tracks' button.  PSL is one of 
the accepted file formats.)  If you do this with the mRNA sequence, you 
will see the transcribed region of the gene prediction.  You will need 
to repeat the process with the protein sequence to see the translated 
region of the gene prediction.

Once you have custom tracks of translated and transcribed regions, you 
can use the intersection tool in the Table Browser to get the 5' and 3' 
UTRs.  You can choose to output the results of the intersection as a 
third custom track, or just output the sequence directly from the 
intersection.

I hope this solution works for you.  Let us know if you need 
clarification on any of the above steps.

--
Brooke Rhead
UCSC Genome Bioinformatics Group



Vinayak Kulkarni wrote:
> Thank you for your email. How do I get annotations for predicted genes 
> (XM's)? You have all the validated genes there. Is there a way to get 
> the predicted genes too?
> Thanks again,
> Kind regards,
> Vinayak.
> 
> On 6/7/07, *Brooke Rhead* <rhead at soe.ucsc.edu 
> <mailto:rhead at soe.ucsc.edu>> wrote:
> 
>     Hello Vinayak,
> 
>     The Table Browser will allow you to extract sequence for 5' UTRs and 3'
>     UTRs in our gene prediction tracks.  Just choose that particular track
>     in the the Table Browser (the "tables" link at the top of the page),
>     enter the position of interest or some gene identifiers, and choose
>     "output format: sequence".  If you choose to output the genomic
>     sequence, you will see options for selecting the 5' or 3' UTRs, and for
>     adding bases upstream and downstream of the selected gene regions.
> 
>     More detailed information is available in our Table Browser User's
>     Guide:
> 
>     http://genome.ucsc.edu/goldenPath/help/hgTablesHelp.html
> 
>     Specifically, see the instruction on displaying sequence data:
> 
>     http://genome.ucsc.edu/goldenPath/help/hgTablesHelp.html#Sequence
>     <http://genome.ucsc.edu/goldenPath/help/hgTablesHelp.html#Sequence>
> 
>     I hope this answers your question.  If not, please feel free to write
>     back to this list.
> 
>     --
>     Brooke Rhead
>     UCSC Genome Bioinformatics Group
> 
> 
> 
>     Vinayak Kulkarni wrote:
>      > Hi,
>      > I wanted to provide each gene annotations such as it's 5' UTR, 3'
>     UTR, be
>      > able to pull out information upstream/downstream say Xkb as per the
>      > requirement. I know you guys have all this data on your browser
>     tables but I
>      > cannot find data annotated for predicted genes.
>      > So essentially want I want is a way to annotate for a given gene
>     (including
>      > predicted genes), the genomic region (BLAT like o/p) with introns
>     and exons
>      > and be able to extract the 5' & 3' UTR along with given length of
>     upstream
>      > or downstream sequences. Can you please advice?
>      > Thank you in advance,
>      > Kind regards,
>      > Vinayak.
>      >
>      >
> 
> 
> 
> 
> -- 
> --
> "Hesitating to act because the whole vision might  not be achieved, or 
> because others do not yet share it, is an attitude that  only hinders
> progress"... Mahatma Gandhi


More information about the Genome mailing list