[Genome] Download first intron sequences

Brooke Rhead rhead at soe.ucsc.edu
Wed Mar 7 11:52:32 PST 2007


Hello Celina,

One of our engineers has highlighted characteristics of the output from 
the Table Browser that you can use to distinguish intron number:

=====
Embedded in the very long name assigned to each intron is a sequence number:

NM_016749_intron_5_0_chr1_136014281_f
                  ^

The number just after "intron" is the sequence number which starts at
0 *for the lowest coordinates*, so she also needs to consider strand,
which is the last character -- 'f' for forward, 'r' for reverse.  If
the strand is r then she should use the intron with the highest
sequence number instead to get the first intron in the transcript.

The other numbers in the name after the intron number are the padding 
amount, the chrom, the 1-based start coordinate in case the item names 
are not unique, then the strand.
=====

I hope this helps you get the information you need.  Please let us know 
if you have any further questions.

--
Brooke Rhead
UCSC Genome Bioinformatics Group



Montemayor, Celina wrote:
> Hi, I was wondering if I could get some advice regarding how to
> download the coordinates of the frist introns of my list of genes (I
> need a BED file)? I used the table browser to paste my gene IDs and
> download intron sequences; the output I obtained looks like this:
> 
> chr1	 136014280	 136014811
> NM_016749_intron_5_0_chr1_136014281_f
> 
> Is there a way for me discriminate which are the coordinates
> corresponding to the first introns?
> 
> Thanks for your help,
> 
> Celina Montemayor, MD  Graduate Student, Pereira lab Baylor College of
> Medicine _______________________________________________ Genome
> maillist  -  Genome at soe.ucsc.edu 
> http://www.soe.ucsc.edu/mailman/listinfo/genome


More information about the Genome mailing list