[Genome] are the "minus" strand in the "gete data" results from the galaxy browser reverse-complimented?
Kayla Smith
kayla at soe.ucsc.edu
Mon Dec 4 16:40:08 PST 2006
Dominik,
First, there are two ways to get sequence information from our browser.
The first is putting in genomic co-ordinates and clicking the "get
DNA" button. This will always give you genomic DNA, which is positive
strand. The second way is through the table browser, via the position
information from a positional table (in this case the knownGenes table).
In this case, the data has the strand information attached to the
position, and therefore, in the case of a gene on the negative strand,
the sequence returned will be relative to that gene, or, in other words,
the reverse complement of the genomic DNA. It appears that you have
proved this to yourself from your second email.
As for your second question, let me see if I understand you clearly:
You have two bed files, one which has strand information and one which
does not. You would like to intersect these two bed files, and you are
concerned that the result of the intersection may not have strand
information in it. Intersections in the table browser do not take
strand information into consideration. You would get the same output
whether there is strand information or not.
If you are converting a BED file to FASTA format, you will get genomic
sequence if the BED file does not have strand information in it.
Likewise, if your BED file has strand information, you will get
reverse-complemented-from-the-genomic sequence, for items marked
negative. I'm not sure exactly what you will consider proper output for
your data analysis, but I hope I have explained the functionality of our
tools in a way that makes sense to you.
Please don't hesitate to contact us again if you require more assistance
with the Genome Browser.
Also, if you have any Galaxy-specific questions, they have their own
mailing list at galaxy-user at bx.psu.edu
Kayla Smith
UCSC Genome Bioinformatics Group.
Dominik Margraf wrote:
> I have done a BLAST search on one of the FASTA sequences in the "minus
> strand" and I found that it is NOT automatically reverse-complimented
> (it actually matches the minus strand of the DNA rather than the plus
> strand).
>
> However as mentioned in the last email, before I extract the FASTA
> sequences from the BED interval format, I intend to intersect that
> data with some other tasks (e.g. phastcon17way) which does not have
> strand info. Then in this case will the intersection automatically
> deal with the plus vs minus strand and is it okay just feed my BED
> file for processing, then convert the resultant BED to fasta and
> reverse compliment the negative strands of the FASTA?
>
> Thanks!
>
> Dominik
>
> 2006/12/5, Dominik Margraf <dominik.margraf at gmail.com>:
>> I am performing searches using the "get data" function from the Galaxy
>> browser (the known gene track) and the results are returned in BED
>> format, which I subsequently used to obtain the actual sequences in
>> FASTA format.
>>
>> However for the sequences in the "minus" strand, are these sequences
>> (in FASTA file) already reverse-complimented to the "plus strand"? If
>> so, then would it be okay to directly use the BED file to intersect
>> with other tracks (e.g. the phastcons17way track / BED) which do not
>> have strand information?
>>
>> Thanks!
>>
>> Dominik
>>
> _______________________________________________
> Genome maillist - Genome at soe.ucsc.edu
> http://www.soe.ucsc.edu/mailman/listinfo/genome
More information about the Genome
mailing list