[Genome] Evofold seqs conserved up to mouse?

Archana Thakkapallayil archanat at soe.ucsc.edu
Tue Apr 24 12:54:32 PDT 2007


Hello Peter,

Unfortunately, there is no direct way to get this information from our 
Browser. However, you could do an intersection between the multiz17way 
table and the evofold table on hg17, using the Table Browser and then 
then write your own program to pull out the information that you are 
looking for from the output. The evofold track was originally created on 
hg17 and was lifted to hg16 and hg18. So I suggest you to do the 
intersection from hg17. To get to this, make the following selections in 
the Table browser:

clade: vertebrate
genome: human
assembly: May 2004
group: Comparative Genomics
track: Conservation
table: multiz17way
select "genome" radio button.
click on "intersection" create button
On this page select "group: Genes and Gene prediction Tracks"; "track: 
EvoFold"; "table: evofold" and then choose "Base-pair-wise intersection 
(AND) of Conservation and EvoFold" and hit "submit" button.
Back on the main page select output format "MAF-multiple alignment 
format" and click on "get output".

The resulting output is the maf format of all 17 species used in the 
Conservation track, that have an intersection with the evofold track. 
You may want to write your own program to pull out the sequences for 
human and mouse.

Also, please note that there are cases where the evofold item is split 
across two alignment blocks in the maf file. For example, the sequence 
of evofold item: evofold: Item: 2245011_0_+_102 (Position: 
chr7:127401559-127401610) on hg17 and mm7 are:

hg17.chr7   TGTGTTACTATTTCGGGCGAGCCAGTGTTTCCAGGGGAAACCTAATGACGCA
mm7         TGTGTTACTATTTCGGGCGAGCCAGTGTTTCCAGGGGAAACCTAATGACGCA

But, when you get the output from the Table browser, this is actually 
split into two parts as follows:
s hg17.chr7               127401558 33 + 158628139 
TGTGTTACTATTTCGGGCGAGCCAGTGTTTCCA
s mm7.chr6                 28913543 33 + 149646834 
TGTGTTACTATTTCGGGCGAGCCAGTGTTTCCA
=======================================
s hg17.chr7               127401591 19 + 158628139 GGGGAAACCTAATGACGCA
s mm7.chr6                 28913576 19 + 149646834 GGGGAAACCTAATGACGCA

You can also get the details of the multiple species alignments and RNA 
secondary structure annotation by clicking on the individual evofold 
item from the browser.

I hope this information helps you. Please be sure to write back if you 
need further information.

Regards,

Archana
UCSC Genome Bioinformatics Group

 
Pete Shepard wrote:
> Dear UCSC Browser Folks,
>
> I would like to export a table of Evofold human sequences that are conserved
> in mouse. Can you tell me how to go about this?
>
> Thanks,
>
> Peter
> _______________________________________________
> Genome maillist  -  Genome at soe.ucsc.edu
> http://www.soe.ucsc.edu/mailman/listinfo/genome
>   


More information about the Genome mailing list