[Genome] obtaining multiz17way alignments batch wise
Rachel Harte
hartera at soe.ucsc.edu
Tue Oct 10 15:24:43 PDT 2006
Robert.
You can obtain the MAF format file through the Table Browser for all the
species in the multiz17way alignment starting with a list of human
coordinates. First you will need to make a custom track for the
human coordinates and then you will need to intersect the multiz17way
track with the custom track.
To create a custom track, click on the "add custom tracks" button on
the Gateway page or below the Browser image which will take you to here:
http://genome.ucsc.edu/cgi-bin/hgCustom?
Here there are instructions for making a custom track. The easiest way to
do it would be to make a BED format list of the human coordinates. BED
format is described here:
http://genome.ucsc.edu/FAQ/FAQformat#format1
Note that all start coordinates are 0-based for BED format so base
position 1 is 0.
Once you have created the track, you will be taken to a custom tracks
management page. Click on the button "access in table browser".
Now you are ready to do the Table Browser intersection:
1) On the Table Browser and select human and the assembly of interest.
2) Select Comparative Genomics as the group, Conservation as the track and
multiz17way as the table.
3) Make sure that you have genome selected as the region.
4) Click on the create button for intersection and select Custom Tracks as
the group and select the custom track that you created.
Also select "Base-pair-wise intersection (AND) of Conservation and User
Track". This will make sure that you get only the regions of the
alignments that are in the regions defined in your custom track. The
Press submit.
5) Select MAF as the output format and press "get output".
The result should be those regions of the multiz17way alignment that
are in the regions specified by the set of human coordinates in your
custom track.
I hope that this helps you. Please let us know if you have further
questions.
Rachel
On Tue, 10 Oct 2006, Robert
Querfurth wrote:
> Hello,
>
> is it possible via the Table Browser to obtain, starting from a list of
> human coordinates, the orthologous sequences in fasta or MAF format for
> all available species?
> and if so, how.
>
> thank you very much,
> Robert Querfurth
>
> ---
> Department for Vertebrate Genomics
> Max Planck Institute for Molecular Genetics
> Ihnestrasse 73
> 14195 Berlin
> Germany
>
> +49 30 8413 1225
> _______________________________________________
> Genome maillist - Genome at soe.ucsc.edu
> http://www.soe.ucsc.edu/mailman/listinfo/genome
>
--
Rachel Harte
UCSC Genome Bioinformatics Group
http://genome.ucsc.edu
More information about the Genome
mailing list