[Genome] Multiple genome alignment
Brooke Rhead
rhead at soe.ucsc.edu
Wed Oct 10 19:15:07 PDT 2007
Hello Jyoti,
Sequence is not actually stored in the chain and net tables, so the
Table Browser can't retrieve that information from the mouse chain
track. Also, when you click on the mouse chain track in the Genome
Browser, information about only the current region selected in the
Genome Browser window is displayed; this type of info also cannot be
retrieved from the mouse chain track in the Table Browser.
Another solution is to use the "Conservation" track, and the table
'multiz28way' (for hg18). The table contains pointers to a multiple
alignment file (described here:
http://genome.ucsc.edu/FAQ/FAQformat#format5 -- be sure to note the way
coordinates on the negative strand are handled), which contains
sequences for all 28 genomes in the alignment. The Table Browser has a
special output format available for this table: "MAF - multiple
alignment format". This output format returns the aligned sequences the
region you specified in the Table Browser.
The multiple alignment in the conservation track is based on the chain
and net tracks. Click on the "Conservation" track title above the track
control to read about the methods used to create the track.
If you would like to parse out sequence for only certain organisms from
the MAF, there is a set of tools run by Penn State that can do this,
located here:
http://main.g2.bx.psu.edu/
Look under the "Fetch Alignments" link on the left-hand side of the page.
I hope this helps. If you have further questions, please feel free to
contact us again.
--
Brooke Rhead
UCSC Genome Bioinformatics Group
Shah, Jyoti K. wrote:
> Dear UCSC team,
>
> I have a question regarding your conservation track. I want to extract
> the genomic region in mouse, rat and other species which align with the
> corresponding genomic region in human species.
>
> When I use the human genome browser and Mouse chain track, I can see the
> chromosome and coordinates in mouse species for certain human genomic
> region. But I am unable to get the same information when I try the table
> browser. Ideally, I would like to have a table which gives the following
> fields:
>
> Human chr
> Human genomic start
> Human genomic end
> Human sequence
> Mouse chr
> Mouse genomic start
> Mouse genomic end
> Mouse sequence
>
> Is it possible to extract the above using your table browser? It would
> be great if you could help me with this.
>
> Thanks
> Jyoti
>
>
> ------------------------------------------------------------------------------
> Notice: This e-mail message, together with any attachments, contains
> information of Merck & Co., Inc. (One Merck Drive, Whitehouse Station,
> New Jersey, USA 08889), and/or its affiliates (which may be known
> outside the United States as Merck Frosst, Merck Sharp & Dohme or MSD
> and in Japan, as Banyu - direct contact information for affiliates is
> available at http://www.merck.com/contact/contacts.html) that may be
> confidential, proprietary copyrighted and/or legally privileged. It is
> intended solely for the use of the individual or entity named on this
> message. If you are not the intended recipient, and have received this
> message in error, please notify us immediately by reply e-mail and then
> delete it from your system.
>
> ------------------------------------------------------------------------------
> _______________________________________________
> Genome maillist - Genome at soe.ucsc.edu
> http://www.soe.ucsc.edu/mailman/listinfo/genome
More information about the Genome
mailing list