[Genome] Difference between Genome Browser and Table Browser output
Brooke Rhead
rhead at soe.ucsc.edu
Fri Sep 14 16:23:02 PDT 2007
Hello Ko,
You did not do anything wrong, there is just a difference in formats in
these two locations. The Table Browser outputs conservation data in MAF
format, described here:
http://genome.ucsc.edu/FAQ/FAQformat#format5
The numbering of coordinates on the negative strand in MAF files is
relative to the other end of the chromosome (that is, it is relative to
the reverse-complement of the positive strand). So, to calculate Genome
Browser coordinates from MAF coordinates, you need to subtract each
start that occurs on the negative strand from the total chromosome size.
For example, the first conservation track output line for rat in the
Table Browser (in the human (hg17) region chr12:64912201-64912645) looks
like this:
s rn3.chr7 83636219 54 - 143082968 TCATT...
Which corresponds to:
source: rn3.chr7
start: 83636219
size: 54
strand: -
source size: 143082968
To convert these numbers to Genome Browser coordinates, first subtract
the start from the source size:
143082968 - 83636219 = 59446749
This is the right-most coordinate displayed in the browser. To get the
left-most coordinate, subtract 54 from this number:
59446749 - 54 = 59446695
This is almost the exact same coordinate you see in the Genome Browser.
However, because MAF coordinates are zero-based and Browser
coordinates are one-based, you need to add 1 to the start coordinate, so
the final coordinate range becomes:
chr7:59446696-59446749
which matches the first alignment block in the shown in the Genome Browser.
I hope this explanation helps. If you have further questions, please do
not hesitate to contact us again.
--
Brooke Rhead
UCSC Genome Bioinformatics Group
Pang Ko wrote:
> Hi,
>
> I am interested in the location chr12:64912201-64912645 in human genome (hg17)
>
> I am using the Genome Browser with
> Genome: Human
> Assembly: May 2004
> and choose Track Conservation to be full. Then I click on the
> conservation track and then choose rat in the multiple alignment, it
> will bring me to rat genome location chr7:59446696-59,444,569 (rn3).
>
> When I use Table Browser and choose
> Genome: Human
> Assembly: May 2004
> Group: Comparative Genomics
> Track: Conservation
> table: multiz17way
> Position: chr12:64912201-64912645
>
> The output of table browser shows an alignment with rn3 location
> chr7:83636219-83638448
>
> My question is why are the two output different? (one shows
> alignment with chr7:59446696-59444569 and the other shows
> chr7:83636219-83638448 in the same assembly) Did I do something wrong?
>
> Thank you,
> Ko.
> _______________________________________________
> Genome maillist - Genome at soe.ucsc.edu
> http://www.soe.ucsc.edu/mailman/listinfo/genome
More information about the Genome
mailing list