[Genome] Not the same assemblies in the browser and in the downloadable sequence files ?
Herve Seitz
herve.seitz at umassmed.edu
Wed Mar 7 16:20:28 PST 2007
Dear developers,
I noticed a difference between the genomic sequences for Drosophila
melanogaster chromosome X on the browser and in the sequence file
that I downloaded from http://hgdownload.cse.ucsc.edu/goldenPath/dm2/bigZips/ :
according to the browser, the first 30 nucleotides of melanogaster
chromosome X are: TAGAGCAAAAAATAGACATTTTAATGGCGC (cf.
http://genome.ucsc.edu/cgi-bin/hgTracks?hgsid=87534567&hgt.dummyEnterButton.x=0&hgt.dummyEnterButton.y=0&clade=insect&org=D.+melanogaster&db=dm2&position=chrX%3A1-30&pix=620&hgsid=87534567 )
but according to chrX.fa (in the archive chromFa.zip, downloaded from
http://hgdownload.cse.ucsc.edu/goldenPath/dm2/bigZips/ ), the first
30 nucleotides of that chromosome are: CAACATTAGCGCCATGCCCACTGTGGGGAA.
When I search for this sequence in the "browser" version of the
chromosome (with BLAT), I can see that it starts at bp 6547 of
chromosome X; conversely, when I look for
TAGAGCAAAAAATAGACATTTTAATGGCGC in chrX.fa, I find it on the 164th
line of the file (hence, at position ~8100).
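The two coordinates above can be cross-checked with a short script. Below is a minimal Python sketch, assuming chrX.fa holds a single record and wraps its sequence at 50 bases per line (an assumption, though it is consistent with line 164 corresponding to position ~8100). The function names are only illustrative, and the demo sequence is made up.

```python
import io


def read_first_record(handle):
    """Return the concatenated sequence of the first FASTA record in a stream."""
    parts = []
    seen_header = False
    for line in handle:
        line = line.strip()
        if line.startswith(">"):
            if seen_header:
                break  # a second header means the first record is complete
            seen_header = True
            continue
        if line:
            parts.append(line)
    return "".join(parts)


def find_1based(sequence, query):
    """Return the 1-based start of the first forward-strand match, or None."""
    idx = sequence.upper().find(query.upper())
    return idx + 1 if idx >= 0 else None


def line_to_position(line_number, width=50):
    """1-based position of the first base on a given file line.

    Assumes line 1 is the '>' header and every sequence line holds
    `width` bases (50 is an assumption about the file's wrapping).
    """
    return (line_number - 2) * width + 1


# Tiny in-memory stand-in for chrX.fa (made-up sequence, not real data).
demo = io.StringIO(">chrX\nACGTACGTAC\nGGGTTTAAAC\n")
seq = read_first_record(demo)
print(find_1based(seq, "gggttt"))  # 11: the match begins on the second line
print(line_to_position(164))       # 8101
```

With the real file, one would pass `open("chrX.fa")` instead of the in-memory example. The search is case-insensitive because sequence files may use lowercase for some regions; it only looks at the forward strand.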
So it seems that the downloadable sequence for D. melanogaster
chromosome X and the browser version do not come from the same
assembly. Is it possible to download the "browser" version of the
assembly? Chromosome X does not seem to be the only one affected:
chr2R also differs between the downloadable file and the browser
version; the beginning of chr2L seems to be identical in both
versions, but the total length is different: 22,407,834 bp according
to the browser, and 23,471,775 bp in chr2L.fa.
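The length of a downloaded file can be checked independently of the browser. A minimal Python sketch, assuming a plain (uncompressed) FASTA file; `fasta_lengths` is an illustrative name, not a UCSC tool, and the demo records are made up.

```python
import io


def fasta_lengths(handle):
    """Return {record name: number of bases} for every record in a FASTA stream."""
    lengths = {}
    name = None
    for line in handle:
        line = line.strip()
        if line.startswith(">"):
            # Use the first word after '>' as the record name.
            fields = line[1:].split()
            name = fields[0] if fields else ""
            lengths[name] = 0
        elif line and name is not None:
            lengths[name] += len(line)
    return lengths


# Tiny in-memory stand-in for a downloaded file (made-up data).
demo = io.StringIO(">chr2L\nACGT\nAC\n>chrX some description\nGG\n")
print(fasta_lengths(demo))  # {'chr2L': 6, 'chrX': 2}
```

With the real file, `fasta_lengths(open("chr2L.fa"))` would give a count that can be compared directly with the length shown by the browser.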
Thanks in advance,
Hervé
--
Hervé Seitz, PhD
Department of Biochemistry and Molecular Pharmacology
University of Massachusetts Medical School
Lazare Research Building, Room 870P
364 Plantation Street
Worcester, MA 01605-2324
USA
telephone: (508) 856 1220