[Genome] Not the same assemblies in the browser and in the downloadable sequence files ?
Brooke Rhead
rhead at soe.ucsc.edu
Fri Mar 9 13:13:53 PST 2007
Hello Herve,
I tried downloading the dm2 fasta sequence located at:
http://hgdownload.cse.ucsc.edu/goldenPath/dm2/bigZips/chromFa.zip
I unzipped the chromFa.zip file and looked at chrX.fa. The first line
of that file looks consistent with the sequence in the Browser:
>chrX
tagagcaaaaaatagacattttaatggcgctaatcatacaaggaaggaat
Which is not the same as what you are seeing. As far as I can tell,
everything is correct on our download site. I suggest that you
make a new empty directory and re-download and unzip the chromFa.zip
file there. If you still see the incorrect sequence, please let us know.
--
Brooke Rhead
UCSC Genome Bioinformatics Group
Herve Seitz wrote:
> Dear developers,
>
> I noticed a difference between the genomic sequences for Drosophila
> melanogaster chromosome X on the browser and in the sequence file
> that I downloaded from http://hgdownload.cse.ucsc.edu/goldenPath/dm2/
> bigZips/ :
> according to the browser, the first 30 nucleotides of melanogaster
> chromosome X are: TAGAGCAAAAAATAGACATTTTAATGGCGC (cf http://
> genome.ucsc.edu/cgi-bin/hgTracks?
> hgsid=87534567&hgt.dummyEnterButton.x=0&hgt.dummyEnterButton.y=0&clade=i
> nsect&org=D.+melanogaster&db=dm2&position=chrX%
> 3A1-30&pix=620&hgsid=87534567 )
> but according to chrX.fa (in the archive chromFa.zip, downloaded from
> http://hgdownload.cse.ucsc.edu/goldenPath/dm2/bigZips/ ), the first
> 30 nucleotides of that chromosome are: CAACATTAGCGCCATGCCCACTGTGGGGAA.
>
> When I search this sequence in the "browser" version of the
> chromosome (with BLAT), I can see that it starts on bp 6547 of
> chromosome X; reciprocally, when I look for
> TAGAGCAAAAAATAGACATTTTAATGGCGC in chrX.fa, I find it on the 164th
> line of the file (hence, at position ~8100).
>
> So it seems that the downloadable sequence for D. melanogaster
> chromosome X, and the browser version, do not result from the same
> assembly. Is is possible to download the "browser" version of the
> assembly ? It seems that chromosome X is not the only one - chr2R
> also differs between the downloadable file and the browser version ;
> the beginning of chr2L seems to be identical in both versions, but
> the total length is different: 22,407,834 bp according to the
> browser, and 23,471,775 in chr2L.fa.
>
> Thanks in advance,
>
> Hervé
>
> --
> Hervé Seitz, PhD
> Department of Biochemistry and Molecular Pharmacology
> University of Massachusetts Medical School
> Lazare Research Building, Room 870P
> 364 Plantation Street
> Worcester, MA 01605-2324
> USA
> telephone: (508) 856 1220
>
>
>
> _______________________________________________
> Genome maillist - Genome at soe.ucsc.edu
> http://www.soe.ucsc.edu/mailman/listinfo/genome
More information about the Genome
mailing list