[Genome] short mm8 chr1 and 2?

Rachel Harte hartera at soe.ucsc.edu
Mon Apr 30 18:59:49 PDT 2007


Hello Ben,

If you are downloading over http on Windows, the files may not be
downloading completely. Try using wget over ftp instead, e.g. for chr1:

wget 'ftp://hgdownload.cse.ucsc.edu/apache/htdocs/goldenPath/mm8/chromosomes/chr1.fa.gz' -O chr1.fa.gz

--18:54:24--  ftp://hgdownload.cse.ucsc.edu/apache/htdocs/goldenPath/mm8/chromosomes/chr1.fa.gz
           => `chr1.fa.gz'
Resolving hgdownload.cse.ucsc.edu... 128.114.119.140
Connecting to hgdownload.cse.ucsc.edu|128.114.119.140|:21... connected.
Logging in as anonymous ... Logged in!
==> SYST ... done.    ==> PWD ... done.
==> TYPE I ... done.  ==> CWD /apache/htdocs/goldenPath/mm8/chromosomes ... done.
==> PASV ... done.    ==> RETR chr1.fa.gz ... done.
Length: 62,819,741 (60M) (unauthoritative)

100%[====================================>] 62,819,741    13.69M/s    ETA 00:00

18:54:28 (14.16 MB/s) - `chr1.fa.gz' saved [62819741]

then checksum the downloaded file:
> sum chr1.fa.gz
52962 61348

For chr2, the same wget command with chr2.fa.gz gives:
--18:58:03--  ftp://hgdownload.cse.ucsc.edu/apache/htdocs/goldenPath/mm8/chromosomes/chr2.fa.gz
           => `chr2.fa.gz'
Resolving hgdownload.cse.ucsc.edu... 128.114.119.140
Connecting to hgdownload.cse.ucsc.edu|128.114.119.140|:21... connected.
Logging in as anonymous ... Logged in!
==> SYST ... done.    ==> PWD ... done.
==> TYPE I ... done.  ==> CWD /apache/htdocs/goldenPath/mm8/chromosomes ... done.
==> PASV ... done.    ==> RETR chr2.fa.gz ... done.
Length: 58,461,619 (56M) (unauthoritative)

100%[====================================>] 58,461,619    14.54M/s    ETA 00:00

18:58:07 (14.51 MB/s) - `chr2.fa.gz' saved [58461619]

then
> sum chr2.fa.gz
24876 57092

If the saved byte counts or the sum output on your side differ from those
shown above, then the file is not being downloaded completely.
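
If sum is not available (for example on Windows), a similar check can be
sketched in Python. This is only an illustration, not a UCSC tool: the
function name check_download is made up for this example. It compares the
file size on disk with an expected byte count and then reads the gzip
stream to its end, which fails if the archive was cut short. The byte
counts in the usage note are the "saved" sizes from the wget output above.

```python
import gzip
import os
import zlib


def check_download(path, expected_bytes=None):
    """Return (ok, message) for a downloaded .gz file.

    check_download is a hypothetical helper written for this example.
    """
    size = os.path.getsize(path)
    if expected_bytes is not None and size != expected_bytes:
        return False, f"size is {size} bytes, expected {expected_bytes}"
    try:
        with gzip.open(path, "rb") as handle:
            # Read to the end so gzip checks the trailing CRC and length.
            while handle.read(1 << 20):
                pass
    except (EOFError, OSError, zlib.error) as err:
        return False, f"gzip stream is damaged or truncated: {err}"
    return True, f"{size} bytes, gzip stream reads cleanly"
```

Usage would be along the lines of check_download("chr1.fa.gz", 62819741)
and check_download("chr2.fa.gz", 58461619). A False result suggests the
download should be repeated.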

I hope that this helps you. Please let us know if you have further
questions.

Rachel

Rachel Harte
UCSC Genome Bioinformatics Group
http://genome.ucsc.edu


On Mon, 30 Apr 2007, Ben Gantner wrote:

> Hi there:
>
> Sorry for the bother but I've tried repeatedly to download
> the mm8 release of the mouse genome from your site. I'm
> downloading gzipped files from the following page:
> http://hgdownload.cse.ucsc.edu/goldenPath/mm8/chromosomes/
>
> When I decompress the files I count the total sequence length
> per chromosome (minus headers) and I get back the correct
> lengths for most of the chromosomes as described on your
> site:
> http://genome.ucsc.edu/cgi-bin/hgTracks?hgsid=86540626&chromInfoPage=
>
> However, chr1 and chr2 show up short, and I'm missing a
> significant amount of sequence:
> chr1 get : 115212860, expect : 197,069,962
> chr2 get : 102088474, expect : 181,976,762
>
> I've looked through the site FAQ and can't find anything
> about this, any suggestions/ideas??
>
> Thanks so much,
> Ben
> _________________________
> Ben Gantner
> University of Chicago
> Singh Lab, CIS Rm W519
> 929 East 57th Street
> Chicago, IL  60637
> (Ph) 773.702.2912
> _______________________________________________
> Genome maillist  -  Genome at soe.ucsc.edu
> http://www.soe.ucsc.edu/mailman/listinfo/genome
>
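
The length check described in the quoted message (total sequence length
per chromosome, minus headers) can be sketched in Python as follows. This
is an illustration under assumptions, not the script Ben used: the
function name sequence_length is made up, and the file is assumed to be a
gzipped FASTA file whose header lines start with ">".

```python
import gzip


def sequence_length(path):
    """Count sequence characters in a gzipped FASTA file, skipping headers.

    sequence_length is a hypothetical helper written for this example.
    """
    total = 0
    with gzip.open(path, "rt", encoding="ascii") as handle:
        for line in handle:
            if line.startswith(">"):
                continue  # FASTA header line, not sequence
            total += len(line.strip())
    return total
```

For a complete chr1.fa.gz the result should equal the expected length
quoted above, 197,069,962. Note that Python's gzip module raises EOFError
on a truncated file rather than returning a short count.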

