uber / kraken

P2P Docker registry capable of distributing TBs of data in seconds
Apache License 2.0
6.14k stars 423 forks source link

Unable to download the kraken database #314

Open mathavanpu opened 2 years ago

mathavanpu commented 2 years ago

Hello I am trying to download the database using the command

(metawrap_prokka) [suresh@node4 2.Clean]$ kraken-build --standard --threads 50 --db standard

Error Message: Found jellyfish v1.1.12 --2021-12-20 10:44:08-- ftp://ftp.ncbi.nlm.nih.gov/pub/taxonomy/accession2taxid/nucl_gb.accession2taxid.gz => ‘nucl_gb.accession2taxid.gz’ Resolving ftp.ncbi.nlm.nih.gov... 130.14.250.7, 130.14.250.10, 2607:f220:41f:250::228, ... Connecting to ftp.ncbi.nlm.nih.gov|130.14.250.7|:21... connected. Logging in as anonymous ... Logged in! ==> SYST ... done. ==> PWD ... done. ==> TYPE I ... done. ==> CWD (1) /pub/taxonomy/accession2taxid ... done. ==> SIZE nucl_gb.accession2taxid.gz ... 2136404399 ==> PASV ... done. ==> RETR nucl_gb.accession2taxid.gz ... done. Length: 2136404399 (2.0G) (unauthoritative)

nucl_gb.accession2taxid.gz 100%[================================================================================================>] 2.06G 1.65MB/s in 32m 6s

2021-12-20 11:16:18 (1.09 MB/s) - ‘nucl_gb.accession2taxid.gz’ saved [2210782615]

--2021-12-20 11:16:18-- ftp://ftp.ncbi.nlm.nih.gov/pub/taxonomy/accession2taxid/nucl_wgs.accession2taxid.gz => ‘nucl_wgs.accession2taxid.gz’ Resolving ftp.ncbi.nlm.nih.gov... 165.112.9.229, 165.112.9.230, 2607:f220:41f:250::229, ... Connecting to ftp.ncbi.nlm.nih.gov|165.112.9.229|:21... connected. Logging in as anonymous ... Logged in! ==> SYST ... done. ==> PWD ... done. ==> TYPE I ... done. ==> CWD (1) /pub/taxonomy/accession2taxid ... done. ==> SIZE nucl_wgs.accession2taxid.gz ... 3814389489 ==> PASV ... done. ==> RETR nucl_wgs.accession2taxid.gz ... done. Length: 3814389489 (3.6G) (unauthoritative)

nucl_wgs.accession2taxid.gz 100%[================================================================================================>] 3.61G 1.64MB/s in 42m 1s

2021-12-20 11:58:23 (1.47 MB/s) - ‘nucl_wgs.accession2taxid.gz’ saved [3875497401]

Downloaded accession to taxon map(s) --2021-12-20 11:58:23-- ftp://ftp.ncbi.nlm.nih.gov/pub/taxonomy/taxdump.tar.gz => ‘taxdump.tar.gz’ Resolving ftp.ncbi.nlm.nih.gov... 165.112.9.229, 130.14.250.13, 2607:f220:41e:250::12, ... Connecting to ftp.ncbi.nlm.nih.gov|165.112.9.229|:21... connected. Logging in as anonymous ... Logged in! ==> SYST ... done. ==> PWD ... done. ==> TYPE I ... done. ==> CWD (1) /pub/taxonomy ... done. ==> SIZE taxdump.tar.gz ... 56919316 ==> PASV ... done. ==> RETR taxdump.tar.gz ... done. Length: 56919316 (54M) (unauthoritative)

taxdump.tar.gz 100%[================================================================================================>] 54.28M 1.67MB/s in 40s

2021-12-20 11:59:07 (1.35 MB/s) - ‘taxdump.tar.gz’ saved [56919316]

Downloaded taxonomy tree data Uncompressing taxonomy data... gzip: abort: corrupted gzip stream -- crc32 mismatch: nucl_gb.accession2taxid.gz

Plase help me to solve this issue


Describe the bug A clear and concise description of what the bug is.

To Reproduce Steps to reproduce the behavior:

Expected behavior A clear and concise description of what you expected to happen.

Screenshots If applicable, add screenshots to help explain your problem.

Environments e.g Kuberenetes, AWS, GCP, Data centers, laptop, etc.

Additional context Add any other context about the problem here--environment configurations, logs, etc.