soedinglab / MMseqs2

MMseqs2: ultra fast and sensitive search and clustering suite
https://mmseqs.com
MIT License
1.42k stars 195 forks source link

createdb dies when setting up GTDB taxonomy database #809

Open rikander opened 9 months ago

rikander commented 9 months ago

Hi folks! Thanks for maintaining this software. I've been trying to set up the GTDB database for taxonomy analysis, but can't seem to get the database set up regardless of the computer used (I've tried both my Linux server as well as my own Mac laptop; mmseqs was installed using conda on the server and using the MacOSX binary available on the mmseqs installation page on the laptop). Details are below. Thanks in advance for any light you can shed on the issue!

Expected Behavior

Creation of a taxonomy database using GTDB as the database.

Current Behavior

the createdb step dies after downloading the GTDB database.

Steps to Reproduce (for bugs)

command: mmseqs databases GTDB outpath/GTDB tmp

MMseqs Output (for bugs)

bash-3.2$ bin/mmseqs databases GTDB outpath/GTDB tmp databases GTDB outpath/GTDB tmp

MMseqs Version: 2fad714b525f1975b62c2d2b5aff28274ad57466 Tsv false Force restart with latest tmp false Remove temporary files false Compressed 0 Threads 8 Verbosity 3

% Total % Received % Xferd Average Speed Time Time Time Current Dload Upload Total Spent Left Speed 100 31 100 31 0 0 41 0 --:--:-- --:--:-- --:--:-- 41 % Total % Received % Xferd Average Speed Time Time Time Current Dload Upload Total Spent Left Speed 100 51.3G 100 51.3G 0 0 1565k 0 9:33:26 9:33:26 --:--:-- 2662k % Total % Received % Xferd Average Speed Time Time Time Current Dload Upload Total Spent Left Speed 100 54.4M 100 54.4M 0 0 1374k 0 0:00:40 0:00:40 --:--:-- 939k % Total % Received % Xferd Average Speed Time Time Time Current Dload Upload Total Spent Left Speed 100 1080k 100 1080k 0 0 489k 0 0:00:02 0:00:02 --:--:-- 491k tar2db tmp/14668141768776664068/gtdb.tar.gz tmp/14668141768776664068/tardb --tar-include faa.gz$ --threads 8 -v 3

Time for merging to tardb: 0h 0m 0s 28ms Time for merging to tardb.lookup: 0h 0m 0s 13ms Time for processing: 0h 7m 32s 203ms createdb tmp/14668141768776664068/tardb outpath/GTDB --compressed 0 -v 3

Converting sequences Cannot open outpath/GTDB.source for writing Error: createdb died

Context

Please note that I get this error on both my local computer and on my server; using different methods of installation.

Your Environment

Include as many relevant details about the environment you experienced the bug in.

hariszaf commented 9 months ago

Had the exact same behaviour on linux.

Amanda-Pinski commented 2 months ago

I have this same issue on linux. I can't figure out what it is.