soedinglab / hh-suite

Remote protein homology detection suite.
https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-019-3019-7
GNU General Public License v3.0

hhsuitedb.py generates empty cs219 files #296

Open shaharbr opened 2 years ago

shaharbr commented 2 years ago

Expected Behavior

Build a database with hhsuitedb.py

Current Behavior

The cs219.ffdata and cs219.ffindex files are generated empty (0 bytes).

Steps to Reproduce

Command run: hhsuitedb.py --ia3m=genes_clust_95_PCs/a3m --ihhm=genes_clust_95_PCs/.hmm -o genes_clust_95_all_db --cpu 2

Content of the directory genes_clust_95_PCs/ :

HH-suite Output:

PC_00001_ali.a3m    0   222685  104 0
PC_00002_ali.a3m    222685  180206  85  0
PC_00003_ali.a3m    402891  154761  80  0
WARNING: Missing entry PC_00003_ali.a3m in genes_clust_95_all_db_cs219.ff{data,index}!
WARNING: Missing entry PC_00002_ali.a3m in genes_clust_95_all_db_cs219.ff{data,index}!
WARNING: Missing entry PC_00001_ali.a3m in genes_clust_95_all_db_cs219.ff{data,index}!
You may try to use the option --force to fix the database!
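
The warnings above name entries that are listed in the a3m index but absent from the cs219 index. A minimal sketch for reproducing that comparison by hand follows. It assumes the usual ffindex layout of one whitespace-separated record per line with the entry name in the first column; the helper names `read_ffindex_names` and `missing_entries` are hypothetical and not part of HH-suite.

```python
from pathlib import Path


def read_ffindex_names(index_path):
    """Return the set of entry names (first column) found in an ffindex file."""
    names = set()
    for line in Path(index_path).read_text().splitlines():
        fields = line.split()
        if fields:  # skip blank lines; an empty file yields an empty set
            names.add(fields[0])
    return names


def missing_entries(reference_index, other_index):
    """Return, sorted, the entries in reference_index that other_index lacks."""
    reference = read_ffindex_names(reference_index)
    other = read_ffindex_names(other_index)
    return sorted(reference - other)
```

Calling `missing_entries("genes_clust_95_all_db_a3m.ffindex", "genes_clust_95_all_db_cs219.ffindex")` in the output directory should list the same entries as the warnings if the cs219 index is empty.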

Files generated:

       88 Oct 28 16:42 genes_clust_95_all_db_a3m.ffindex
   557652 Oct 28 16:42 genes_clust_95_all_db_a3m.ffdata
        0 Oct 28 16:42 genes_clust_95_all_db_cs219.ffindex
        0 Oct 28 16:42 genes_clust_95_all_db_cs219.ffdata
       83 Oct 28 16:42 genes_clust_95_all_db_hhm.ffindex
    97308 Oct 28 16:42 genes_clust_95_all_db_hhm.ffdata
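
A quick way to spot this condition after a build is to check the output files for zero size. The sketch below is only an illustration: the suffix list is copied from the listing above, and `empty_db_files` is a hypothetical helper, not an HH-suite function.

```python
import os

# Suffixes taken from the file listing above.
DB_SUFFIXES = (
    "_a3m.ffindex", "_a3m.ffdata",
    "_cs219.ffindex", "_cs219.ffdata",
    "_hhm.ffindex", "_hhm.ffdata",
)


def empty_db_files(prefix):
    """Return the database files for `prefix` that exist but are 0 bytes."""
    empty = []
    for suffix in DB_SUFFIXES:
        path = prefix + suffix
        if os.path.isfile(path) and os.path.getsize(path) == 0:
            empty.append(path)
    return empty
```

For the build reported here, `empty_db_files("genes_clust_95_all_db")` would be expected to return the two cs219 files.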

Content of output files:

Context

Trying to run hhsuitedb.py as part of this code (line 149).

Your Environment

HH-suite version: compiled from source, cloned from GitHub on October 27.
Operating system: Ubuntu 20.04.3 LTS.
System memory (MemTotal): 16151700 kB.