soedinglab / hh-suite

Remote protein homology detection suite.
https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-019-3019-7
GNU General Public License v3.0
515 stars 128 forks source link

Segmentation fault when running hhblits against custom database [fixed] #293

Open almutwerner opened 2 years ago

almutwerner commented 2 years ago

Hey everyone, I encountered this issue but was able to fix it myself. I'll leave it here, in case anyone encounters the same problem.

I created a custom database starting with MSA files, based on the tutorial. When I tried to open the nohup log, it was asking me if I really wanted to open it, since it could be a binary and the names of my files where butchered, like AAA32308.1__J02451_CDS.faa.mafft<AA>p<A0>^C 58656246 1045 126 0 but I thought nothing of it. Later I ran hhblits against this database and kept getting segmentation faults along the way. I traced it back to the part of the database creation, where you add a consensus sequence (that prints the previous table). Apparently, the file names can't be longer than 32 symbols, otherwise it will cause trouble. Once I cut the file names to the allowed size, all went smoothly.

Edit: I just saw, this is also the case for ffindex_build -s ../<db>_msa.ff{data,index} . already