soedinglab / hh-suite

Remote protein homology detection suite.
https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-019-3019-7
GNU General Public License v3.0
515 stars 128 forks source link

build customized databases #288

Open Wanli-HE opened 2 years ago

Wanli-HE commented 2 years ago

:exclamation: Make to check out our User Guide.

hi ! i am trying to build a customized databases, using the workflow that recorded in "user guide"

first step using "ffindex_from_fasta" to convert the input FASTA file. command like below: "ffindex_from_fasta -s all-neighbours-refer-and-metagenome.fa.ffdata all-neighbours-refer-and-metagenome.fa.ffindex all-neighbours-refer-and-metagenome.fa"

it is works. but in the next step, using "hhblits_omp" to get MSA file. "hhblits_omp -i all-neighbours-refer-and-metagenome.fa -d /home/projects/ku_00041/data/hh-suite-database/UniRef30_2020_06 -oa3m all-neighbours-refer-and-metagenome_a3m_ww_ss -n 2 -cpu 35"

but it not working, the error like: ------------------------------------------------------------------error--------------------------------------------------------------

' 09:13:31.697 ERROR: unrecognized input file format in '3300029656.a:Ga0245165_100003_39¸

here is my fasta file: image

and the version is 3.30

so do you have any idea about this error.

best, wanli

Wanli-HE commented 2 years ago

ok i had figure out, it is due to the length of protein name