Closed Fede112 closed 1 year ago
Found the solution. The issue was in the way hhconsensus annotates the consensus protein. By just adding a comment on top of each MSA I got it working as I wanted to. As an example:
# FAM1231
>Prot1| xx-yy
KDGVSGTSDLKLLGAARARLR
Still it would be nice to be able to choose this from command line.
Expected Behavior
I succesfully built a custom database from a set of protein families, each represented by an MSA file (F0001.fas - F0002.fas - etc. ) The problem comes when searching against my custom database. I wanted hits found using
hhsearch
to be named according to the original MSA files. Instead they are not. The goal is to reproduce the results obtained when searching against the prebuilt pfam35 database:This is not a bug but it would be nice to be able to change the target name if needed.
Current Behavior
Using hhsearch against my custom database produces hits named after the first protein (or consensus one?) and not the original family name specified in the respective MSA file.
Steps to Reproduce (for bugs)
1) Build custom database as detailed in the user guide (section: starting with MSAs)
2)
hhsearch -cpu 4 -i query.hhm -d ./databases/custom_db -o query.hhr
Your Environment