Open kchennen opened 6 months ago
Automatic taxid extraction works only for mmCIF files that have one of the following three fields somewhere:
_entity_src_nat.pdbx_ncbi_taxonomy_id
_entity_src_gen.pdbx_gene_src_ncbi_taxonomy_id
_pdbx_entity_src_syn.ncbi_taxonomy_id
As PDB files don't commonly contain (easily extractable) taxonomy information we don't try to read that information.
Doing this by hand is possible however a bit more involved. See the following MMseqs2 wiki section: https://github.com/soedinglab/MMseqs2/wiki#create-a-seqtaxdb-by-manual-annotation-of-a-sequence-database
Hello, I am having issues to run foldseek easy-search on my custom database with only yeasts AF2 pdb files. I think that the problem might be because I am asking for the "taxid,taxname" columns in the output format. How can I create these missing files for my custom target database? Could you please help me?
Current Behavior
Steps to Reproduce (for bugs)
Foldseek Output (for bugs)