Open drumyerscough opened 1 year ago
I think these were primarily cases where the PDB mmCIF file contained weird taxonomy entries like "Species1 & Species2". I just dropped these instead of coming up with some ad-hoc solution. I am not sure if there is "cleaner" input elsewhere.
Hello,
I've noticed that for a particular query a small percentage (~4%) of hits from the PDB100 do not include taxonomic identifiers. The last two fields of these lines in the m8 files are "0 unclassified" even though taxonomic identifiers do seem to be present when I view the structures on the PDB website. This isn't a major problem, but it is annoying given that the server uses the PDB100 and I'm using taxonomic identifiers to remove different structures of the same protein, point mutants, etc.
I can upload the query structure and the m8 file if needed.
Thanks!