If a representative sequence is present in a mock community (and hence the expected taxonomy file) but is not classified by some taxonomy classifiers (ahem... current sklearn naive-bayes), the eval throws an error:
We want this behavior because because we want to catch scenarios where, e.g., classification failed and empty files are output, or other silly things like that that do happen.
The ideal solution is to enforce that classifiers report unassigned observations, so that there is no ambiguity about the classification of this observation or if/where this observation was missing.
However, this could be a wall for some, e.g., if a non-developer of classifier X wants to include it in an evaluation.
Any ideas on how to resolve? E.g., allow override of this AssertionError?
If a representative sequence is present in a mock community (and hence the expected taxonomy file) but is not classified by some taxonomy classifiers (ahem... current sklearn naive-bayes), the eval throws an error:
We want this behavior because because we want to catch scenarios where, e.g., classification failed and empty files are output, or other silly things like that that do happen.
The ideal solution is to enforce that classifiers report unassigned observations, so that there is no ambiguity about the classification of this observation or if/where this observation was missing.
However, this could be a wall for some, e.g., if a non-developer of classifier X wants to include it in an evaluation.
Any ideas on how to resolve? E.g., allow override of this AssertionError?