Closed mjwestgate closed 3 years ago
Ah yes I agree this test should be taken out. Could be replaced with a check for numeric characters? It is probably also worth ensuring that in general one name can't corrupt the whole vector- I see two options here:
I found a bug when searching for a large number of taxonomic names at once. First, the error messages appeared wrong, being of the form:
No match found for identifier [supplied name here]
More importantly, the resulting
data.frame
only contains information on issues, i.e.:Digging into the code, it appears that line 190-193 of
select_taxa()
automatically interprets strings of length 32 or 36 as indicating that the string in question is an ID, not a taxonomic name:From the annotation it appears that particular databases have identifiers of this length. However something more specific is needed here, as this behaviour can't be changed by the user. Also in any vector of sufficient length it is likely that at least one string will have 32 or 36 characters long (I found two in a vector of length 1175).