Open: nlkinlock opened this issue 2 years ago
@nlkinlock Sorry I missed this issue report. I was traveling at the time and the notification got buried in my inbox.
This is very strange. At a glance, it looks like Name_submitted="Acronychia octandra" is being repeated and assigned to the results for Actinidia persicina. This is reminiscent of EnquistLab/RTNRS#14, which I believe is caused by a bug in the Perl parallel controller code. However, Overall_score=0.9 doesn't apply to any name in your example batch, and I have no idea where it comes from.
Unfortunately I can't replicate the issue, even after trying a couple of times with your names. But I'm about to start working on EnquistLab/RTNRS#14. Refactoring the controller may solve several issues. I'll keep an eye out for this one as well.
I'm running into an issue when resolving large numbers of names using the TNRS API via the R package or the web application.
Occasionally (roughly 1 name per 1,000), two results are returned for a submitted name that has a single exact match in the backbone. The second returned name is often taxonomically unrelated to the first, and both names are present in the input list. For both returned names, the output always shows an Overall_score of exactly 0.9, and Genus_submitted is the genus of the extra name. Below is an example of this output:
Unfortunately, I can't consistently replicate this issue. When resolving the same taxon list repeatedly, it will occur for different names. I'm standardizing using the WFO backbone, but the same issue occurs using the WCVP and Tropicos backbones. I've attached a text file with taxon names that will trigger this issue the majority of the time.
test_taxa.txt
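Since the problem is intermittent, it may help to scan each result set for the symptoms described above. Here is a rough sketch in Python, not part of TNRS or its R package. It assumes the results have already been loaded as a list of dicts keyed by the output column names (Name_submitted, Genus_submitted, Overall_score), and that only one row per submitted name is expected. The function name `flag_suspect_rows` and the example rows are made up for illustration and are not actual TNRS output.

```python
import math
from collections import Counter


def flag_suspect_rows(rows):
    """Return (row, reasons) pairs for rows showing the symptoms reported here.

    `rows` is assumed to be a list of dicts, one per result row.
    """
    counts = Counter(row["Name_submitted"] for row in rows)
    flagged = []
    for row in rows:
        reasons = []
        name = row["Name_submitted"].strip()
        first_word = name.split()[0] if name else ""
        genus = str(row.get("Genus_submitted", "")).strip()

        # Symptom 1: more than one result row for the same submitted name.
        if counts[row["Name_submitted"]] > 1:
            reasons.append("duplicate Name_submitted")

        # Symptom 2: Genus_submitted is not the first word of the submitted name.
        if genus.lower() != first_word.lower():
            reasons.append("Genus_submitted does not match submitted name")

        # Symptom 3: a score of 0.9, noted only alongside another symptom,
        # since 0.9 on its own can be a legitimate fuzzy-match score.
        try:
            score = float(row.get("Overall_score", "nan"))
        except (TypeError, ValueError):
            score = float("nan")
        if reasons and math.isclose(score, 0.9):
            reasons.append("Overall_score is 0.9")

        if reasons:
            flagged.append((row, reasons))
    return flagged


# Hypothetical rows for illustration only (not real TNRS output).
example_rows = [
    {"Name_submitted": "Acronychia octandra", "Genus_submitted": "Acronychia", "Overall_score": "1"},
    {"Name_submitted": "Actinidia persicina", "Genus_submitted": "Acronychia", "Overall_score": "0.9"},
    {"Name_submitted": "Actinidia persicina", "Genus_submitted": "Acronychia", "Overall_score": "0.9"},
]

for row, reasons in flag_suspect_rows(example_rows):
    print(row["Name_submitted"], "->", "; ".join(reasons))
```

The genus check is a simple first-word comparison, so names with a leading hybrid marker or other prefix would be flagged as false positives and need a manual look.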