gsautter / goldengate-imagine

Automatically exported from code.google.com/p/goldengate-imagine

zootaxa: materialsCitation not detected DB2D0877A538FFD68B1BFF87FFC3FFC0 #867

Open myrmoteras opened 4 years ago

myrmoteras commented 4 years ago

image

gsautter commented 4 years ago

On the paragraph proper, I found the materials citations are detected OK, if only after a few corrections to details: specimen codes like 100121 are erroneously tagged as dates, since this might actually be a condensed form of 1921-01-10. There are early databases that used this format, so it is not all too far-fetched that it could make it into an article.

After removing those dates, the splitting works spot-on, and most details come up nicely as well. I tend to think that, before the correction, some filter kicked in during the full-document run because of the erroneous splits.
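
To illustrate why a specimen code like 100121 can pass for a date, here is a minimal Python sketch. It assumes a DDMMYY reading with the century fixed at 19xx, which is only one possible interpretation of the condensed format mentioned above. The function name `parse_condensed_date` and its `century` parameter are hypothetical and are not part of the GoldenGate-Imagine code base.

```python
from datetime import date
from typing import Optional


def parse_condensed_date(token: str, century: int = 1900) -> Optional[date]:
    """Read a six-digit token as DDMMYY (assumed layout).

    Returns the resulting date, or None if the token is not six ASCII
    digits or does not form a valid calendar date.
    """
    if len(token) != 6 or not (token.isascii() and token.isdigit()):
        return None
    day = int(token[0:2])
    month = int(token[2:4])
    year = int(token[4:6])
    try:
        # The century is an assumption; the token itself does not carry it.
        return date(century + year, month, day)
    except ValueError:
        # For example, month 35 or day 31 in a 30-day month.
        return None


print(parse_condensed_date("100121"))
print(parse_condensed_date("103521"))
```

Under these assumptions, "100121" reads as 1921-01-10, while "103521" is rejected because 35 is not a valid month. Any six-digit specimen code that happens to form a valid day and month is therefore indistinguishable from a date by shape alone, so telling the two apart would need context from the surrounding text.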