gsautter / goldengate-imagine

Automatically exported from code.google.com/p/goldengate-imagine
Other
1 stars 0 forks source link

article does not process: zootaxa zootaxa.4372.2.3 #344

Open myrmoteras opened 7 years ago

myrmoteras commented 7 years ago

zootaxa.4372.2.3.pdf

One more of the files that get hung up with marking materials citation

Processing document 'E:\diglib\zootaxa\temp\zootaxa.4372.2.3.pdf'

D:\GoldenGateImagine20170823>

gsautter commented 7 years ago

Anything at all in the logs? The error log at least?

Anyway, taking a close look at the MCs throughout this one, it turns out they are quite irregular, both with regard to detail ordering and with regard to the punctuation marks separating the details:

As this irregularity exists in each and every MC individual paragraph, inferring record boundaries becomes a true challenge. Might well be the search runs away for some paragraph simply because there is nothing to converge upon ... I'll try and think something up, but I'm quite reluctant to make any promises here.