CrossRef / pdfextract

MOVED TO https://gitlab.com/crossref/pdfextract
MIT License

Not all references extracted #36

Open robinwalter opened 8 years ago

robinwalter commented 8 years ago

Hello everybody!

I have tried to extract references from several papers, but it always misses 10-11 references. In each paper they are the first 10-11 entries under the title "Reference". They are either on another page or in another column of the text.

I have tried with --set reference_flex:0.25, and also with values from 0.1 to 0.9. It made no difference to the extraction.

pdf-extract works perfectly otherwise.

Does anyone know what I can do to fix this?

Thanks in advance! Robin