CrossRef / pdfextract

MOVED TO https://gitlab.com/crossref/pdfextract
MIT License

Problem with outdent delimited refs #21

Open Bastien1 opened 9 years ago

Bastien1 commented 9 years ago

Hi guys,

I just installed pdf-extract (pretty neat program, by the way ;)), but the PDF I'm feeding it has outdent-delimited references (see http://nlabs.labs.crossref.org/pdfextract/citation_categories/). Unfortunately, instead of starting each reference at its first line (the outdented one), pdf-extract thinks the reference starts at the line below and ends one line too far (see image). When I play with reference_flex (lowering it, 0.15), the reference from the first page comes out OK (that page contains normal text plus the reference section), but the references on the following page still show the same indent/outdent problem. I didn't see any setting for adjusting the indent/outdent handling. Can anyone help? Thanks!

[image: outdent_ex]
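
To make the expected behaviour concrete, here is a minimal Python sketch of outdent-delimited grouping: a line whose left edge sits at the leftmost margin starts a new reference, and indented lines continue the previous one. This is only an illustration of what I'd expect, not pdf-extract's actual algorithm; the function name `group_outdented`, the `(x_offset, text)` input format, and the `tolerance` parameter are all made up for the example.

```python
def group_outdented(lines, tolerance=1.0):
    """Group (x_offset, text) lines into references.

    Hypothetical helper: a line whose left edge is within `tolerance`
    of the leftmost margin is treated as the outdented first line of
    a new reference; any other line continues the current reference.
    """
    if not lines:
        return []
    # The outdented lines define the leftmost margin of the block.
    margin = min(x for x, _ in lines)
    refs = []
    for x, text in lines:
        if x - margin <= tolerance or not refs:
            refs.append([text])      # outdented line: start a new reference
        else:
            refs[-1].append(text)    # indented line: continuation
    return [" ".join(parts) for parts in refs]


# Made-up sample data: x offsets in arbitrary units.
sample = [
    (10, "Smith, J. (2001)."),
    (25, "A title."),
    (10, "Jones, K. (2002)."),
    (25, "Another title."),
    (25, "Journal."),
]
grouped = group_outdented(sample)
```

With the sample above, each reference begins at its outdented line rather than at the line below it, which is the behaviour I'm missing on the second page.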