Closed lfoppiano closed 6 years ago
It's actually a pdf2xml issue, the tokens in the produced XML file are all empty when they arrive to GROBID.
Moving this issue to pdf2xml... Note: file is copyrighted, I replace it with a link to the pdf on publisher site.
In the following PDF:
https://link.springer.com/chapter/10.1007%2F978-3-642-21560-5_33
no fulltext and no bibliographical information are extracted, here the output xml obtained: