Open michamos opened 8 months ago
Hi @michamos, long time not see 😄 It's nice that you're back working with Grobid? Thanks for opening the issue.
It seems more a problem due to how Jakarta selects the default when Accept is not specified. In local, when I use the same request you posted, I get TEI-XML, however I think it depends how the methods are loaded. It seems that there is no clear behaviour, althought this looks strange.
One solution I saw is to add an additional filter to default the Accept to application/xml
when undefined, but it seems a bit of a hack and might affect other endpoints.
I will check it out a bit more in detail
Hi @lfoppiano, indeed :) We've been using GROBID in prod for INSPIRE for quite a while now. We use it to extract author and affiliation info from PDFs and to segment references for interactive search (so users can copy/paste references from a paper and it magically works). Unfortunately, our current resources are very limited, so we can't really contribute beyond submitting bug reports.
Thanks for looking into the issue!
I dug into this and did not find a clean solution. I'm quite surprised that there is no way to define a default behavior. It seems that the behavior is random depending on the platform where it's running.
Nevertheless, I updated the documentation, though, stating that the Accept
header is required.
Hi, I noticed that, at least since v0.7.3, GROBID started returning bibtex by default for
/api/processHeaderDocument
. This contradicts https://grobid.readthedocs.io/en/latest/Grobid-service/#apiprocessheaderdocument which claims a specialAccept: application/x-bibtex
header must be used for BibTeX and that the default is TEI XML.Note that it's possible to get an XML response by using
Accept: application/xml
.Steps to reproduce
curl https://kermitt2-grobid.hf.space/api/processHeaderDocument --form input=@Downloads/2212.12604v1.pdf
Requested info
Linux amd64 through
lfoppiano/grobid:0.7.3
Docker image & whatever huggingface is usingjava --version
)?openjdk 17.0.2 2022-01-18 OpenJDK Runtime Environment (build 17.0.2+8-86) OpenJDK 64-Bit Server VM (build 17.0.2+8-86, mixed mode, sharing)
--stacktrace
and--info
for better log traces (e.g../gradlew run --stacktrace --info
) or attach the log filelogs/grobid-service.log
.