neuml / paperetl

📄 ⚙️ ETL processes for medical and scientific papers
Apache License 2.0

sample lines for running etl server and grobid instance #53

Open amscosta opened 8 months ago

amscosta commented 8 months ago

Apologies if my question is too silly. In the description you wrote: "PDF parsing relies on an existing GROBID instance to be up and running. It is assumed that this is running locally on the ETL server." Can you provide some sample lines showing how to do that? Best regards

davidmezzetti commented 8 months ago

I usually download and build GROBID per these instructions: https://grobid.readthedocs.io/en/latest/Install-Grobid/

Then start it locally with ./gradlew run

Alternatively, you can run it with Docker: https://grobid.readthedocs.io/en/latest/Run-Grobid/
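
Whichever way GROBID is started, it can help to confirm the service is reachable before running the ETL process. The snippet below is a minimal sketch of such a check, not part of paperetl. It assumes GROBID is listening on its default port 8070 on localhost and exposes the `/api/isalive` health endpoint. The function names `isalive_url` and `grobid_is_alive` are hypothetical helpers made up for illustration.

```python
import urllib.request


def isalive_url(base_url="http://localhost:8070"):
    """Build the health-check URL for a GROBID instance.

    The default base URL is an assumption (GROBID's default port on localhost).
    """
    return base_url.rstrip("/") + "/api/isalive"


def grobid_is_alive(base_url="http://localhost:8070", timeout=5.0):
    """Return True if the GROBID instance answers its health check, else False."""
    try:
        with urllib.request.urlopen(isalive_url(base_url), timeout=timeout) as response:
            body = response.read().decode("utf-8").strip().lower()
            # A running instance is expected to answer HTTP 200 with the text "true"
            return response.status == 200 and body == "true"
    except (OSError, ValueError):
        # Connection refused, timeout, HTTP error, or a malformed URL
        return False
```

Calling `grobid_is_alive()` after `./gradlew run` or after the Docker container has started should return `True` once the service has finished loading. If GROBID runs on a different host or port, pass that base URL instead.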