kermitt2 / grobid_client_python

Python client for GROBID Web services
Apache License 2.0
275 stars 74 forks source link

Add --extractRawText to extract raw text from xml #63

Open sanchay-hai opened 1 year ago

sanchay-hai commented 1 year ago

Just extracting all the raw pdf text from the body section

Tested by running locally on a few files