nlmatics / llmsherpa

Developer APIs to Accelerate LLM Projects
https://www.nlmatics.com
MIT License
1.41k stars, 139 forks

LLMSHERPA API LOCAL HOSTING #105

Open Enaouram opened 2 months ago

Enaouram commented 2 months ago

Hey everyone, I have a problem with the locally hosted llmsherpa API. I've followed every step at https://github.com/nlmatics/nlm-ingestor, but I still can't get my documents chunked once I'm connected to the endpoint, and I don't know what the issue is.

[two images attached]
aaryan200 commented 2 months ago

Is it a scanned PDF? If yes, you might want to use the OCR option (add &applyOcr=yes to the API URL).
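For anyone following along, here is a minimal sketch of what adding the OCR flag to the API URL could look like from Python. The base URL (http://localhost:5010/api/parseDocument) and the renderFormat=all parameter are assumptions based on a typical local nlm-ingestor setup and are not stated in this thread; adjust the host and port to match your own container. The helper names build_parse_url and read_chunks are hypothetical and exist only for this illustration.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Assumption: the ingestor is reachable on this host/port; change it to match
# however you started the service locally.
LOCAL_BASE_URL = "http://localhost:5010/api/parseDocument"


def build_parse_url(base_url=LOCAL_BASE_URL, apply_ocr=False):
    """Return the parse endpoint URL, optionally with applyOcr=yes appended."""
    parts = urlsplit(base_url)
    params = dict(parse_qsl(parts.query))
    # Assumption: renderFormat=all is the usual setting for this endpoint.
    params.setdefault("renderFormat", "all")
    if apply_ocr:
        params["applyOcr"] = "yes"
    return urlunsplit(parts._replace(query=urlencode(params)))


def read_chunks(pdf_path, apply_ocr=False):
    """Parse a PDF through the local endpoint and return chunk texts.

    Hypothetical helper: needs the llmsherpa package and a running service.
    """
    # Imported lazily so the URL helper above works without llmsherpa installed.
    from llmsherpa.readers import LayoutPDFReader

    reader = LayoutPDFReader(build_parse_url(apply_ocr=apply_ocr))
    doc = reader.read_pdf(pdf_path)
    return [chunk.to_context_text() for chunk in doc.chunks()]
```

Calling read_chunks("my_document.pdf", apply_ocr=True) with a placeholder file name would send the document to the OCR-enabled URL. If it comes back with an empty list, printing the URL from build_parse_url and checking the service logs is a reasonable next step.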

Enaouram commented 2 months ago

The PDFs I tried parsing are not scanned, but I tried adding '&applyOcr=yes' anyway and still didn't get the chunks/sections of the document. Here's what I got in my cmd after adding '&applyOcr=yes' to the API URL:

[screenshot of the command-prompt output]