Open pashpashpash opened 3 months ago
Are you using OCR?
Are you using OCR?
Nope. OCR is off. 30+ s for basic PDFs
Same here. I'm getting 49.8 seconds for this 78 page PDF, no OCR: https://www.hireexpress.com.au/files/operation_manuals/200840_O.pdf
Anyone found a way to speed it up?
Edit: It's closer to 80 seconds for the above PDF
@ansukla sorry to bother, but any chance we can get an update/priority on this? It's severely impacting production performance
As mentioned here: https://github.com/nlmatics/nlm-ingestor/issues/37
Chunking even small PDFs (<20 pages) takes longer than 30 seconds! This is a huge problem in any production environment. Why is this happening?