databrickslabs / tika-ocr

Other
17 stars 2 forks source link

Bug/file limit #31

Closed aamend closed 8 months ago

aamend commented 8 months ago

Error:

Your document contained more than 100000 characters

Fix:

Passing unlimited byte buffer to BodyContentHandler as an option, default to -1 (i.e. unlimited)