jmw8033 / Pewter

Emailed invoice file handler
2 stars 1 forks source link

PDFs with no text #2

Open jmw8033 opened 1 year ago

jmw8033 commented 1 year ago

When a pdf doesn't actually have any text it uses pytesseract (OCR) to try and read the text, but it kinda sucks. Need to find a better way.

jmw8033 commented 2 months ago

I completely removed the use of pytesseract since it was too inaccurate. Unless a better OCR is found, these invoices will just have to be manually named.