jonaswinkler / paperless-ng

A supercharged version of paperless: scan, index and archive all your physical documents
https://paperless-ng.readthedocs.io/en/latest/
GNU General Public License v3.0
5.37k stars, 358 forks

change pdfminer's word_margin=1 for better text extraction results #1735

Open tmbinc opened 1 year ago

tmbinc commented 1 year ago

Experimental change for #1734.
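
For context, a rough sketch of what a word margin controls. In pdfminer.six, `word_margin` is a layout-analysis parameter (set through `LAParams`, default 0.1): characters on the same line that are further apart than this margin, measured relative to the character width, are treated as separate words and a space is inserted between them. The toy function below is not pdfminer's code; `join_chars` and the `(text, x0, x1)` tuples are hypothetical names made up for illustration, and it only mimics the thresholding idea.

```python
def join_chars(chars, word_margin=0.1):
    """Join characters on one line, inserting a space where the gap is large.

    chars: list of (text, x0, x1) tuples sorted left to right (hypothetical format).
    word_margin: gap threshold as a fraction of the current character's width.
    """
    out = []
    prev_x1 = None
    for text, x0, x1 in chars:
        width = x1 - x0
        # Insert a space only when the gap to the previous character
        # exceeds the margin scaled by this character's width.
        if prev_x1 is not None and (x0 - prev_x1) > word_margin * width:
            out.append(" ")
        out.append(text)
        prev_x1 = x1
    return "".join(out)


# Loosely spaced glyphs: small gap (2) between a and b, large gap (12) before c.
line = [("a", 0, 10), ("b", 12, 22), ("c", 34, 44)]

tight = join_chars(line, word_margin=0.1)  # small margin: every gap splits
loose = join_chars(line, word_margin=1.0)  # larger margin: only the big gap splits
print(repr(tight), repr(loose))
```

With a small margin, even the narrow gap between loosely set glyphs is read as a word break; a larger margin tolerates it and splits only at the wide gap. Whether a value of 1 is the right trade-off for real documents is exactly what this experiment is meant to find out.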

slankes commented 1 year ago

paperless-ng is pretty much abandoned. Have a look at https://github.com/paperless-ngx/paperless-ngx for a maintained fork.