dinosauria123 / gcv2hocr

gcv2hocr converts from Google Cloud Vision OCR output to hocr to make a searchable pdf.
99 stars 33 forks source link

enforce encoding="utf-8" #28

Closed ailintom closed 5 years ago

ailintom commented 5 years ago

Encoding="utf-8" should be enforced, otherwise the script fails in environments with a non-standard command line locale.

The UTF-8 encoding of the resulting file was already specified in the tag. Hence, Python 3 should always output a utf-8 file.

dinosauria123 commented 5 years ago

Thank you for many patches !