dinosauria123 / gcv2hocr

gcv2hocr converts Google Cloud Vision OCR output to hOCR to make a searchable PDF.

Iterate through the buf to escape XML chars #26

Closed ailintom closed 5 years ago

ailintom commented 5 years ago

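The current XML escaping mechanism is faulty: it enters an infinite loop when it encounters an '&', because it replaces '&' with "&amp;", which itself contains an '&'. Each replacement leaves a new '&' for the next search to find, so the loop never terminates.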
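
The approach named in the title, iterating through the buffer once, can be sketched as follows. This is a minimal illustration in Python, not the code from the patch; the function name `escape_xml` and the exact set of escaped characters are assumptions made for the example. Each input character is visited exactly once and the output is written to a separate list, so an '&' produced by an earlier replacement is never scanned again.

```python
# Hypothetical helper for illustration only; not taken from gcv2hocr.
XML_ESCAPES = {
    "&": "&amp;",
    "<": "&lt;",
    ">": "&gt;",
    '"': "&quot;",
    "'": "&apos;",
}


def escape_xml(text: str) -> str:
    """Escape XML special characters in a single left-to-right pass."""
    out = []
    for ch in text:
        # Look up each input character once; the replacement text goes
        # to the output and is never re-examined.
        out.append(XML_ESCAPES.get(ch, ch))
    return "".join(out)


if __name__ == "__main__":
    print(escape_xml("Tom & Jerry <1940>"))
```

Because the pass reads from the input and writes to a separate output, it always finishes after as many steps as there are input characters, regardless of how many '&' characters the text contains.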

dinosauria123 commented 5 years ago

Thank you for using gcv2hocr and for making a great patch!