freelawproject / doctor

A microservice for document conversion at scale
https://free.law/projects/doctor
BSD 2-Clause "Simplified" License

fix(HTML): Add better error message and latin1 encoding #136

Closed flooie closed 2 years ago

flooie commented 2 years ago

HTML handling continues to fail on latin-1 encoded or partially encoded HTML files. I am trying to pinpoint where it fails, but so far I have been unable to reproduce the failure from a file.
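For illustration only, here is a minimal sketch of the kind of fallback decoding this is about. It is not the code in this PR; the function name `decode_html_bytes` is hypothetical, and it assumes the input arrives as raw bytes. It tries strict UTF-8 first and falls back to latin-1, which maps every byte to a character and so cannot raise a decode error.

```python
from typing import Tuple


def decode_html_bytes(content: bytes) -> Tuple[str, str]:
    """Decode raw HTML bytes, returning (text, encoding_used).

    Hypothetical helper for illustration; not part of doctor's API.
    """
    try:
        # Strict UTF-8 raises UnicodeDecodeError on invalid byte sequences.
        return content.decode("utf-8"), "utf-8"
    except UnicodeDecodeError:
        # latin-1 maps each of the 256 byte values to a code point,
        # so this decode always succeeds (though it may yield mojibake
        # if the file was actually in some other encoding).
        return content.decode("latin-1"), "latin-1"


if __name__ == "__main__":
    text, encoding = decode_html_bytes(b"<p>caf\xe9</p>")
    print(encoding, text)
```

Returning the encoding alongside the text makes it possible to put the encoding that was actually used into an error message, which may help when a failure cannot be reproduced from a file.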