freelawproject / doctor

A microservice for document conversion at scale
https://free.law/projects/doctor
BSD 2-Clause "Simplified" License
54 stars 14 forks source link

fix(tasks): Encode bytes correctly in extract html #134

Closed flooie closed 2 years ago

flooie commented 2 years ago

Extract HTML had some ... python2-esque encoding going on. This fix should resolve some bad encoding that was going on and HTML extraction

flooie commented 2 years ago

This got unfortunately held up by a mismatch with the new seal-rookery.

flooie commented 2 years ago

This is a small tweak to open HTML files with proper encoding.

mlissner commented 2 years ago

Ah, right. My fault. I harmonized the seal and portraits repos to both use search. Should have remembered to update CL.

flooie commented 2 years ago

Should've seen it earlier. Thanks for harmonizing it