JBGruber / paperboy

A comprehensive (eventually) collection of webscraping scripts for news media sites
45 stars 2 forks source link

Illegal characters #15

Open JBGruber opened 1 year ago

JBGruber commented 1 year ago

I already tried to fix the weird issues with wrong encodings in 0471ce7165fb06719000ea269d77b0de59b1c4aa and once before, but they still seem to persist. Now I discovered that parse_response_disk does not suffer from the same issue, making readBin the prime suspect to introduce the wrong encoding.