Open edevil opened 7 years ago
I'll leave the broader question of "can the library be fixed to handle this situation?" to Hans, but-
Can I get around this problem
Yeah, to some definition of get around.
body
|> Codepagex.to_string!(:iso_8859_1)
|> Html5ever.parse()
Thanks, @mischov!
Parsing pages not written in UTF-8 currently produces errors:
In this case this XML feed has the encoding in the xml preeamble:
Can I get around this problem or can the library be fixed to handle this situation?