mailgun / talon

Apache License 2.0
1.27k stars 285 forks source link

Fix: Unicode strings with encoding declaration are not supported. Please use bytes input or XML fragments without declaration. #161

Closed glaand closed 5 years ago

glaand commented 6 years ago

Cut out first some encoding html tags such as xml and doctype for avoiding conflict with unicode decoding.

Some HTML Content may contain XML Tags and on extracting it, the HTMLParser triggers the following error: "Unicode strings with encoding declaration are not supported. Please use bytes input or XML fragments without declaration.".

mailgun-ci commented 6 years ago

Can one of the admins verify this patch?

glaand commented 6 years ago

up

obukhov-sergey commented 5 years ago

@mailgun-ci test this please