alexadam / save-as-ebook

Save a web page/selection as an eBook (.epub format) - a Chrome/Firefox/Opera Web Extension
MIT License
1.1k stars 70 forks source link

HTML markup in output epub file #23

Open x5ERZzk opened 6 years ago

x5ERZzk commented 6 years ago

Hello, in latest Firefox (archlinux, clean profile, 59.0.1, the only extension is "Save as eBook"), one website is saved as HTML markup. This is https://www.polskieradio.pl/39/156/Artykul/1068055,Stefan-Kisielewski-muzyk-polityk-felietonista It doesn't matter if I save the full page or a selection of it. Other websites I tested work fine.

Many, many thanks for this wonderful extension!

alexadam commented 6 years ago

Hi, Thanks for reporting this. I tested the link in the example and it does not work with the html parsers I currently use. I checked the source code with a html validator and it has a lot of errors: https://validator.w3.org/nu/?doc=https%3A%2F%2Fwww.polskieradio.pl%2F39%2F156%2FArtykul%2F1068055%2CStefan-Kisielewski-%25E2%2580%2593-muzyk-polityk-felietonista Until I find & integrate a better html parser I won't be able to fix this. Sorry for the inconvenience.

x5ERZzk commented 6 years ago

Thank you very much for investigating the issue. I'm sorry for the trouble it caused.

Lambik commented 6 years ago

Another site with this same issue: https://markmanson.net/this-is-water (and the other articles on the site)

duynguyenhoang commented 5 years ago

I have the same issue. I am trying to find a better way to overcome this issue