kemayo / leech

Turn a story on certain websites into an ebook for convenient reading
MIT License
154 stars 24 forks source link

Certain generated epubs do not read with google books #41

Open mrothbart opened 3 years ago

mrothbart commented 3 years ago

Heretical edge, Vacant Throne, Fifth Defiance, saga of soul all produce ebooks that can be read with calibre but not google books. I am unclear as to why that is.

Deathworlders simply errors out without producing anything.

kemayo commented 3 years ago

I don't have easy access to google books to test this -- is it very unfriendly to epubs with minor validation errors? The approach taken here of downloading arbitrary HTML and cramming it into an epub makes it easy to wind up with e.g. unsupported properties.

kemayo commented 3 years ago

Saga of Soul had some obvious issues with namespaced elements, so I added some filtering for that. Vacant Throne had some messed up indexing which might have been confusing matters, I guess? So I fixed its example somewhat.

mrothbart commented 3 years ago

What I did to create a valid Vacant THrone epub was to save the HTML in a file, strip all href and tags besides for the ToC, then serve it locally from a WSL server and point the script at it. It worked great, but obviously not scalable.