bohdanbobrowski / blog2epub

Convert blog (blogspot.com, wordpress.com...) or any website to epub using GUI, CLI or Python.
https://github.com/bohdanbobrowski/blog2epub
MIT License
40 stars 6 forks source link

[Clock ] Warning, too much iteration done before the next frame. #33

Open meedstrom opened 4 days ago

meedstrom commented 4 days ago

Crime scene investigation:

I ran two instances of blog2epubgui at the same time, don't know if that's safe.

In one, I was downloading https://www.wisdomandwonder.com/ until post 1189 (skipping any more due to #32). This proceeded to finish cleanly.

In the other, I was downloading https://thingofthings.wordpress.com/.

It seemed to stop doing anything on post 701. Tail of terminal output had no real information:

[INFO   ] 699. 
[DEBUG  ] [https          ]//thingofthings.wordpress.com:443 "GET /2015/01/08/what-its-like-having-borderline-personality-disorder/ HTTP/11" 200 None
[INFO   ] Downloading
[Level 5           ] ascii passed initial chaos probing. Mean measured chaos is 0.000000 %
[Level 5           ] ascii should target any language(s) of ['Latin Based']
[DEBUG  ] [Encoding detection] ascii is most likely the one.
[INFO   ] 700. 
[DEBUG  ] [https          ]//thingofthings.wordpress.com:443 "GET /2015/01/05/on-weight-loss-and-health/ HTTP/11" 200 None
[INFO   ] Downloading
[Level 5           ] ascii passed initial chaos probing. Mean measured chaos is 0.000000 %
[Level 5           ] ascii should target any language(s) of ['Latin Based']
[DEBUG  ] [Encoding detection] ascii is most likely the one.
[INFO   ] 701. 

But I thought it might be a fluke, so I tried again, with only one instance of blog2epubgui.

First, I noticed that only 300 or so posts were cached (is there a cache limit?), but ok.

Then, it finished on post 737 and produced an epub -- but I saw some interesting output in the terminal.

(PS: the initial post on the blog is actually 735, see https://thingofthings.wordpress.com/page/74/, so not sure what are 736 and 737.)

Sidenote: in the finished epub, it seems that a few posts are out of order: after the final 2020 post (https://thingofthings.wordpress.com/2020/12/11/blaze-of-glory-a-useful-rpg-house-rule/), there is the 2014 post https://thingofthings.wordpress.com/2014/12/08/on-boston/ and 5 other posts, and only then the book ends.

But mostly it's in correct order.

(hey! now I'm noticing several other epubs are not quite in creation-order! at least https://kajsotala.fi or https://eukaryotewritesblog.com/, they're a bit random order)