Open meedstrom opened 4 days ago
Crime scene investigation:
I ran two instances of blog2epubgui at the same time, don't know if that's safe.
In one, I was downloading https://www.wisdomandwonder.com/ until post 1189 (skipping any more due to #32). This proceeded to finish cleanly.
In the other, I was downloading https://thingofthings.wordpress.com/.
It seemed to stop doing anything on post 701. Tail of terminal output had no real information:
[INFO ] 699. [DEBUG ] [https ]//thingofthings.wordpress.com:443 "GET /2015/01/08/what-its-like-having-borderline-personality-disorder/ HTTP/11" 200 None [INFO ] Downloading [Level 5 ] ascii passed initial chaos probing. Mean measured chaos is 0.000000 % [Level 5 ] ascii should target any language(s) of ['Latin Based'] [DEBUG ] [Encoding detection] ascii is most likely the one. [INFO ] 700. [DEBUG ] [https ]//thingofthings.wordpress.com:443 "GET /2015/01/05/on-weight-loss-and-health/ HTTP/11" 200 None [INFO ] Downloading [Level 5 ] ascii passed initial chaos probing. Mean measured chaos is 0.000000 % [Level 5 ] ascii should target any language(s) of ['Latin Based'] [DEBUG ] [Encoding detection] ascii is most likely the one. [INFO ] 701.
But I thought it might be a fluke, so I tried again, with only one instance of blog2epubgui.
First, I noticed that only 300 or so posts were cached (is there a cache limit?), but ok.
Then, it finished on post 737 and produced an epub -- but I saw some interesting output in the terminal.
(PS: the initial post on the blog is actually 735, see https://thingofthings.wordpress.com/page/74/, so not sure what are 736 and 737.)
Sidenote: in the finished epub, it seems that a few posts are out of order: after the final 2020 post (https://thingofthings.wordpress.com/2020/12/11/blaze-of-glory-a-useful-rpg-house-rule/), there is the 2014 post https://thingofthings.wordpress.com/2014/12/08/on-boston/ and 5 other posts, and only then the book ends.
But mostly it's in correct order.
(hey! now I'm noticing several other epubs are not quite in creation-order! at least https://kajsotala.fi or https://eukaryotewritesblog.com/, they're a bit random order)
Crime scene investigation:
I ran two instances of blog2epubgui at the same time, don't know if that's safe.
In one, I was downloading https://www.wisdomandwonder.com/ until post 1189 (skipping any more due to #32). This proceeded to finish cleanly.
In the other, I was downloading https://thingofthings.wordpress.com/.
It seemed to stop doing anything on post 701. Tail of terminal output had no real information:
But I thought it might be a fluke, so I tried again, with only one instance of blog2epubgui.
First, I noticed that only 300 or so posts were cached (is there a cache limit?), but ok.
Then, it finished on post 737 and produced an epub -- but I saw some interesting output in the terminal.
(PS: the initial post on the blog is actually 735, see https://thingofthings.wordpress.com/page/74/, so not sure what are 736 and 737.)
Sidenote: in the finished epub, it seems that a few posts are out of order: after the final 2020 post (https://thingofthings.wordpress.com/2020/12/11/blaze-of-glory-a-useful-rpg-house-rule/), there is the 2014 post https://thingofthings.wordpress.com/2014/12/08/on-boston/ and 5 other posts, and only then the book ends.
But mostly it's in correct order.
(hey! now I'm noticing several other epubs are not quite in creation-order! at least https://kajsotala.fi or https://eukaryotewritesblog.com/, they're a bit random order)