Open oderyn opened 1 year ago
You're passing the URL of page 19 (https://forum.com/forums/showthread.php?12345-title-of-the-post/page19
). Pass the URL of the first page instead: https://forum.com/forums/showthread.php?12345-title-of-the-post
.
Ah! That makes sense.
...
Just tried it. Unfortunately, I got the same result. To be clear, I tried the following:
forum-dl -g --no-boards --no-files https://forum.com/forums/showthread.php?12345-title-of-the-post/
and
forum-dl -g --no-boards --no-files https://forum.com/forums/showthread.php?12345-title-of-the-post/page1
Any other pointers? Otherwise, I can do some more troubleshooting on my own a little later.
Given that it downloads only one thread page, most likely the CSS selector Forum-dl uses to find the next page link is not working on this website in particular (perhaps it's using an older vBulletin version, or is heavily themed). I should be able to fix this if you give me the real link to this forum.
Here are are a couple that I am having issue with:
I am pretty sure I came across a few others that forum-dl wasn't grabbing. I'll need to dig back through my notes as I was testing a variety of forums to see how it performed. I know there was a Xenforo forum, too.
If you're grabbing the pagination via CSS, that's definitely something I could do -- in theory, at least. Have you considered adding these selectors to a config file that users could extend? Or, if you point me to where you are adding it in the code, I could add new ones as I come across them and submit pull requests -- if you're at all interested in that sort of help. I hate to pester you with requests every time I come across something that doesn't work.
BTW, this is a great tool. It does most of what I've been looking for in regards to grabbing full forum threads.
Thanks!
Have you considered adding these selectors to a config file that users could extend?
Yes. Actually, I intend to make it available through a command line switch. But I haven't started working on it yet because I had to focus on another project.
Or, if you point me to where you are adding it in the code, I could add new ones as I come across them and submit pull requests -- if you're at all interested in that sort of help. I hate to pester you with requests every time I come across something that doesn't work.
Pull requests are very much welcome. Vbulletin's next thread page CSS selector is here. Should be trivial to patch.
I've reverted some recent experimental and broken code from develop
so that you won't have to bother with it.
BTW, this is a great tool. It does most of what I've been looking for in regards to grabbing full forum threads.
Thanks!
You're welcome.
Do you have any intention to work on this issue in the near time (as you implied you're interested in that)? I would like to release 0.3.1 in the next few weeks with this bug fixed, would be good to know whether I should wait or fix this myself.
@mikwielgus Apologies for not getting back to you sooner. I thought I could fix the issue, but it was beyond my programming skill level (novice, for sure).
I get the same error, but I am trying to save down the page "view all posts by user"
not the normal forum page itself. its still posts but just another path.
Thanks for making this! Just wanted to add i'm experiencing the same error as the OP with: https://www.rcgroups.com/forums/showthread.php?1074181-Lipo-Storage-Voltage-and-Dead-Battery-Information
I would like to use forum-dl to generate a list of links from a given forum that I could then send to SingleFile to generate HTML pages of all posts in the thread.
I am using this command:
When I run this command for vbulletin, it does not generate a list of all 19 pages in the thread as I would expect to happen -- just the one page that I entered. Like so:
This happens no matter which page in the forum I pass into forum-dl.
When I add -v to the above command, I get the following output:
I tried running the command to output the files to a directory:
I got the following error:
--
Result of pip3 --version
Result of uname -a
Result of cat /etc/os-release