Closed Davidgithub1 closed 4 years ago
How would you define "a good spider tool"?
For the general case, HTTrack seems to be quite a good one.
I've used HTTrack, but it fails to download many websites that ScrapbookX can.
Which websites? Please be more specific.
That's why I use ScrapbookX!
SBX does not support preserving the original file structure, which is what some users care about most. On the other hand, HTTrack does not support advanced DOM rewriting, such as not saving scripts, videos, etc. It really depends on your use case, so just choose the tool that works best for you.
ScrapbookX works best for me. But I'm afraid websites will stop supporting old versions of Firefox. Can you create a version that uses Headless Chrome to save an entire website (follow links)?
Handling headless Chrome is too much work; we are unlikely to do that.
ScrapBook X already has this feature. As for this feature in WebScrapBook, we'll track it in its source repo.
What's a good spider tool? None of the tools I've used could download the websites that ScrapbookX could.