ArchiveTeam / grab-site

The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
Other
1.31k stars 129 forks source link

--no-offsite-links doesn't work #183

Closed tripleo1 closed 3 years ago

ivan commented 3 years ago

You might be seeing page requisites (images, CSS, etc) being grabbed rather than offsite links to a depth of one. Do you have a log or repro steps?

tripleo1 commented 3 years ago

Does it do offsite links to a depth of one or none at all. This may be what is confusing me.

TheTechRobo commented 3 years ago

It shoudln't be doing offsite links at all. What you see is most likely, as ivan said, page requisites (such as stylesheets, photos, etc) are most likely what you're seeing

ivan commented 3 years ago

If this is still an issue, please post a log and all the arguments started grab-site with.