salsadigitalauorg / merlin-framework

Merlin - migration framework
GNU General Public License v3.0
17 stars 3 forks source link

Redirects do not resepect rules in `shouldCrawl` #128

Closed stooit closed 4 years ago

stooit commented 4 years ago

Describe the bug Crawlers follow_redirects option means links that redirect are included, which is good. However redirects that end up on external domains should not be included in the crawl.

Sample configuration

Expected behavior Redirects should be restricted to the rules applied elsewhere (e.g shouldCrawl)

stooit commented 4 years ago

Cannot replicate