vanegomez closed this issue 7 years ago
In spidr, links are the String version of the full URL. You appear to want to ignore links based on the path. Maybe something like:
spider.ignore_urls_like { |url| url.path.start_with?('/partners/') }
I should probably add ignore_paths_like to cover that use-case.
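To see what that block would and would not match, here is a minimal sketch that uses only Ruby's standard `uri` library. The helper name `partners_path?` is made up for illustration and is not part of spidr. It assumes the block passed to ignore_urls_like receives a URI-like object that responds to `path`, as the snippet above implies.

```ruby
require 'uri'

# Hypothetical helper (not part of spidr): true when the URL's path
# starts with /partners/, which is the same test as the block above.
def partners_path?(url)
  URI(url.to_s).path.start_with?('/partners/')
end

# Assumed usage with spidr, mirroring the snippet above:
#   spider.ignore_urls_like { |url| partners_path?(url) }

puts partners_path?('http://www.mysite.com/partners/resellers')
puts partners_path?('http://www.mysite.com/about')
```

Note that the prefix includes the trailing slash, so a link to /partners itself (no trailing slash) would not be ignored. Use '/partners' as the prefix if that page should be skipped too.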
@postmodern Thank you so much for answering.
Is it possible to follow external links and check if they are broken?
You would have to explicitly call spider.get_page and check the responses, since the spider won't automatically follow off-site links.
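A rough sketch of that kind of check, written against the standard library only so it runs without spidr. `Net::HTTP.get_response` stands in for spider.get_page here, and the helper names (`external_link?`, `broken_status?`, `broken_link?`) are made up for illustration. Treating any 4xx or 5xx status as "broken" is also an assumption, so adjust it to taste.

```ruby
require 'uri'
require 'net/http'

# Hypothetical helper: true when the link has a host and that host
# differs from the site being spidered. Relative links are not external.
def external_link?(link, site_host)
  host = URI(link.to_s).host
  !host.nil? && host != site_host
end

# Hypothetical helper: treats 4xx and 5xx status codes as broken.
def broken_status?(code)
  code.to_i >= 400
end

# Fetches the link once (redirects are not followed) and reports
# whether it looks broken. DNS, connection, and timeout errors
# are counted as broken too.
def broken_link?(link)
  response = Net::HTTP.get_response(URI(link.to_s))
  broken_status?(response.code)
rescue SocketError, SystemCallError, Timeout::Error
  true
end
```

While spidering, you would collect the links for which `external_link?` is true and then run each through `broken_link?`, or through spider.get_page and a check of the returned page's status.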
thank you!
cool gem!
I'm trying to make the spider ignore /partners and everything under it on my site (for example www.mysite.com/partners/resellers), but it is still visiting those links.