Closed vortex14 closed 9 years ago
I can't understand, could you please explain it more specific?
From Example Scrapy frameworks: rules = [Rule(LinkExtractor(allow=['/*']), callback="parse_page", follow=True)]
*follow=True — follow until there are links to pages.
How I can do with pyspider too?
def parse_page(self, response):
for each in response.doc('a[href^="http"]').items():
self.crawl(each.attr.href, callback=self.parse_page)
Thanks
How to get around the site until it has links? Can show a simple example?