Closed iannesbitt closed 1 year ago
The spider should be able to apply request delays and other relevant overrides from a node-specific settings file. For example, Harvard Dataverse wants a 4 second delay between requests, but most other sites can be crawled more rapidly.
Makes more sense as `settings.json`.
Followed the suggestion from https://docs.scrapy.org/en/latest/topics/settings.html#settings-per-spider to implement the settings changes in `Spider.from_crawler()`.
Tested and working.
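For anyone reading later, here is a minimal sketch of how per-node overrides from a `settings.json` file might be resolved before being handed to the spider. The file layout (`default` and `nodes` keys), the node key `harvard_dataverse`, the 0.5 second fallback, and the helper names are all assumptions made for illustration, not the layout actually used in this repository. `DOWNLOAD_DELAY` is the standard Scrapy setting for the delay between requests.

```python
import json
from pathlib import Path

# Hypothetical contents of settings.json, shown as a dict for illustration.
# Only the 4 second Harvard Dataverse delay comes from this issue.
EXAMPLE_CONFIG = {
    "default": {"DOWNLOAD_DELAY": 0.5},
    "nodes": {
        "harvard_dataverse": {"DOWNLOAD_DELAY": 4},
    },
}


def load_config(path):
    """Read a settings.json file and return its contents as a dict."""
    return json.loads(Path(path).read_text(encoding="utf-8"))


def resolve_overrides(config, node_id):
    """Merge the default settings with the overrides for one node.

    Node-specific values win over defaults; unknown nodes get defaults only.
    """
    merged = dict(config.get("default", {}))
    merged.update(config.get("nodes", {}).get(node_id, {}))
    return merged


if __name__ == "__main__":
    for node in ("harvard_dataverse", "some_other_node"):
        print(node, resolve_overrides(EXAMPLE_CONFIG, node))
```

Inside `from_crawler()`, each resolved pair could then be applied with `spider.settings.set(name, value, priority="spider")`, which is the pattern shown in the Scrapy documentation linked above.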
Related: 23