ourcanadian / ocse-core

Core of OurCanadian Search Engine

Enable Scrapy Settings #5

Open rylancole opened 4 years ago

rylancole commented 4 years ago

Path: ocse-core/coast_to_coast/coast_to_coast/settings.py

Most settings in this file are commented out; the only one I have enabled myself is USER_AGENT.

# Crawl responsibly by identifying yourself (and your website) on the user-agent
USER_AGENT = 'OurCanadianCrawler (+http://wiki.ourcanadian.ca)'

We need to go through each of these settings, work out how it applies to us, and choose the values that best optimize our crawling. The file lists all the relevant documentation links.
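As a starting point for that review, here is a rough sketch of the settings that a Scrapy-generated settings.py usually ships commented out. The setting names are standard Scrapy settings, but every value below (other than USER_AGENT, which is copied from above) is an illustrative assumption and not something we have tested or agreed on.

```python
# Sketch of candidate values for coast_to_coast/settings.py.
# All values except USER_AGENT are assumptions for discussion.

# Already enabled (copied from the current file).
USER_AGENT = 'OurCanadianCrawler (+http://wiki.ourcanadian.ca)'

# Respect robots.txt rules on the sites we crawl.
ROBOTSTXT_OBEY = True

# Upper bound on concurrent requests across all domains.
CONCURRENT_REQUESTS = 16

# Seconds to wait between requests to the same site, and a
# per-domain concurrency cap so no single site is hit too hard.
DOWNLOAD_DELAY = 1.0
CONCURRENT_REQUESTS_PER_DOMAIN = 4

# A search crawler likely does not need session cookies.
COOKIES_ENABLED = False

# AutoThrottle adjusts the delay based on observed latency;
# DOWNLOAD_DELAY acts as the lower bound while it is enabled.
AUTOTHROTTLE_ENABLED = True
AUTOTHROTTLE_START_DELAY = 5
AUTOTHROTTLE_MAX_DELAY = 60
AUTOTHROTTLE_TARGET_CONCURRENCY = 1.0
AUTOTHROTTLE_DEBUG = False

# Local HTTP cache, mainly useful during development to avoid
# re-downloading pages. 0 means cached responses never expire.
HTTPCACHE_ENABLED = True
HTTPCACHE_EXPIRATION_SECS = 0
HTTPCACHE_DIR = 'httpcache'
```

The politeness-related settings (ROBOTSTXT_OBEY, DOWNLOAD_DELAY, the per-domain cap, and AutoThrottle) are probably the ones to decide first, since they affect the sites we crawl; the cache is mostly a development convenience and may need to be off in production.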