There are at least two problems that need be addressed with alle_improvments:
The domain name needs to be changed to https://www.alleghenycounty.us/economic-development/authorities/meetings-reports/aim/meetings.aspx. The old domain name doesn't return a 404 or a 403. Instead it never responds (!), causing the scraper to get stuck waiting.
When I run pipenv run scrapy crawl alle_improvements -o aim.json from root I should see no errors and at least one event (there appears to be twelve currently).
Here is the error I experienced after fixing the first problem. Maybe start by figuring out why tmp==None.
time_str = self._parse_start_time(data)
File "/Users/ben/Desktop/city/city-scrapers-pitt/city_scrapers/spiders/alle_improvements.py", line 92, in _parse_start_time
return " ".join(tmp.split())
AttributeError: 'NoneType' object has no attribute 'split'
There are at least two problems that need be addressed with
alle_improvments
:https://www.alleghenycounty.us/economic-development/authorities/meetings-reports/aim/meetings.aspx
. The old domain name doesn't return a 404 or a 403. Instead it never responds (!), causing the scraper to get stuck waiting.pipenv run scrapy crawl alle_improvements -o aim.json
from root I should see no errors and at least one event (there appears to be twelve currently).Here is the error I experienced after fixing the first problem. Maybe start by figuring out why
tmp==None
.