City-Bureau / city-scrapers-cle

City Scrapers project for Cleveland
https://cityscrapers.org/
MIT License
15 stars 14 forks source link

Fix Cuyahoga elections spider #65

Closed SimmonsRitchie closed 10 months ago

SimmonsRitchie commented 10 months ago

What's this PR do?

Fixes our Cuyahoga County Board of Elections spider (aka. cuya_elections), which broke due to page structure and URL changes.

Why are we doing this?

We want working scrapers, of course 🤖 The changes in this PR include changes to URLs and certain parsing methods.

Steps to manually test

After installing the project using pipenv (see Readme):

  1. Activate the virtual environment:

    pipenv shell
  2. Run the spider:

    scrapy crawl cuya_elections -O test_output.csv
  3. Monitor the stdout and ensure that the crawl proceeds without raising any errors. Pay attention to the final status report from scrapy.

  4. Inspect test_output.csv to ensure the data looks valid. I suggest opening a few of the URLs under the source column of test_output.csv and comparing the data for that row with what you see.

Are there any smells or added technical debt to note?