City-Bureau / city-scrapers

Scrape, standardize and share public meetings from local government websites
https://cityscrapers.org
MIT License
329 stars 311 forks source link

🕷️ Fix spider: Chicago Commission on Human Relations #1126

Closed SimmonsRitchie closed 1 month ago

SimmonsRitchie commented 1 month ago

What's this PR do?

Fixes our Chicago Commission on Human Relations spider (aka. chi_human_relations).

Why are we doing this?

The spider broke due to changes on the pages it's targeting. The changes in this PR ensure the scraper runs without error.

Steps to manually test

After installing the project using pipenv:

  1. Activate the virtual environment:

    pipenv shell
  2. Run the spider:

    scrapy crawl chi_human_relations -O test_output.csv
  3. Monitor the stdout and ensure that the crawl proceeds without raising any errors. Pay attention to the final status report from scrapy.

  4. Inspect test_output.csv to ensure the data looks valid. I suggest opening a few of the URLs under the source column of test_output.csv and comparing the data for the row with what you see on the page.

Are there any smells or added technical debt to note?