pgh-public-meetings / city-scrapers-pitt

Pittsburgh City Scrapers: sourcing public meetings in Pittsburgh
https://pgh-public-meetings.github.io/events/
MIT License
19 stars 66 forks source link

Update dependencies, move to OCD pipeline #81

Closed pjsier closed 4 years ago

pjsier commented 4 years ago

This shouldn't be merged until we've had a chance to talk about it some more.

This PR updates some of the project dependencies and moves the output from using the JSCalendar pipeline (now deprecated) to the Open Civic Data specification pipeline based on the OCD spec for events.

The Open Civic Data specification is a standard for representing information about government, from elected officials to government meetings. Early on in the scrapers project we were trying to find a standardized output for scraped results, and had initially used something based off the the OCD spec with some changes. JSCalendar was something we tried for a bit, but after doing some more research it seems like the OCD events schema is the most adopted standard for representing government meetings. The primary benefit of this is not reinventing the wheel and going with something created by other organizations facing similar problems of representing government information.

This change ends up being pretty minimal for scrapers (none of them are changed here) since it's only modifying the output files, which contain largely the same information.

It also updates some platform-specific installs for the Pipfile so that pywin32 is automatically installed on Windows and pathlib2 is installed in Python 3.5. Each of these were pain points for troubleshooting in the past, and this should resolve some confusion there.

Additionally, it adds a pipenv script for running the style command so that you can run pipenv run style instead of finding the specific command.