City-Bureau / city-scrapers

Scrape, standardize and share public meetings from local government websites
https://cityscrapers.org
MIT License
334 stars, 311 forks

Spider: Detroit Department of Transportation #721

Closed: pjsier closed this issue 5 years ago

pjsier commented 5 years ago

URL: https://detroitmi.gov/Calendar-and-Events
Spider Name: det_transportation
Agency Name: Detroit Department of Transportation

See the contribution guide for information on how to get started.

This scraper covers part of the general Detroit website, which does a good job of standardizing meeting information, so most of the work is done for you in DetCityMixin.

To pull most of the information for a general detroitmi.gov scraper, go to the events page, filter for the agency in the Government select field, and copy the ID that follows term_node_tid_depth_1= in the results page URL.

Then, to pull any related documents, go to the documents page, look for relevant document categories in the Government autocomplete field, filter for them, and copy the ID from the field_department_target_id_1 parameter in the results page URL. On the initial filter, the value includes a label too (e.g. for the City Planning Commission it shows City+Planning+Commission+Minutes+(3761)), but you only need the number in parentheses.
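If copying the IDs by hand gets tedious, a small helper along these lines could pull them out of a pasted results page URL. This is only a sketch: extract_filter_id is a made-up name, and the URL paths and the events ID 1234 below are placeholders rather than real values. Only the two parameter names and the (3761) example come from the description above.

```python
import re
from typing import Optional
from urllib.parse import parse_qs, urlsplit


def extract_filter_id(url: str, param: str) -> Optional[str]:
    """Return the numeric ID stored in `param` of a results page URL."""
    values = parse_qs(urlsplit(url).query).get(param)
    if not values:
        return None
    value = values[0].strip()
    # Documents filter: "Some Label (3761)" -> keep only the number in parentheses.
    match = re.search(r"\((\d+)\)\s*$", value)
    if match:
        return match.group(1)
    # Events filter: the value is expected to be the bare ID already.
    return value if value.isdigit() else None


# Placeholder URLs (assumption): the paths and the events ID are invented.
events_url = "https://detroitmi.gov/events?term_node_tid_depth_1=1234"
docs_url = (
    "https://detroitmi.gov/documents"
    "?field_department_target_id_1=City+Planning+Commission+Minutes+(3761)"
)

events_id = extract_filter_id(events_url, "term_node_tid_depth_1")
docs_id = extract_filter_id(docs_url, "field_department_target_id_1")
```

parse_qs decodes the + signs to spaces, so the label and the parenthesized number arrive as one string and the regular expression only has to look at the end of it.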

Once you've included that information, you can override methods like _parse_title if the defaults aren't right, and you'll still need to implement _parse_classification. See det_city_planning or det_zoning_appeals for some examples.
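To illustrate the override pattern only, here is a self-contained sketch. DetCityMixinSketch is a stand-in written for this example, not the real DetCityMixin. The attribute names agency_cal_id and agency_doc_id, the "0000" IDs, the item dictionary shape, the title prefix handling, and the classification strings are all assumptions. Check det_city_planning or det_zoning_appeals for what the real mixin expects.

```python
from typing import Dict


class DetCityMixinSketch:
    """Stand-in for the real mixin (assumption: the real one differs in detail)."""

    agency_cal_id = ""  # hypothetical: ID from the events page filter
    agency_doc_id = ""  # hypothetical: ID from the documents page filter

    def _parse_title(self, item: Dict[str, str]) -> str:
        # Default behavior: use the title as listed on the site.
        return item.get("title", "").strip()

    def _parse_classification(self, item: Dict[str, str]) -> str:
        # No sensible default, so each spider has to provide this.
        raise NotImplementedError("each spider must classify its meetings")

    def parse_meeting(self, item: Dict[str, str]) -> Dict[str, str]:
        return {
            "title": self._parse_title(item),
            "classification": self._parse_classification(item),
        }


class DetTransportationSketch(DetCityMixinSketch):
    name = "det_transportation"
    agency = "Detroit Department of Transportation"
    agency_cal_id = "0000"  # placeholder, not the real ID
    agency_doc_id = "0000"  # placeholder, not the real ID

    def _parse_title(self, item: Dict[str, str]) -> str:
        # Optional override: strip a hypothetical agency prefix from the default title.
        title = super()._parse_title(item)
        prefix = "DDOT "
        return title[len(prefix):] if title.startswith(prefix) else title

    def _parse_classification(self, item: Dict[str, str]) -> str:
        # Required: made-up rule that labels board meetings by title.
        if "board" in item.get("title", "").lower():
            return "Board"
        return "Not classified"


meeting = DetTransportationSketch().parse_meeting({"title": "DDOT Board Meeting"})
```

The point is the split of responsibilities: the mixin supplies defaults, the spider sets its two IDs, overrides only what is wrong, and always supplies the classification.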

pjsier commented 5 years ago

@nbearman if you're interested in doing this it's all yours!

nbearman commented 5 years ago

I gave this one a go and just made a pull request.

pjsier commented 5 years ago

Closed by #731, thanks @nbearman!