Police-Data-Accessibility-Project / scrapers

Code relating to scraping public police data.
https://pdap.io
GNU General Public License v3.0
157 stars 33 forks source link

Muckrock Data Portal Scraper #227

Closed EvilDrPurple closed 10 months ago

EvilDrPurple commented 10 months ago

Fixes

Description

Docs

EvilDrPurple commented 10 months ago
  • Can you explain why someone would run muckrock_scraper instead of just using one of the templates with their chosen URL?

    • would recommend "FOIA" instead of "foi", pointing here at first mention

    • I can't think of a use case for scraping an individual file, if the user has a URL for the file's direct download. Maybe cut this one?

    • I think the use cases are good, but I might reformulate the readme so that it is a little more cut and dry. Assume the person running this barely knows how to use the terminal and needs it spelled out for them. Something like this:

I want to scrape all files from a FOIA request

1. Open `template/single_file_template.py` in a text editor.

2. Change the `url` variable to the URL of a FOIA request in MuckRock, and save your changes. Be sure it ends in #files!

3. In your terminal, `cd` into `template/`

4. Run `python3 single_file_template.py`

@josh-chamberlain We discussed this a little in private but I figured I'd lay it out here in case there's any confusion.

josh-chamberlain commented 10 months ago

@EvilDrPurple fantastic, thanks! passing to @mbodeantor