18F / privacy-tools

GSA PII Dashboard
https://cg-9341b8ea-025c-4fe2-aa6c-850edbebc499.app.cloud.gov/site/18f/privacy-dashboard/
MIT License

Write scraper for SORN data #2

Closed ondrae closed 4 years ago

ondrae commented 4 years ago

What

Write code that will pull all SORN (System of Records Notice) data from each of the listed GSA SORNs.

How

  1. Gather the link to each individual SORN.
  2. Work out the XML link for each SORN from its HTML URL.
  3. Save the sections we care about to a CSV.
  4. Copy the CSV data to our online spreadsheet by hand. Automating this step would take a while.
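
Steps 2 and 3 could be sketched roughly as below. This is only an illustration, not the scraper that was written for this issue. It assumes the SORN links point at federalregister.gov pages whose full-text XML sits under a `full_text/xml/` path with the same date and document number, and that each SORN section is an `<HD>` heading followed by `<P>` paragraphs. Both assumptions need checking against the real notices. The function names and the section names passed in `wanted` are hypothetical.

```python
import csv
import re
import xml.etree.ElementTree as ET

# ASSUMPTION: HTML URLs look like
#   https://www.federalregister.gov/documents/YYYY/MM/DD/<doc-number>/<slug>
# and the matching XML is served at
#   https://www.federalregister.gov/documents/full_text/xml/YYYY/MM/DD/<doc-number>.xml
HTML_URL = re.compile(
    r"^https://www\.federalregister\.gov/documents/"
    r"(\d{4})/(\d{2})/(\d{2})/([^/]+)(?:/[^/]*)?$"
)


def xml_url_from_html_url(html_url):
    """Step 2: build the (assumed) XML URL from a SORN's HTML URL."""
    match = HTML_URL.match(html_url)
    if match is None:
        raise ValueError(f"Unrecognized SORN URL: {html_url}")
    year, month, day, doc_number = match.groups()
    return (
        "https://www.federalregister.gov/documents/full_text/xml/"
        f"{year}/{month}/{day}/{doc_number}.xml"
    )


def extract_sections(xml_text, wanted):
    """Step 3a: collect the paragraphs that follow each wanted heading.

    ASSUMPTION: headings are <HD> elements and body text is in <P> elements.
    `wanted` is a set of upper-case heading names without the trailing colon.
    """
    root = ET.fromstring(xml_text)
    sections = {}
    current = None  # heading we are currently collecting paragraphs for
    for element in root.iter():  # document order
        text = "".join(element.itertext()).strip()
        if element.tag == "HD":
            heading = text.rstrip(":").upper()
            current = heading if heading in wanted else None
            if current is not None:
                sections.setdefault(current, [])
        elif element.tag == "P" and current is not None:
            sections[current].append(text)
    return {name: " ".join(parts) for name, parts in sections.items()}


def write_csv(rows, fieldnames, out_file):
    """Step 3b: write one row per SORN; missing sections become empty cells."""
    writer = csv.DictWriter(out_file, fieldnames=fieldnames)
    writer.writeheader()
    for row in rows:
        writer.writerow(row)
```

Fetching the XML itself (for example with `urllib.request.urlopen`) is left out so the sketch runs without network access. The resulting CSV can then be pasted into the spreadsheet by hand, as step 4 describes.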

Why

To finish getting all of GSA's privacy data into one place.

Acceptance

Done when all of GSA's SORNs have been scraped and added to our spreadsheet.