To get there, we're working with federal privacy offices and structuring data from PDFed privacy-related compliance documents. By structuring data, we're equipping privacy offices with the ability to more quickly search through these documents, reducing unnecessary manual practices and laying a foundation for them to more easily collaborate with engineering teams.
This project is funded by 10x.
Privacy Dashboard development repo here
Our phase three work is happening in partnership with the GSA's Privacy Office.
The scraping code is written in Python and runs locally. We recommend creating a virtual environment using virtualenv to install and manage the required Python libraries. Run these commands in the repository directory on your machine to create a local virtual environment, start it, and then install all requirements.
virtualenv .venv
source .venv/bin/activate
pip install -r requirements.txt
Running python sorn_scraper.py
does the following:
gsa_sorns.csv
with one row per system.