codeforboston / clean-slate-data

MIT License
27 stars 13 forks source link

Consolidate data pipeline scripts #180

Open laurafeeney opened 3 years ago

laurafeeney commented 3 years ago

]Create single notebook / script for the data flow from deidentified-but-still-raw data to ‘prosecution_charges_detailed’. Right now, prosecution_charges is both an input and output of two different scripts, without a clear indication of what should be run first. Would be helpful to just condense those steps into a single script.

The general pipeline is in the readme in the /notebooks page.

Thoughts on how to do this are also drafted here: Procedure for adding new MA prosecution data

linnalihe commented 2 years ago

@agathaalmunir , @mknotts623 , and @linnalihe will review the scripts and write out a summary what the scripts are doing / did. Date to work on this - Thursday 8/26/2021