Data cleaning
This branch collects the data cleaning scripts in one place.
Proposed workflow
Create a new branch from this cleaning branch, add your scripts or changes, and then merge your branch back into this one.
If your scripts need new packages, remember to update the Poetry dependencies so that they are recorded for everyone else.
This branch will be merged into main at a later stage, once all data cleaning has been centralised. The goal is a single Jupyter notebook that walks through processing and cleaning the raw data sets and produces the data file used in the modelling.
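
One possible way to make individual scripts easy to combine in that notebook is for each script to expose a plain function that takes rows and returns cleaned rows. The sketch below is only an illustration: the function names (strip_whitespace, drop_incomplete, clean, to_csv), the step order, and the CSV input are all assumptions, not part of the existing scripts. It uses only the standard library and assumes no row has more fields than the header.

```python
import csv
import io


def strip_whitespace(rows):
    """Hypothetical step: trim surrounding whitespace from every field."""
    return [{key: (value or "").strip() for key, value in row.items()} for row in rows]


def drop_incomplete(rows):
    """Hypothetical step: drop rows that contain an empty field."""
    return [row for row in rows if all(value != "" for value in row.values())]


# Assumed ordering; the notebook would list the real steps here.
CLEANING_STEPS = [strip_whitespace, drop_incomplete]


def clean(raw_csv_text, steps=CLEANING_STEPS):
    """Parse CSV text and apply each cleaning step in order."""
    rows = list(csv.DictReader(io.StringIO(raw_csv_text)))
    for step in steps:
        rows = step(rows)
    return rows


def to_csv(rows):
    """Serialise cleaned rows back to CSV text for the modelling stage."""
    if not rows:
        return ""
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=list(rows[0].keys()), lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return out.getvalue()
```

With this shape, adding a new script to the notebook amounts to importing its function and appending it to the list of steps.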