nhs-bnssg-analytics / d_and_c

Scoping the possibility of predicting performance from demand and capacity metrics
1 stars 0 forks source link

Future Performance Tool

This repository contains the data download, data manipulation and the modelling that underpins the Future Performance Tool.

See a description of the project and the findings as it progresses here.

Using this repository

The repository is managed using renv. The aim is that anyone that wants to reproduce this analysis uses the same version of R that performed this analysis. After cloning and opening the project for the first time, run renv::restore() in the console to install all the versions of the packages used in this work. You may need to restart RStudio after doing this.

The file R/02_data.R should download and clean all of the data files from the internet into a tidy format. The files will be summarised into single files in a folder called data. This step takes up to 30 minutes (depending on network connection speeds) to run as the source files are numerous and some are quite large. The source files are stored within the data-raw folder to allow users to refer back to them.

A lot of the source files are spreadsheets, and generally they are created manually by different organisations and uploaded to a server for public use. As a result, filenames, sheet names and sheet formatting can be inconsistent. The code in this project attempts to negate the inconsistency, but there will be occasions where an unpredictable manual step has occurred that the code hasn’t accounted for, and will result in an error. Please report these errors through the issues page.