The original dataset.
Preprocesses the data (Task 1).
Contains the preprocessed dataset. Can be loaded with: dataset <- readRDS("titanic_cleaned.Rds")
Functions used for analysis (Task 2).
Internal helper functions that are only called from utils.R (Task 2).
Documentation for utils.R, helpers.R and preprocessing.R (analysis.R only calls functions) (Task 3).
Main file that calls preprocessing.R, then calls functions from utils.R to analyse the data (Task 4).
The plots that analysis.R produces as a pdf export.
It is enough to run analysis.R as this will call preprocessing.R and generate a new titanic_cleaned.Rds which it reads the data from. The working directory has to be set to WiSe-2023-24 (the base folder of the repository) beforehand so it can find the function scripts and the csv file.
We discussed our conclusions from the data in this issue.