Updated preprocessing code to work with freshly downloaded WOD data. This was necessary because the code previously assumed that the data is arranged per year, which is not the case. The updated system does the following:
* reads in all the data (higher memory footprint than previously)
* preprocesses variables where needed
* creates derived variables such as year, model, manufacturer
* creates a single data frame
* subsets for each year present and writes out data as a CSV file
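
The combine-then-split steps above can be sketched roughly as follows. This is a minimal Python illustration using only the standard library, not the actual preprocessing code: the function name `split_by_year`, the `date` field and its ISO format (e.g. "2011-03-27") are all assumptions made for the example.

```python
import csv
from collections import defaultdict
from pathlib import Path


def split_by_year(records, out_dir, date_field="date"):
    """Group already-combined records by year and write one CSV per year.

    `records` is an iterable of dicts (hypothetical layout); `date_field`
    is assumed to hold an ISO date string whose first four characters
    are the year.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)

    by_year = defaultdict(list)
    for rec in records:
        row = dict(rec)  # copy so the caller's data is not modified
        row["year"] = int(row[date_field][:4])  # derived variable
        by_year[row["year"]].append(row)

    written = {}
    for year, rows in sorted(by_year.items()):
        # Union of all column names, kept in first-seen order
        fieldnames = list(dict.fromkeys(k for r in rows for k in r))
        path = out_dir / f"{year}.csv"
        with path.open("w", newline="") as fh:
            writer = csv.DictWriter(fh, fieldnames=fieldnames)
            writer.writeheader()
            writer.writerows(rows)
        written[year] = path
    return written
```

Because every record is held in memory before anything is written, this shows the same trade-off noted above: a higher memory footprint in exchange for not depending on how the input files are arranged.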