Closed iiiMatt closed 9 months ago
I think it's good. We could think about passing only the attributes of the dataset to the functions so that the dataset doesn't have to be copied every time we call a function. Function calls in generate_preprocessed_data
would then look like dataset[["Salutation"]] <- extract_salutations(dataset[["Name"]])
.
I think it's good. We could think about passing only the attributes of the dataset to the functions so that the dataset doesn't have to be copied every time we call a function. Function calls in
generate_preprocessed_data
would then look likedataset[["Salutation"]] <- extract_salutations(dataset[["Name"]])
.
In this case it would be unnecessary, because the dataset is so small, but good remark!
Update to the preprocessing pipeline and definitions for
extract_salutations
andinfer_age
.