After running the preprocessing script, it would be useful to only output the files we need to do the training, without all the temporary files that were generated by the preprocessing.
This could be done by creating another folder called data_preprocessed_clean inside the preprocessing script.
After running the preprocessing script, it would be useful to only output the files we need to do the training, without all the temporary files that were generated by the preprocessing.
This could be done by creating another folder called
data_preprocessed_clean
inside the preprocessing script.