There is an issue in naming between lead_preprocessed_data and historical_preprocessed_data. We must distinguish between both. Currently the code only names the output as preprocessed_data without any distinction.
Acceptance Criteria
"data preprocessing" step can be applied on both lead and historical data but will use a different output file name for each case
Training uses only historical data
Merchant Size Prediction should use lead_preprocessed_data for prediction
All the previous should work with both DB types: Local and S3
There is an issue in naming between
lead_preprocessed_data
andhistorical_preprocessed_data
. We must distinguish between both. Currently the code only names the output aspreprocessed_data
without any distinction.Acceptance Criteria
lead_preprocessed_data
for predictionAll the previous should work with both DB types: Local and S3