Closed ChesterAiGo closed 1 year ago
For milestone 2, the usage of these scripts (data_downloader.py, data_preprocessor.py, data_eda.py) is:
python3 data_downloader.py https://archive.ics.uci.edu/ml/machine-learning-databases/00350/default%20of%20credit%20card%20clients.xls ../data/raw/
python3 data_preprocessor.py ../data/raw/raw_data.xls ../data/processed/
python3 data_eda.py ../data/processed/train_raw.csv ../data/eda_results/
Note that the UCI repo is down at the time of posting, so the first command will probably not work until it is back online.
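Since the download depends on the UCI server being up, it may help to check the URL before running data_downloader.py. The snippet below is only a rough sketch using the Python standard library. The helper name `is_reachable` and its `opener` parameter are hypothetical and are not part of our scripts.

```python
import urllib.error
import urllib.request

# Dataset URL copied from the commands in this issue.
DATA_URL = (
    "https://archive.ics.uci.edu/ml/machine-learning-databases/"
    "00350/default%20of%20credit%20card%20clients.xls"
)


def is_reachable(url, timeout=10, opener=urllib.request.urlopen):
    """Return True if the URL answers with a 2xx status, False otherwise.

    `opener` is a hypothetical hook so the check can be exercised
    without network access; by default it is urllib.request.urlopen.
    """
    try:
        with opener(url, timeout=timeout) as response:
            return 200 <= response.status < 300
    except (urllib.error.URLError, OSError):
        # Covers DNS failures, refused connections, timeouts and HTTP errors.
        return False
```

If `is_reachable(DATA_URL)` returns False, the download step can be skipped and retried later, provided a copy of raw_data.xls is already in the raw data folder.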
Starting in milestone 3, please run the scripts from the root folder of the repo:
python3 src/data_downloader.py https://archive.ics.uci.edu/ml/machine-learning-databases/00350/default%20of%20credit%20card%20clients.xls ./data/raw/
python3 src/data_preprocessor.py ./data/raw/raw_data.xls ./data/processed/
python3 src/data_eda.py ./data/processed/train_raw.csv ./data/eda_results/
python3 src/data_analysis.py ./data/processed/train_cleaned.csv ./data/processed/test_cleaned.csv ./data/results/
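The four milestone 3 commands above have to run in this order, since each step reads the output of the previous one. As an illustration only, they could be chained from Python as sketched below. The function names `pipeline_commands` and `run_pipeline` are hypothetical, and this is not meant to replace the Makefile used in the Docker command.

```python
import subprocess

# Dataset URL copied from the commands in this issue.
DATA_URL = (
    "https://archive.ics.uci.edu/ml/machine-learning-databases/"
    "00350/default%20of%20credit%20card%20clients.xls"
)


def pipeline_commands(python="python3"):
    """Return the milestone 3 commands, in order, as argument lists."""
    return [
        [python, "src/data_downloader.py", DATA_URL, "./data/raw/"],
        [python, "src/data_preprocessor.py",
         "./data/raw/raw_data.xls", "./data/processed/"],
        [python, "src/data_eda.py",
         "./data/processed/train_raw.csv", "./data/eda_results/"],
        [python, "src/data_analysis.py",
         "./data/processed/train_cleaned.csv",
         "./data/processed/test_cleaned.csv", "./data/results/"],
    ]


def run_pipeline(commands, runner=subprocess.run):
    """Run each command in turn; check=True stops at the first failure."""
    for command in commands:
        runner(command, check=True)
```

Calling `run_pipeline(pipeline_commands())` from the root folder of the repo would run the same four steps as the commands listed above and raise `subprocess.CalledProcessError` if any step exits with a non-zero status.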
Commands for docker-related operations:
docker build --tag chesteraigo/mds_522_credit_card_group_13 .
docker push chesteraigo/mds_522_credit_card_group_13:latest
docker run --rm -v /$(pwd):/home/Credit_Card_Default_Prediction_Group13 chesteraigo/mds_522_credit_card_group_13:latest make -C '/home/Credit_Card_Default_Prediction_Group13' all
To download our dataset, run:
python3 data_downloader.py https://archive.ics.uci.edu/ml/machine-learning-databases/00350/default%20of%20credit%20card%20clients.xls ../data/raw/