UBC-MDS / Credit_Card_Default_Prediction_Group13

Group 13 of DSCI 522 in the year 2022-23
https://ubc-mds.github.io/Credit_Card_Default_Prediction_Group13/doc/report.html
Other
2 stars 3 forks source link

Script usage #18

Closed ChesterAiGo closed 1 year ago

ChesterAiGo commented 1 year ago

To download our dataset, do:

python3 data_downloader.py https://archive.ics.uci.edu/ml/machine-learning-databases/00350/default%20of%20credit%20card%20clients.xls ../data/raw/

ChesterAiGo commented 1 year ago

For milestone 2, the usage of the these scripts (data_downloader.py, data_preprocessor.py, data_eda.py) would be:

python3 data_downloader.py https://archive.ics.uci.edu/ml/machine-learning-databases/00350/default%20of%20credit%20card%20clients.xls ../data/raw/

python3 data_preprocessor.py ../data/raw/raw_data.xls ../data/processed/

python3 data_eda.py ../data/processed/train_raw.csv ../data/eda_results/

Note that the UCI repo is down at the time this issue is posted so the first command would probably not work until it is back online.

ChesterAiGo commented 1 year ago

Starting in milestone 3 please run scripts in the root folder of the repo:

python3 src/data_downloader.py https://archive.ics.uci.edu/ml/machine-learning-databases/00350/default%20of%20credit%20card%20clients.xls ./data/raw/

python3 src/data_preprocessor.py ./data/raw/raw_data.xls ./data/processed/

python3 src/data_eda.py ./data/processed/train_raw.csv ./data/eda_results/

python3 src/data_analysis.py ./data/processed/train_cleaned.csv ./data/processed/test_cleaned.csv ./data/results/

ChesterAiGo commented 1 year ago

Commands for docker-related operations:

docker build --tag chesteraigo/mds_522_credit_card_group_13 .

docker push chesteraigo/mds_522_credit_card_group_13:latest

docker run --rm -v /$(pwd):/home/Credit_Card_Default_Prediction_Group13 chesteraigo/mds_522_credit_card_group_13:latest make -C '/home/Credit_Card_Default_Prediction_Group13' all