DSCI-310-2024 / DSCI-310_Group-Project_Group8_Purchasing-Intent-Analysis

MIT License

EDA + Data Analysis Issue (fix feedback issues from Milestone 1) #30

Closed sanash43 closed 5 months ago

sanash43 commented 5 months ago

All these changes are in the Milestone1.ipynb file in the project folder. I have implemented changes to the following scripts:

sanash43 commented 5 months ago

To test the scripts that have already been fixed:

Make sure you're running the container: first `docker-compose pull`, then `docker-compose run --rm project-image bash`

Your command line should look like: jovyan@5c76e9165ffb:

Navigate to the src folder: `cd src`

To test the scripts, enter the following into the command line in order:

nourshawk commented 5 months ago

updated commands to test the scripts:

read_data.py: `python read_data.py 468 ../data/raw_features.csv ../data/raw_targets.csv`

cleaning.py: `python cleaning.py ../data/x_train.csv ../data/y_train.csv ../data/cleaned_features.csv ../data/cleaned_targets.csv`

data_split.py: `python data_split.py ../data/cleaned_features.csv ../data/cleaned_targets.csv ../data/x_train.csv ../data/x_test.csv ../data/y_train.csv ../data/y_test.csv`

eda_figures.py: `python eda_figures.py ../data/cleaned_features.csv ../data/cleaned_targets.csv ../img/figure`

preprocessing.py: `python preprocessing.py ../data/x_train.csv ../data/x_test.csv ../data/y_train.csv ../data/y_test.csv ../data/preprocessed_train_data.csv ../data/preprocessed_test_data.csv`

analysis.py: `python analysis.py ../data/preprocessed_train_data.csv ../data/preprocessed_test_data.csv ../results`
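If typing the six commands by hand gets tedious, something like the sketch below could run them in the listed order and stop at the first failure. This is only an illustration, not part of the repo: the helper name `run_pipeline`, the `dry_run` flag, and the idea of saving it as a file inside `src` are all assumptions. The command strings themselves are copied unchanged from this comment.

```python
import shlex
import subprocess

# Commands copied verbatim from the comment above, in the order listed there.
PIPELINE = [
    "python read_data.py 468 ../data/raw_features.csv ../data/raw_targets.csv",
    "python cleaning.py ../data/x_train.csv ../data/y_train.csv "
    "../data/cleaned_features.csv ../data/cleaned_targets.csv",
    "python data_split.py ../data/cleaned_features.csv ../data/cleaned_targets.csv "
    "../data/x_train.csv ../data/x_test.csv ../data/y_train.csv ../data/y_test.csv",
    "python eda_figures.py ../data/cleaned_features.csv ../data/cleaned_targets.csv "
    "../img/figure",
    "python preprocessing.py ../data/x_train.csv ../data/x_test.csv "
    "../data/y_train.csv ../data/y_test.csv "
    "../data/preprocessed_train_data.csv ../data/preprocessed_test_data.csv",
    "python analysis.py ../data/preprocessed_train_data.csv "
    "../data/preprocessed_test_data.csv ../results",
]


def run_pipeline(commands, cwd=".", dry_run=True):
    """Hypothetical helper: echo each command and, unless dry_run is set,
    execute it from `cwd`. Returns the argument lists in execution order."""
    executed = []
    for command in commands:
        argv = shlex.split(command)  # split the string into program + arguments
        print("$", command)
        if not dry_run:
            # check=True raises CalledProcessError on a non-zero exit code,
            # so later steps never run on top of a failed earlier step.
            subprocess.run(argv, cwd=cwd, check=True)
        executed.append(argv)
    return executed


# Dry run: only prints the commands; nothing is executed.
steps = run_pipeline(PIPELINE)
```

Calling `run_pipeline(PIPELINE, dry_run=False)` from inside the `src` folder of the running container would actually execute the scripts. The relative `../data` paths assume that working directory.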

sanash43 commented 5 months ago

All scripts and the Makefile have been fixed, and the Quarto report runs fine now.