UBC-MDS / Student_Success_Predict_Group15

MIT License
0 stars 1 forks source link

model optimization #5

Closed tangyl92 closed 10 months ago

tangyl92 commented 10 months ago

I finished model optimization by using PCA and feature importance value in RF

billwan96 commented 10 months ago

@tangyl92 It looks great. Just that I think the pd.readcsv is not working since the csv file is inside the data folder. Also in order to make our analysis reproducible I think we could use import in python? (https://archive.ics.uci.edu/dataset/697/predict+students+dropout+and+academic+success)

Install the ucimlrepo package

pip install ucimlrepo

Import the dataset into your code

from ucimlrepo import fetch_ucirepo

fetch dataset

predict_students_dropout_and_academic_success = fetch_ucirepo(id=697)

data (as pandas dataframes)

X = predict_students_dropout_and_academic_success.data.features y = predict_students_dropout_and_academic_success.data.targets

tangyl92 commented 10 months ago

@billwan96 Hi Bill, thank you for noticing that. Actually I have change the path as following: "student_df = pd.read_csv('../data/student.csv')" '.. ' means go to parent directory