Closed tangyl92 closed 10 months ago
@tangyl92 It looks great. Just that I think the pd.readcsv is not working since the csv file is inside the data folder. Also in order to make our analysis reproducible I think we could use import in python? (https://archive.ics.uci.edu/dataset/697/predict+students+dropout+and+academic+success)
pip install ucimlrepo
from ucimlrepo import fetch_ucirepo
predict_students_dropout_and_academic_success = fetch_ucirepo(id=697)
X = predict_students_dropout_and_academic_success.data.features y = predict_students_dropout_and_academic_success.data.targets
@billwan96 Hi Bill, thank you for noticing that. Actually I have change the path as following: "student_df = pd.read_csv('../data/student.csv')" '.. ' means go to parent directory
I finished model optimization by using PCA and feature importance value in RF