issues
search
hangxu0324
/
Capstone-Project
Fall 2022 Columbia DS Capstone Project
0
stars
0
forks
source link
Week 6 progress track
#13
Open
JasonSqz
opened
1 year ago
JasonSqz
commented
1 year ago
[x] Separate feature engineering and modeling into to two Colab notebook (Senqi)
[x] Change labels to growth/maturity and retrain a random forest model (Senqi)
[x] Balance training/validation set to match the potential distribution of test set (Zehui)
[x] Data Split: Design new function to split data based on clusters (Zehui)
[x] Data Augmentation: Add randomly generated "mature" label data to solve imbalance
[x] Sanity check DNN model performance (Senqi)
[x] Evaluation: design a new function to compare performances of models(Shuyue)
JasonSqz
commented
1 year ago
@hangxu0324 @zehuiwu @shuyuexu @JasonSqz @yajiez11