Open idriss-hamadi opened 1 month ago
Yes, I double-checked. By repeating some code I caused data leakage, so the model was being tested on data it had already seen. Now that I have changed the approach, I will make a PR with another notebook.
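For anyone reviewing, here is a minimal sketch of the kind of check that catches this sort of leakage. It is not the notebook's code: `split_indices` and `assert_no_leakage` are hypothetical helper names, and it assumes the leakage came from the same rows ending up in both the train and test sets.

```python
import random


def split_indices(n_rows, test_fraction=0.2, seed=0):
    """Shuffle the row indices once and cut them into disjoint train/test lists."""
    indices = list(range(n_rows))
    random.Random(seed).shuffle(indices)
    n_test = int(round(n_rows * test_fraction))
    # The first n_test shuffled indices become the test set, the rest the train set.
    return indices[n_test:], indices[:n_test]


def assert_no_leakage(train_idx, test_idx):
    """Raise if any row index appears in both the train and the test set."""
    overlap = set(train_idx) & set(test_idx)
    if overlap:
        raise ValueError(f"{len(overlap)} rows appear in both train and test")


# Split once, then reuse the same index lists for every later step.
train_idx, test_idx = split_indices(100)
assert_no_leakage(train_idx, test_idx)
```

Doing the split a single time and passing the index lists around, instead of re-running the split cell, is one way to avoid the problem when code gets repeated.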
I have now added a new file. I read in a comment that the rows with label 1 are generated by a simulator, so I dropped those rows and continued working with the 3 remaining targets. I added more pre-processing functions and different modeling; so far the model averages 72% accuracy.
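The filtering step could look roughly like the sketch below. The rows and the `"label"` / `"sequence"` keys are made up for illustration; the real column names in the dataset may differ.

```python
SIMULATED_LABEL = 1  # label reported in the comment as simulator-generated

# Made-up example rows; the real data and field names are assumptions.
rows = [
    {"sequence": [0.1, 0.2], "label": 1},
    {"sequence": [0.3, 0.1], "label": 2},
    {"sequence": [0.5, 0.4], "label": 3},
    {"sequence": [0.9, 0.7], "label": 4},
    {"sequence": [0.2, 0.2], "label": 1},
]

# Keep only the rows that were not produced by the simulator.
real_rows = [row for row in rows if row["label"] != SIMULATED_LABEL]
remaining_targets = sorted({row["label"] for row in real_rows})
```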
I also tried a one-vs-all approach, where I try to predict whether a sequence was generated by a specific class or not (binary classification).
I had average results of:
77% accuracy when predicting 4 vs all
84% accuracy when predicting 3 vs all
84% accuracy when predicting 2 vs all
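To make the one-vs-all setup concrete, here is a small sketch of how the targets can be relabeled for each binary problem. The label list is made up and the helper names are hypothetical. It also computes the accuracy of always predicting "not this class", which may be a useful number to compare each binary accuracy against, since the negative class is larger in a one-vs-all split.

```python
def to_one_vs_all(labels, positive_class):
    """Map multiclass labels to 1 for the chosen class and 0 for everything else."""
    return [1 if y == positive_class else 0 for y in labels]


def accuracy(y_true, y_pred):
    """Fraction of positions where the prediction equals the true label."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


# Made-up labels for the three remaining targets.
labels = [2, 3, 4, 2, 4, 3, 3, 2]

# One binary target vector per class.
binary_targets = {cls: to_one_vs_all(labels, cls) for cls in (2, 3, 4)}

# Accuracy of a trivial model that always answers "not this class".
all_negative_baseline = {
    cls: accuracy(target, [0] * len(target))
    for cls, target in binary_targets.items()
}
```

Each entry of `binary_targets` can then be used as the target for a separate binary classifier trained on the same features.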
So I'm currently working on lowering the model's misclassification rate.
Any review would be appreciated.