idriss-hamadi commented 1 month ago

3 provides

idriss-hamadi commented 1 month ago

Yes, I double-checked: by repeating some code I introduced data leakage, which caused the model to be tested on data it had already seen. Now that I have changed the approach, I will open a PR with another notebook.
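For anyone hitting the same problem, here is a minimal sketch of one way to guard against this kind of leakage: deduplicate the rows first, split once, and then check that the two sets share no rows. The helper name `split_without_overlap` and the toy rows are made up for illustration; they are not taken from the notebook.

```python
import random

def split_without_overlap(rows, test_fraction=0.2, seed=0):
    """Deduplicate rows, then split so no row lands in both sets (hypothetical helper)."""
    unique_rows = list(dict.fromkeys(rows))  # drop exact duplicates, keep first-seen order
    rng = random.Random(seed)                # fixed seed so the split is reproducible
    rng.shuffle(unique_rows)
    n_test = max(1, int(len(unique_rows) * test_fraction))
    return unique_rows[n_test:], unique_rows[:n_test]

# Toy (sequence, label) rows; the repeated row mimics accidental duplication.
rows = [("0101", 2), ("1100", 3), ("0101", 2), ("1111", 4), ("0011", 3), ("1010", 4)]
train, test = split_without_overlap(rows)

# Sanity check: no row should appear in both sets.
overlap = set(train) & set(test)
```

An empty `overlap` only rules out exact duplicate rows. Any fitted pre-processing step (scaling, feature selection) should also be fitted on the training split alone.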

idriss-hamadi commented 1 month ago

I have now added a new file. I read in a comment that the rows with label 1 are generated with a simulator, so I dropped those rows and continued working with the 3 remaining targets. I also added more pre-processing functions and different modeling; so far the model averages 72% accuracy.

I also tried a one-vs-all approach, where I try to predict whether a sequence was generated by a specific class or not (binary classification).
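The filtering step can be sketched as follows. The row layout `(sequence, label)` and the helper name `drop_label` are assumptions made for illustration, not the notebook's actual code.

```python
def drop_label(rows, label_to_drop=1):
    """Return only the rows whose label differs from label_to_drop (hypothetical helper)."""
    return [(seq, label) for seq, label in rows if label != label_to_drop]

# Toy rows: label 1 stands for the simulator-generated data.
rows = [("0101", 1), ("1100", 2), ("0011", 3), ("1111", 4), ("1010", 1)]
kept = drop_label(rows)
remaining_targets = sorted({label for _, label in kept})
```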

I had average results of:

77% accuracy when predicting 4 vs all
84% accuracy when predicting 3 vs all
84% accuracy when predicting 2 vs all
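As a sketch of how the one-vs-all targets can be built, the snippet below relabels a multiclass label list into binary targets and computes the accuracy of an "always predict negative" baseline. The labels are toy values and the helper names are hypothetical. In a one-vs-all setup the negatives usually outnumber the positives, so this baseline is a useful reference point when reading accuracy figures.

```python
def one_vs_all_labels(labels, positive_class):
    """Map multiclass labels to 1 (positive_class) or 0 (every other class)."""
    return [1 if label == positive_class else 0 for label in labels]

def accuracy(y_true, y_pred):
    """Fraction of positions where prediction and target agree."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)

# Toy labels for the three remaining classes (not real data).
labels = [2, 3, 4, 4, 2, 3]
binary_targets = {c: one_vs_all_labels(labels, c) for c in (2, 3, 4)}

# Baseline: always predict "not this class" (all zeros).
baseline = {c: accuracy(t, [0] * len(t)) for c, t in binary_targets.items()}
```

With perfectly balanced classes, as in this toy list, the all-negative baseline already reaches two thirds accuracy for each class. Per-class precision and recall may therefore say more about misclassification than accuracy alone.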

So I'm currently working on lowering the model's misclassification rate.

Any review would be appreciated.

dorahacksglobal / qc-classifier

new classifier #8
