Closed umangchaudhry closed 3 years ago
Cutoff-point – At this stage, we still have to decide where the cutoff point is. We will need to score each particle and then cross-validate some of them under a microscope. As soon as we have defined the cutoff point, it would be great to indicate for each sample how many microdebitage particles it contains.
Model comparison – It would also be useful to compare the scores of different models. Do they overlap for the same particles? For which particles do they differ?
Comparison of relevant variables – Which variables contribute most to the score in each model? Most of them emphasize transparency but they seem to differ in less-important variables. I also wonder about related variables (e.g., feret length vs fiber length) that should be of similar importance but differ in some models. At last, some variables are secondary (e.g., length-width ratio) because they are calculated from primary variables (e..g, length and width). Should they be treated differently or even be excluded from the models?
Load in df, make predictions which will output a score