WayScience / phenotypic_profiling

Machine learning for predicting 15 single-cell phenotypes from cell morphology profiles
Creative Commons Attribution 4.0 International
2 stars 3 forks source link

Adding CP features to ggplot visualization #24

Closed gwaybio closed 1 year ago

gwaybio commented 1 year ago

Also adding CP+DP features and the F1 score notebook

review-notebook-app[bot] commented 1 year ago

Check out this pull request on  ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.


Powered by ReviewNB

gwaybio commented 1 year ago

Results summary

Feature space Number of top scoring phenotypes
CP_and_DPtest 10
CPtest 4
DPtest 2
image

F1 Score results

f1_scores_multiclass

PR Curve results

pr_curves_multiclass

gwaybio commented 1 year ago

@roshankern - I also had a question: what is Weighted? Should I include this class in the F1 score bar chart?

roshankern commented 1 year ago

@roshankern - I also had a question: what is Weighted? Should I include this class in the F1 score bar chart?

Weighted refers to the weighted F1 score (the mean F1 score that takes into account occurrences of each label). I think this is good to include as it gives an idea of the overall performance of the model.

roshankern commented 1 year ago

Another small comment: consider changing the final pr curve figure to 3 rows x 5 comments instead of 4x4 format. I think this would look better for a final figure, especially since there are 15 models. This is nitpicky, so feel free to ignore.

roshankern commented 1 year ago

There might be something to address in this PR now that the threshold values changed for multiclass models in #26.

gwaybio commented 1 year ago

Another small comment: consider changing the final pr curve figure to 3 rows x 5 comments instead of 4x4 format. I think this would look better for a final figure, especially since there are 15 models. This is nitpicky, so feel free to ignore.

This is a very good suggestion.

I've also implemented all other suggestions, I am going to merge!

gwaybio commented 1 year ago

Weighted refers to the weighted F1 score (the mean F1 score that takes into account occurrences of each label). I think this is good to include as it gives an idea of the overall performance of the model.

One last thing - i changed this label to increase clarity