Closed roshankern closed 1 year ago
Thank you for the review @d33bs!
Notebook files 2.train_model/train_model.ipynb
and 4.interpret_model/interpret_model.ipynb
should not have .py
files in this PR because their python files had no changes to be tracked in this PR (the Jupyter files were just rerun).
Accidentally merged this PR without approval, but @d33bs and I discussed that everything was good to merge.
This PR is ready for review! This PR incorporates a newer version of mitocheck_data, downloading the 2015 MitoCheck dataset and merging it with the older dataset. The pipeline is then rerun with this expanded dataset.
The "holdout" dataset is also removed, leaving only the training and testing data subsets. Our logic here is that the application of the final phenotypic profiling model to other datasets (ex Cell Health) will validate the model in the same way a holdout dataset would.