GenoML / genoml2

GenoML (genoml2) is an open source Python package. It is an automated machine learning (autoML) platform for genomics data
Apache License 2.0
27 stars 17 forks source link

Modifications to discrete training #22

Closed m-makarious closed 3 years ago

m-makarious commented 3 years ago

The experimental branch looked to solve the overfitting problem, described in issue #21. Fix is not nominating algorithms for "best algorithm" if their balanced accuracy less than or equal to 50%, |sensitivity-specificity| greater than 0.85, sensitivity equaling 0 or 1, or specificity equaling 0 or 1. If none meet these requirements, then best algorithm is nominated as previously done before (best based on the metric the user chooses to maximize) (thanks @jfcarter2358 for the help!)

Branch also includes a new way to plot the probability plots that will break up each case/class type into separate lines, this was courtesy of @mikeDTI (thanks, Mike!)

Both these features will be in the next genoml2 package release 🥳