datamol-io / splito

Machine Learning dataset splitting for life sciences.
https://splito-docs.datamol.io/
Apache License 2.0
23 stars 2 forks source link

Add support for SPECTRA #15

Open cwognum opened 5 months ago

cwognum commented 5 months ago

This package implements the spectral framework for model evaluation. All you need to get started is (1) a model, (2) a dataset, and (3) a definition of sample to sample similarity! The SPECTRA package generates a series of splits with decreasing train-test similarity. Evaluating your models on these splits will give a better understanding of model generalizability. Read the preprint for more info on how this works.

See https://github.com/mims-harvard/SPECTRA and https://twitter.com/YEktefaie/status/1782449554077647054