nci / scores

Metrics for the verification, evaluation and optimisation of forecasts, predictions or models.
https://scores.readthedocs.io/
Apache License 2.0
50 stars 15 forks source link

Get isotonic regression working better with Dask #173

Open nicholasloveday opened 5 months ago

nicholasloveday commented 5 months ago

Currently data is brought into memory in the isotonic regression calculation. We should see if we can avoid this.

nicholasloveday commented 5 months ago

It's not immediately clear how this can be done since the output is a dictionary of numpy arrays. This will require a major rewrite to replace the existing function. The alternative is that we create a second, simpler version that just works for targeting the mean.

It's also worth noting that scipy has an implementation which we should compare with the sklearn version.