Include model fit times in learning curves

EducationalTestingService / skll

SciKit-Learn Laboratory (SKLL) makes it easy to run machine learning experiments.


Last year, scikit-learn added functionality to include model fit times when computing learning curves: in addition to the model's performance, it is also quite useful to know how long the model takes to train as more training data is added. This PR adds the same functionality to SKLL.

- The `skll.utils.train_and_score()` function now measures the model fit time for every model trained as part of a learning curve experiment.
- We now generate two plots for each featureset in a `learning_curve` experiment. The first is the usual "score curve" that shows the training and cross-validation scores as more training data is added. The newly added second plot is a "time curve" that shows how the model fit times change as more training data is added. The format for this new curve's file name is `<experiment>_<featureset>_times.png`.
- The model fit times shown in the time curve are first averaged over all runs with the same training data size and then averaged over all output metrics (if multiple ones are specified), which makes the estimates somewhat smoother.
- While the score curve is faceted across both rows (output metrics) and columns (learners), the time curve is faceted only along columns (learners), since the times have already been averaged over the metrics.
- I refactored the `skll.experiments.output.generate_learning_curve_plots` function. It now only pre-processes the score and time data to create data frames. The two curves (score and time) are now generated by two private functions: `skll.experiments.output._generate_learning_curve_score_plots` and `skll.experiments.output._generate_learning_curve_time_plots`.
- I updated the existing tests to accommodate the refactoring and to ensure that the new plots are checked.
- The documentation has been updated to show the time curve in addition to the score curve. I also modified the existing plot to show a more realistic example.
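
For reviewers who want a concrete picture of the timing and two-step averaging described above, here is a minimal sketch using only the standard library. It is not the SKLL implementation: `timed_fit`, `average_fit_times`, and the record layout `(learner, metric, train_size, fit_time)` are hypothetical names chosen for illustration.

```python
import time
from collections import defaultdict
from statistics import mean


def timed_fit(fit, *args):
    """Call ``fit(*args)`` and return ``(result, elapsed_seconds)``.

    Hypothetical helper: it only illustrates wrapping a fit call with a timer.
    """
    start = time.perf_counter()
    result = fit(*args)
    return result, time.perf_counter() - start


def average_fit_times(records):
    """Average fit times per (learner, training size).

    ``records`` is an iterable of (learner, metric, train_size, fit_time)
    tuples, one per trained model (an assumed layout, not SKLL's own).
    Times are averaged over runs first and then over metrics.
    """
    # Step 1: collect the runs for each (learner, size, metric) combination.
    per_run = defaultdict(list)
    for learner, metric, train_size, fit_time in records:
        per_run[(learner, train_size, metric)].append(fit_time)

    # Step 2: average over runs, then group those means by (learner, size).
    per_metric = defaultdict(list)
    for (learner, train_size, _metric), times in per_run.items():
        per_metric[(learner, train_size)].append(mean(times))

    # Step 3: average over metrics, leaving one value per (learner, size).
    return {key: mean(values) for key, values in per_metric.items()}
```

Because the metric dimension is collapsed in the last step, the result has one value per learner and training size, which is why the time curve needs facets only along the learner axis.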

As always, the best way to review is to try this out in the examples. As a starting point, if you want to replicate the same example, you can modify the Titanic example's learning_curve.cfg file as shown below and then look at the Titanic_Learning_Curve_all.png and Titanic_Learning_Curve_all_times.png files in the output directory.
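
To know which files to look for, the naming convention can be sketched as below. `plot_file_names` is a hypothetical helper and not part of SKLL, the `output` directory name is an assumption, and the score plot pattern is inferred from the Titanic file name mentioned above.

```python
from pathlib import Path


def plot_file_names(experiment, featureset, output_dir="output"):
    """Return the (score plot, time plot) paths for one featureset.

    Hypothetical helper; the directory name is an assumed placeholder.
    """
    prefix = f"{experiment}_{featureset}"
    out = Path(output_dir)
    return out / f"{prefix}.png", out / f"{prefix}_times.png"


score_plot, time_plot = plot_file_names("Titanic_Learning_Curve", "all")
print(score_plot.name)  # Titanic_Learning_Curve_all.png
print(time_plot.name)   # Titanic_Learning_Curve_all_times.png
```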

This PR closes #556.

Codecov Report

Patch coverage: 100.00% and project coverage change: +0.05 :tada:

Comparison is base (143ff09) 95.19% compared to head (10469c3) 95.24%.

Additional details and impacted files

```diff
@@            Coverage Diff             @@
##             main     #745      +/-   ##
==========================================
+ Coverage   95.19%   95.24%   +0.05%
==========================================
  Files          29       29
  Lines        3538     3578      +40
==========================================
+ Hits         3368     3408      +40
  Misses        170      170
```

| [Impacted Files](https://app.codecov.io/gh/EducationalTestingService/skll/pull/745?src=pr&el=tree&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=EducationalTestingService) | Coverage Δ | |
|---|---|---|
| [skll/experiments/\_\_init\_\_.py](https://app.codecov.io/gh/EducationalTestingService/skll/pull/745?src=pr&el=tree&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=EducationalTestingService#diff-c2tsbC9leHBlcmltZW50cy9fX2luaXRfXy5weQ==) | `94.69% <100.00%> (ø)` | |
| [skll/experiments/output.py](https://app.codecov.io/gh/EducationalTestingService/skll/pull/745?src=pr&el=tree&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=EducationalTestingService#diff-c2tsbC9leHBlcmltZW50cy9vdXRwdXQucHk=) | `97.86% <100.00%> (+0.40%)` | :arrow_up: |
| [skll/learner/\_\_init\_\_.py](https://app.codecov.io/gh/EducationalTestingService/skll/pull/745?src=pr&el=tree&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=EducationalTestingService#diff-c2tsbC9sZWFybmVyL19faW5pdF9fLnB5) | `97.18% <100.00%> (ø)` | |
| [skll/learner/utils.py](https://app.codecov.io/gh/EducationalTestingService/skll/pull/745?src=pr&el=tree&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=EducationalTestingService#diff-c2tsbC9sZWFybmVyL3V0aWxzLnB5) | `93.37% <100.00%> (+0.05%)` | :arrow_up: |
| [skll/learner/voting.py](https://app.codecov.io/gh/EducationalTestingService/skll/pull/745?src=pr&el=tree&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=EducationalTestingService#diff-c2tsbC9sZWFybmVyL3ZvdGluZy5weQ==) | `98.54% <100.00%> (ø)` | |
