tidymodels / tune

Tools for tidy parameter tuning
https://tune.tidymodels.org
Other
273 stars 42 forks source link

helper to provide minimal information for searches taking `initial` results #690

Open simonpcouch opened 1 year ago

simonpcouch commented 1 year ago

The initial argument to tune_bayes(), tune_sim_anneal(), etc(?) can be expressed as either 1) a previous tuning result or 2) a number indicating the number of initial grid points to evaluate. In case 1), when initial searches result in a large object—as a result of large input data, many grid points, etc—that I know I will want to pass along to another tuning function later (i.e. in another session), I've often wished there were some way to "butcher" the object so that it only contains the information that will be extracted when it's supplied as initial. That "butcher"ing process may even be collect_metrics(), in which case we could just add another initial parameterization.

simonpcouch commented 1 year ago

This may actually be an issue with serialization—would be worth comparing object-in-memory vs. on disk sizes to ensure that rsample serialization tricks carry over inside of tune objects. If not, some notes from bringing this up in team meeting: