Open pszmk opened 2 months ago
idea: run 3 phases with different params sequentially, instead of one combined run. the losses won't be smooth, but the runs will be easier to control.
the losses can still be smooth if one wants, since that is implemented as well, but yeah, I'll extend the run to make it possible to specify a list of configs for each "phase"
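A rough sketch of what a per-phase config list could look like. Everything here is hypothetical and not taken from the actual codebase: `PhaseConfig`, its fields (`name`, `steps`, `lr`), `run_phases`, and the toy `train_step` are placeholders, and the sketch assumes one config per phase.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class PhaseConfig:
    # Hypothetical fields; the real run config will likely have more.
    name: str
    steps: int
    lr: float


def run_phases(
    phases: List[PhaseConfig],
    train_step: Callable[[int, PhaseConfig], float],
) -> List[Tuple[str, int, float]]:
    """Run each phase in order, keeping one global step counter."""
    history = []
    step = 0
    for phase in phases:
        for _ in range(phase.steps):
            loss = train_step(step, phase)
            history.append((phase.name, step, loss))
            step += 1
    return history


def toy_train_step(step: int, phase: PhaseConfig) -> float:
    # Stand-in for a real training step: returns a fake, decreasing loss.
    return phase.lr / (1 + step)


phases = [
    PhaseConfig(name="phase_1", steps=3, lr=1e-3),
    PhaseConfig(name="phase_2", steps=2, lr=5e-4),
    PhaseConfig(name="phase_3", steps=4, lr=1e-4),
]
history = run_phases(phases, toy_train_step)
```

Since params change at phase boundaries, the loss curve can jump there, which matches the "won't be smooth" point above.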
early stopping to save on compute would be great
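For the early stopping part, a minimal patience-based sketch could look like the following. The `EarlyStopping` class, its `patience` and `min_delta` parameters, and the `stop_index` helper are all assumptions for illustration; whether stopping should apply per phase or to the whole run is not decided here.

```python
from typing import List, Optional


class EarlyStopping:
    """Stop when the loss has not improved for `patience` checks in a row."""

    def __init__(self, patience: int = 3, min_delta: float = 0.0):
        self.patience = patience
        self.min_delta = min_delta
        self.best = float("inf")
        self.bad_checks = 0

    def should_stop(self, loss: float) -> bool:
        if loss < self.best - self.min_delta:
            # Improvement: remember it and reset the counter.
            self.best = loss
            self.bad_checks = 0
        else:
            self.bad_checks += 1
        return self.bad_checks >= self.patience


def stop_index(losses: List[float], stopper: EarlyStopping) -> Optional[int]:
    """Return the index at which training would stop, or None if it never does."""
    for i, loss in enumerate(losses):
        if stopper.should_stop(loss):
            return i
    return None


# Loss improves twice, then stalls for three checks in a row.
stalled = stop_index([1.0, 0.9, 0.95, 0.93, 0.92, 0.5], EarlyStopping(patience=3))
# Loss keeps improving, so the stopper never triggers.
improving = stop_index([1.0, 0.8, 0.6, 0.4], EarlyStopping(patience=3))
```

With a fresh stopper per phase, a stalled phase could end early and hand over to the next config; with a single shared stopper, the whole run would end.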