Closed harbecke closed 5 years ago
We are still pretty much clueless about how optimal our parameters are. I am working on adding ax. Here is a list of possible parameters for optimization:
[TRAIN] batch_size learning_rate epochs weight_decay
batch_size
learning_rate
epochs
weight_decay
[CREATE_DATA] train_samples_per_model temperature temperature_decay gamma
train_samples_per_model
temperature
temperature_decay
gamma
[REPEATED SELF TRAINING] num_data_models
num_data_models
reference_models
TODO:
added visualization notebook and intermediate saving callback saving best model is difficult and improbable to be highly beneficial
We are still pretty much clueless about how optimal our parameters are. I am working on adding ax. Here is a list of possible parameters for optimization:
[TRAIN]
batch_size
learning_rate
epochs
weight_decay
[CREATE_DATA]
train_samples_per_model
temperature
temperature_decay
gamma
[REPEATED SELF TRAINING]
num_data_models