Open MurakamiTestudo opened 9 months ago
For the special hyperparameters such as alpha, beta, w_share_in_train, thetas_lr, and train_thetas_from_the_epoch, could you provide some examples or best practices for setting them? How does each one affect training and the final model?