Open g-jing opened 4 years ago
It is really great work. I wonder if you could share the hyperparameter that is used to pre-train the DialoGPT, especially the hyperparameters for GPT-small
Which parameter are you referring to?
It is really great work. I wonder if you could share the hyperparameter that is used to pre-train the DialoGPT, especially the hyperparameters for GPT-small