Open leejaehoon1830 opened 8 months ago
Hi, If I'm correct, the authors used GPT-4 model to get the training data, check paper for more specific details.
I know the dataset used for training the critic model in GPT-4, but I want to know about the dataset used for the generation model.
I am curious whether you only used llama2 13b to create data when generating your generation model's training data, or if you also used llama2 7b to generate generation model's training data.