nlpxucan / WizardLM

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
9.19k stars 712 forks source link

Which LLM generated the WizardCoder data? GPT-4 or GPT-3.5? #120

Open Symbolk opened 1 year ago

Symbolk commented 1 year ago

Here someone re-implemented the Evol-Instruct data generation process: https://github.com/swtheing/WizardCoder_Instruct_Generator, where he used gpt4_azure API to generate in-depth data.

However, it is not clear in the paper which LLM actually generated the WizardCoder data? Given that for researchers inside Microsoft, the GPT-4 API should be available via Azure back in Feb. or Mar. this year, I guess there is no reason to use GPT-3.5 instead of GPT-4?