nlpxucan / WizardLM

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
9.11k stars 711 forks source link

How to reproduce Evol-Instruct datasets? #210

Open imoneoi opened 10 months ago

imoneoi commented 10 months ago

I've seen your open-source Evol-Instruct generation scripts. Good job!

Additionally, can you provide instructions on how to reproduce the WizardLM dataset and WizardCoder dataset using the scripts provided?

nlpxucan commented 10 months ago

You can just modify the file path you want to evol in WizardLM/Evol-Instruct/main.py and then run "python main.py". The default file is the alpaca data.

imoneoi commented 10 months ago

What is the seed file for the WizardLM and WizardCoder datasets?

imoneoi commented 10 months ago

BTW, the scripts seem to be missing the error checker and iterative evolution described in the paper. Are these parts necessary?

imoneoi commented 10 months ago

Any updates?

gantuo commented 6 months ago

BTW, the scripts seem to be missing the error checker and iterative evolution described in the paper. Are these parts necessary?

+1