Question about training

OpenBMB / Eurus

Apache License 2.0


Question about training #13

Closed xu1998hz closed 2 months ago

xu1998hz commented 2 months ago

I have a quick question regarding the training data of Eurus. Do you directly use the instruction as input and the response as output, as shown on HF (openbmb/UltraInteract_sft), or do you apply some other formatting strategy? I ask because the prompts in the evaluation code are quite different from the training data.

lifan-yuan commented 2 months ago

Hi,

Yes, we trained the models directly with the prompt format shown on HF. For math, the prompts in the training data and the eval code should be the same. For coding, however, we modified the eval prompt based on the Qwen repo, because the output format greatly affects eval results, so more specific instructions are needed to get correctly formatted responses.
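
For readers who want to see what "directly" means here, below is a minimal sketch, not the actual Eurus training code. It assumes each record exposes the `instruction` and `response` fields mentioned in the question. The helper name `to_sft_pair` and the sample record are hypothetical and made up for illustration.

```python
from typing import Dict


def to_sft_pair(record: Dict[str, str]) -> Dict[str, str]:
    """Map one record to a training pair with no extra prompt template.

    Assumption: the record has `instruction` and `response` keys.
    The instruction is used verbatim as the model input and the
    response verbatim as the target output.
    """
    return {"input": record["instruction"], "output": record["response"]}


# Hypothetical record, invented for illustration (not taken from the dataset).
example = {
    "instruction": "Solve the following math problem step-by-step.\nWhat is 2 + 3?",
    "response": "Step 1: Add 2 and 3 to get 5.\nAnswer: 5",
}

pair = to_sft_pair(example)
```

Under this reading, any extra formatting instructions for coding would be added only on the evaluation side, which is why the eval prompts can look different from the training data.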

xu1998hz commented 2 months ago

Thanks a lot for your prompt response!