meta-llama / llama

Inference code for Llama models
Other
55.37k stars 9.44k forks source link

Access to SFT dataset or LLaMA2 SFT models #543

Open YihanCao123 opened 1 year ago

YihanCao123 commented 1 year ago

Hi authors,

First of all, thanks for your great work on LLaMA-2! This is an impressive work for open source large language models!

I have a question about section 3.1 in the paper, specifically "Quality is all you need" section. It mentions that when instruction tuning the base model, you first select 27,540 high quality data examples. Is it possible that you can open source these selected data or the supervised finetuned model, which does not include RLHF?

Thanks!

a-antoniades commented 10 months ago

+1 both the data and sft model would be very useful for researchers.