AkihikoWatanabe / paper_notes

たまに追加される論文メモ
https://AkihikoWatanabe.github.io/paper_notes
17 stars 0 forks source link

LaMini-instruction #548

Open AkihikoWatanabe opened 1 year ago

AkihikoWatanabe commented 1 year ago

https://huggingface.co/datasets/MBZUAI/LaMini-instruction

AkihikoWatanabe commented 1 year ago

We distill the knowledge from large language models by performing sentence/offline distillation (Kim and Rush, 2016). We generate a total of 2.58M pairs of instructions and responses using gpt-3.5-turbo based on several existing resources of prompts, including self-instruct (Wang et al., 2022), P3 (Sanh et al., 2022), FLAN (Longpre et al., 2023) and Alpaca (Taori et al., 2023). More information about the process for generating our instruction dataset, please refer to our paper.

Translation (by gpt-3.5-turbo)

AkihikoWatanabe commented 1 year ago

既存のInstruction DatasetのInstructionをseedとして、gpt-3.5-turboで新たなInstructionとresponseを生成したデータセット image