dangbert closed this 2 months ago
I think this file mostly answers my question: https://github.com/tatsu-lab/stanford_alpaca/blob/main/train.py#L31
This file in torchtune is interesting as well (it references the above link): https://github.com/pytorch/torchtune/blob/main/torchtune/datasets/_alpaca.py
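
For anyone landing here later, a rough sketch of what the linked `train.py` does, as far as I can tell: it keeps two prompt templates and picks one depending on whether the entry's input field is empty. The template wording below is reproduced from memory, so check it against the linked file before relying on it. The names `PROMPT_WITH_INPUT`, `PROMPT_NO_INPUT`, and `format_prompt` are my own, not from either repo, and the sample entry is made up.

```python
# Templates as I recall them from the linked train.py (assumption: verify the
# exact wording against the file itself).
PROMPT_WITH_INPUT = (
    "Below is an instruction that describes a task, paired with an input that "
    "provides further context. Write a response that appropriately completes "
    "the request.\n\n"
    "### Instruction:\n{instruction}\n\n### Input:\n{input}\n\n### Response:"
)
PROMPT_NO_INPUT = (
    "Below is an instruction that describes a task. Write a response that "
    "appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n### Response:"
)


def format_prompt(example: dict) -> str:
    """Build the prompt for one dataset entry (hypothetical helper).

    Uses the with-input template only when `input` is present and non-blank;
    otherwise falls back to the instruction-only template.
    """
    if (example.get("input") or "").strip():
        return PROMPT_WITH_INPUT.format_map(example)
    return PROMPT_NO_INPUT.format_map(example)


# Made-up entry, for illustration only.
entry = {"instruction": "Translate to French.", "input": "Good morning", "output": "Bonjour"}
prompt = format_prompt(entry)
```

During fine-tuning, the entry's output field would then be appended after `### Response:` as the target text.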
Hi, I'm looking to fine-tune an LLM using this dataset and was wondering if there's any advice on how to format the prompt given the instruction vs. input fields.
For example consider these entries:
I imagine two approaches:
I think I'll use approach 2 but would appreciate any insights or references on this topic :)