Closed CoolColoury closed 2 months ago
I noticed that the datasets in prepare_dataset.ipynb are composed differently: some have a background field and some do not.
At the same time, I found that background is a required field in encode_with_chat_format_finetune during instruction tuning.
How do you handle the datasets that lack this field? Thanks!
Hi, for the datasets without a natural context, we run retrieval and use the top-1 retrieved document as the context.
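
For anyone reading later, here is a minimal sketch of that fallback. It is not the repository's actual retrieval code: the retriever used by the authors is not named in this thread, so a small TF-IDF-style scorer written with the standard library stands in for it. The function names (top1_document, fill_background) and the "question" field are hypothetical; only the "background" field comes from the discussion above.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def top1_document(query, corpus):
    """Return the corpus document with the highest TF-IDF-style score for the query.

    Assumption: a stand-in for a real retriever (BM25, dense retrieval, ...).
    """
    doc_counts = [Counter(tokenize(doc)) for doc in corpus]
    n_docs = len(corpus)

    # Document frequency: in how many documents each term appears.
    doc_freq = Counter()
    for counts in doc_counts:
        doc_freq.update(counts.keys())

    query_terms = set(tokenize(query))

    def score(counts):
        total = 0.0
        for term in query_terms:
            if term in counts:
                # Term frequency weighted by a smoothed inverse document frequency.
                total += counts[term] * math.log((n_docs + 1) / (doc_freq[term] + 0.5))
        return total

    best_index = max(range(n_docs), key=lambda i: score(doc_counts[i]))
    return corpus[best_index]


def fill_background(example, corpus):
    """Keep an existing 'background'; otherwise fill it with the top-1 retrieved document."""
    if example.get("background"):
        return example
    filled = dict(example)  # copy so the input example is not mutated
    # Assumption: the retrieval query is stored under a hypothetical 'question' key.
    filled["background"] = top1_document(example["question"], corpus)
    return filled


corpus = [
    "Paris is the capital of France.",
    "The Nile is a river in Africa.",
]
example = fill_background({"question": "What is the capital of France?"}, corpus)
print(example["background"])
```

With this in place, every example carries a background field before it reaches encode_with_chat_format_finetune, whether the context was part of the dataset or retrieved.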