allenai / open-instruct


ignore question tokens in finetune.py #180

Closed · WilliamsToTo closed this 3 months ago

WilliamsToTo commented 3 months ago

In encode_with_messages_format() of finetune.py, the code masks the question tokens so they do not contribute to the loss. My question is: why ignore the question part? If I do not ignore the question tokens, and instead update the weights based on the loss from both the question and the answer, will the fine-tuned model become worse than when the question loss is ignored?

hamishivi commented 3 months ago

Hi, you would then be training the model to generate both questions and answers! Typically, people only want the model to learn how to answer questions, not generate them, so they train only on the answer tokens. Prior work has mixed results on whether additionally training on the questions is useful: QLoRA found that it slightly hurt performance (https://arxiv.org/abs/2305.14314), while other work has suggested it might be helpful (https://arxiv.org/abs/2405.14394). We went for the more common option of training on the answers alone, but this is something you could play with if it's interesting to you!
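
For reference, here is a minimal sketch of the masking idea being discussed. It is not the exact encode_with_messages_format() from finetune.py, and the helper name and chat markers are illustrative; it assumes a Hugging Face tokenizer and a model whose cross-entropy loss skips label positions set to -100 (the default ignore_index).

```python
import torch

IGNORE_INDEX = -100  # label positions with this value contribute no loss


def encode_example(tokenizer, question: str, answer: str, max_length: int = 512):
    # Tokenize the prompt alone so we know how many leading tokens belong
    # to the question, then tokenize the full prompt + answer sequence.
    prompt = f"<|user|>\n{question}\n<|assistant|>\n"
    prompt_ids = tokenizer(prompt, add_special_tokens=False).input_ids
    full_ids = tokenizer(
        prompt + answer + tokenizer.eos_token, add_special_tokens=False
    ).input_ids[:max_length]

    input_ids = torch.tensor(full_ids)
    labels = input_ids.clone()
    # Mask the question tokens: loss is then computed only on the answer.
    labels[: len(prompt_ids)] = IGNORE_INDEX
    return {"input_ids": input_ids, "labels": labels}
```

To instead train on the question as well (the alternative asked about above), you would simply skip the masking line so every token keeps its label.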