Open XiaomoWu opened 2 weeks ago
Oh on chat templates since you did a PR to clean the system message up, would you be able to investigate this one @Erland366 ? Thanks :)
Will check them out!
Hey @Erland366, did you have time to look into this? :)
It'd be great to get multi-turn chats to work with the Mistral template
@kldzj Sorry for the very late response
Actually, Daniel already fixed this in this discussion: https://github.com/unslothai/unsloth/issues/1290#issuecomment-2478130636
Hopefully it works now .-.
Can confirm the fix linked in that thread works, but not sure it's in the latest release
I had to install the `unsloth-zoo` nightly and restart my kernel.
I'm using a Mistral model and want to train only on responses. `train_on_responses_only` is supposed to mask only the user prompt; however, the following code masks both the user and assistant messages.

The code, particularly the values of `instruction_part` and `response_part`, is from #1229. I tried different variants of `instruction_part` and `response_part`, such as adding a trailing `\n` or space, but without success.

OS: Ubuntu 24.04
PyTorch: 2.5.0 + cu124
Unsloth: 2024.11.5