Open ChristianPala opened 3 months ago
This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.
I also have the same question! @ChristianPala
Thanks for raising the question.
There seems to be a specific breaking case that has been observed here:
The solution was to force the padding side to the right. I'm not able to provide much more information.
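For anyone unsure what "padding side" changes in practice, here is a minimal, library-free sketch. It is only an illustration, not TRL's or the tokenizer's actual implementation; `pad_batch` and the token IDs are hypothetical. With a Hugging Face tokenizer, the fix mentioned above is the single assignment `tokenizer.padding_side = 'right'`, as the warning text itself suggests.

```python
from typing import List, Sequence, Tuple


def pad_batch(
    sequences: Sequence[Sequence[int]],
    pad_id: int,
    padding_side: str = "right",
) -> Tuple[List[List[int]], List[List[int]]]:
    """Pad token-ID sequences to a common length (hypothetical helper).

    Returns (input_ids, attention_mask); the mask is 1 for real tokens
    and 0 for padding positions.
    """
    max_len = max(len(seq) for seq in sequences)
    input_ids: List[List[int]] = []
    attention_mask: List[List[int]] = []
    for seq in sequences:
        n_pad = max_len - len(seq)
        pads = [pad_id] * n_pad
        if padding_side == "right":
            # Real tokens first, padding appended at the end.
            input_ids.append(list(seq) + pads)
            attention_mask.append([1] * len(seq) + [0] * n_pad)
        elif padding_side == "left":
            # Padding first, real tokens pushed to the end.
            input_ids.append(pads + list(seq))
            attention_mask.append([0] * n_pad + [1] * len(seq))
        else:
            raise ValueError(f"Unknown padding_side: {padding_side!r}")
    return input_ids, attention_mask


# Made-up token IDs, with 0 assumed to be the pad token.
batch = [[5, 6, 7], [8, 9]]
right_ids, right_mask = pad_batch(batch, pad_id=0, padding_side="right")
left_ids, left_mask = pad_batch(batch, pad_id=0, padding_side="left")
print(right_ids, right_mask)
print(left_ids, left_mask)
```

With right padding, every sequence starts at position 0 and the pad tokens trail; with left padding, the pad tokens come first. The sketch only shows the layout difference and does not reproduce the half-precision overflow itself.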
Related: https://github.com/huggingface/trl/pull/550
I'm leaving the issue open for visibility, in case anyone is able to give more information or, if possible, solve the problem.
Hi folks!
For a while now I've been trying to figure out why the code below emits this warning. Can you clarify?
    if tokenizer.padding_side is not None and tokenizer.padding_side != "right":
        warnings.warn(
            "You passed a tokenizer with `padding_side` not equal to `right` to the SFTTrainer. "
            "This might lead to some unexpected behaviour due to overflow issues when training a "
            "model in half-precision. You might consider adding `tokenizer.padding_side = 'right'` "
            "to your code."
        )
Thanks!