A decoder-only architecture is being used, but right-padding was detected! For correct generation results, please set padding_side='left' when initializing the tokenizer.
I set the padding side to left before training and after, but it did not change
A decoder-only architecture is being used, but right-padding was detected! For correct generation results, please set
padding_side='left'
when initializing the tokenizer.I set the padding side to left before training and after, but it did not change