Open Blazeolmo opened 2 years ago
My question too!
That is used to solve the problem of unaligned generation caused by padding during the batch inference of the decoder-only model.
I think this must be a hard-coded warning, coming from the upstream Transformers package. No matter how hard I've tried, I can't seem to suppress the message.
What ze hell does that mean?