What is the meaning of padding-free in ReadMe?

imoneoi / openchat

OpenChat: Advancing Open-source Language Models with Imperfect Data

https://openchat.team

Apache License 2.0

What is the meaning of padding-free in ReadMe? #179

Open houghtonweihu opened 7 months ago

houghtonweihu commented 7 months ago

In the readme, it says:

The OpenChat training system utilizes padding-free training and the Multipack Sampler, achieving a 3~10x speedup compared to the conventional padded training.

What is the meaning of padding-free here? Do all sequences in one batch need to have the same length? If there is no padding, how is this done?

Thanks!

beep-bebop commented 3 weeks ago

My understanding is that short samples are concatenated into one long sample.
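
To make the concatenation idea concrete, here is a minimal sketch in plain Python. It is not taken from the OpenChat code base: the names `pack_sequences`, `padded_token_count`, `position_ids`, and `cu_seqlens` are illustrative assumptions. The idea is that variable-length samples are laid end to end in one flat sequence, and the cumulative boundaries are kept so that an attention kernel with variable-length support can stop tokens from attending across sample boundaries. No sample needs to match another's length, and no pad tokens are computed.

```python
from itertools import accumulate


def pack_sequences(samples):
    """Concatenate variable-length token lists into one flat sequence.

    Hypothetical helper for illustration only (not OpenChat's API).
    Returns (input_ids, position_ids, cu_seqlens):
      - input_ids: all tokens laid end to end, with no pad tokens
      - position_ids: positions restart at 0 for every sample
      - cu_seqlens: cumulative boundaries, so sample i occupies
        input_ids[cu_seqlens[i]:cu_seqlens[i + 1]]
    """
    input_ids = [tok for sample in samples for tok in sample]
    position_ids = [i for sample in samples for i in range(len(sample))]
    cu_seqlens = [0] + list(accumulate(len(sample) for sample in samples))
    return input_ids, position_ids, cu_seqlens


def padded_token_count(samples):
    """Tokens processed if every sample is padded to the longest one."""
    return len(samples) * max(len(sample) for sample in samples)


# Three samples of lengths 3, 1, and 2 (token ids are made up).
samples = [[11, 12, 13], [21], [31, 32]]
input_ids, position_ids, cu_seqlens = pack_sequences(samples)

print(input_ids)     # [11, 12, 13, 21, 31, 32]
print(position_ids)  # [0, 1, 2, 0, 0, 1]
print(cu_seqlens)    # [0, 3, 4, 6]
print(len(input_ids), "packed tokens vs", padded_token_count(samples), "padded")
```

In this toy batch the packed form holds 6 tokens, while padding to the longest sample would process 9. The gap grows when sample lengths vary widely, which is presumably where the speedup quoted in the readme comes from.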