Modalities / modalities

Modalities, a PyTorch-native framework for distributed and reproducible foundation model training.
MIT License
61 stars 5 forks source link

SFT sample generator #193

Closed rrutmann closed 3 months ago

rrutmann commented 3 months ago

What does this PR do?

This PR adds the possibility of NOT reusing the last target of the previous sample as the first token of the next sample during training. This is needed for Instruction Tuning since, in this case, we have to avoid overlapping training samples (in contrast to the pretraining).

In addition, we provide example configurations for preparing the data and running the training.

Checklist before submitting final PR