NVIDIA / NeMo-Aligner

Scalable toolkit for efficient model alignment
Apache License 2.0
625 stars 78 forks source link

feat: DPO support for global padding of seq_len to a multiple #386

Closed terrykong closed 1 week ago

terrykong commented 2 weeks ago

What does this PR do ?

Needed for:

Rebase stack

Changelog

Usage

# Add a code snippet demonstrating how to use this 

Before your PR is "Ready for review"

Pre checks:

Checklist when contributing a new algorithm

Additional Information