NVIDIA / NeMo-Aligner

Scalable toolkit for efficient model alignment
Apache License 2.0
625 stars 78 forks source link

feat: support for mcore optimizer (to enable MoE) #380

Open terrykong opened 2 weeks ago

terrykong commented 2 weeks ago

What does this PR do ?

Rebase stack

Changelog

Usage

# Add a code snippet demonstrating how to use this 

Before your PR is "Ready for review"

Pre checks:

Checklist when contributing a new algorithm

Additional Information

terrykong commented 2 weeks ago

The DPO dataset changes should stand on their own, but are needed to test the mcore opt changes for moe. If moe issues take too long to resolve, I'll break this up.