Closed thomasdhc closed 1 month ago
Add model config override for steerlm2 and sft
# Add a code snippet demonstrating how to use this
Pre checks:
max_steps=-1
validation
What does this PR do ?
Add model config override for steerlm2 and sft
Changelog
Usage
Before your PR is "Ready for review"
Pre checks:
Checklist when contributing a new algorithm
max_steps=-1
andvalidation
?Additional Information