issues
search
OptimalScale
/
LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
https://optimalscale.github.io/LMFlow/
Apache License 2.0
8.11k
stars
819
forks
source link
[Feature] Add PPO support
#854
Closed
wheresmyhair
closed
1 week ago
wheresmyhair
commented
2 weeks ago
Description
Add PPO Support
Pipeline Tests
WIP
Description
Add PPO Support
Pipeline Tests
WIP