An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
Change run mode so that it can be run directly in shell. #199
Closed
jovany-wang closed 5 months ago
Running it directly in the shell fails with a complaint:
This PR fixes that by changing the run mode.
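
For readers unfamiliar with this kind of change: the description does not show the error or the diff, so the following is only a sketch under the assumption that "run mode" means the file's executable permission bit. The file `demo.sh` is a hypothetical throwaway script created for the demo, not a file from this repository.

```shell
#!/bin/sh
# Assumption: "run mode" refers to the executable bit on the script file.
# demo.sh is a hypothetical stand-in, created in a temporary directory.
tmpdir=$(mktemp -d)
script="$tmpdir/demo.sh"
printf '#!/bin/sh\necho ok\n' > "$script"

# Mode 644 has no executable bit, so the file cannot be invoked directly.
chmod 644 "$script"
if [ -x "$script" ]; then before=executable; else before=not-executable; fi

# Setting the executable bit allows direct invocation from the shell.
chmod +x "$script"
after=$("$script")

echo "before: $before"
echo "after: $after"
rm -rf "$tmpdir"
```

Under the same assumption, Git can record the mode change itself with `git update-index --chmod=+x <path>`, so the file stays executable after a fresh clone.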