issues
search
CarperAI
/
trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
MIT License
4.51k
stars
471
forks
source link
Add default config LLaMa 2 converter Nemo
#562
Closed
PhungVanDuy
closed
1 year ago