issues
search
CarperAI
/
trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
MIT License
4.51k
stars
471
forks
source link
fix(examples/hh): old gpt-j checkpoint loading
#552
Closed
maxreciprocate
closed
1 year ago