A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
4.51k stars, 471 forks
update(requirements.txt): to the latest `transformers` & `deepspeed` #579
Open
maxreciprocate opened 1 year ago
This PR updates `requirements.txt` with the most recent versions of the dependencies.