issues
search
mrahtz
/
learning-from-human-preferences
Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
MIT License
301
stars
67
forks
source link
GRPC error
#14
Closed
errorer-max
closed
1 year ago