mrahtz/learning-from-human-preferences
Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
MIT License
304 stars, 67 forks
Create LICENSE #1
Closed
mrahtz closed 6 years ago