issues
search
yamatokataoka
/
learning-from-human-preferences
Replication of Deep Reinforcement Learning from Human Preferences (Christiano et al, 2017).
MIT License
2
stars
0
forks
source link
deep-learning
pytorch
reinforcement-learning
readme
Deep Reinforcement Learning from Human Preferences
Replication of
Deep Reinforcement Learning from Human Preferences (Christiano et al, 2017)
.