issues
search
yamatokataoka
/
learning-from-human-preferences
Replication of Deep Reinforcement Learning from Human Preferences (Christiano et al, 2017).
MIT License
2
stars
0
forks
source link
Enable numpy type checking
#11
Closed
yamatokataoka
closed
1 year ago