machine-intelligence / rl-teacher-atari

(This repository is no longer being maintained.) Code for Deep RL from Human Preferences [Christiano et al.], plus a webapp for efficiently collecting human feedback.
MIT License
26 stars, 6 forks