Properly log agent values from pposgd synth

nottombrown / rl-teacher

Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for collecting human feedback

MIT License

559 stars 95 forks source link

Closed nottombrown closed 7 years ago

nottombrown commented 7 years ago

I'm currently seeing empty graphs:

nottombrown commented 7 years ago

Nevermind. Working correctly