nottombrown / rl-teacher

Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for collecting human feedback
MIT License
559 stars 95 forks source link

Only generate 2 segments per label instead of 5 #16

Closed Raelifin closed 7 years ago

Raelifin commented 7 years ago

More efficient. Doesn't matter much once my other PR gets merged, but it was bugging me.

nottombrown commented 7 years ago

lgtm. Applied on master