mws262 / qwop-controls

Add smarter controller to UCB rollout policy #7

Closed mws262 closed 5 years ago

mws262 commented 5 years ago

The UCB rollout policy currently takes random actions. It would be cool to use the sort-of-bad-but-better-than-random neural network controller as the rollout controller instead. At the very least, it should use anything better than random.

mws262 commented 5 years ago

This is now possible with the RolloutPolicy abstract class. I'm keeping this open since this is still a focus.
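
As a rough illustration of the idea (not the project's actual code), a rollout policy abstraction might look something like the sketch below. It is written in Python for brevity, and every name in it apart from `RolloutPolicy` is hypothetical: `choose_action`, `RandomRolloutPolicy`, `ControllerRolloutPolicy`, `rollout`, and the `step` callback are assumptions made for the example, and the real abstract class may have a different interface.

```python
import random
from abc import ABC, abstractmethod


class RolloutPolicy(ABC):
    """Hypothetical interface: picks the next action during a rollout."""

    @abstractmethod
    def choose_action(self, state, actions):
        """Return one element of `actions` given the current `state`."""


class RandomRolloutPolicy(RolloutPolicy):
    """Baseline behaviour described in the issue: uniformly random actions."""

    def __init__(self, rng=None):
        self.rng = rng or random.Random()

    def choose_action(self, state, actions):
        return self.rng.choice(actions)


class ControllerRolloutPolicy(RolloutPolicy):
    """Delegates to a controller, e.g. a trained network wrapped in a callable.

    `controller` is assumed to map a state to an action (an assumption of
    this sketch, not a documented API).
    """

    def __init__(self, controller):
        self.controller = controller

    def choose_action(self, state, actions):
        action = self.controller(state)
        if action not in actions:
            raise ValueError(f"controller returned an unavailable action: {action!r}")
        return action


def rollout(policy, state, step, actions, horizon):
    """Run `policy` for up to `horizon` steps and return the summed reward.

    `step(state, action)` is assumed to return (next_state, reward, done).
    """
    total = 0.0
    for _ in range(horizon):
        action = policy.choose_action(state, actions)
        state, reward, done = step(state, action)
        total += reward
        if done:
            break
    return total
```

With this shape, swapping the random rollout for a controller-driven one only changes which policy object is passed to `rollout`; the tree search code that calls it stays the same.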

mws262 commented 5 years ago

Good enough.