OpenMined / CampX

Tensor Based Environment Framework for Training RL Agents - Pre Alpha

Boat Race demo #14

Closed by jvmncs 6 years ago

jvmncs commented 6 years ago

Question on this boat race demo: is the agent actually learning to solve the boat race environment? It seems like the running reward that's used as the stopping condition doesn't take into account the episodic reward the agent is accumulating. Or is this agent simply meant to demonstrate the CampX API?
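
To make the concern concrete, here is a minimal sketch of a stopping condition that does track episodic reward. It is not taken from the CampX demo: the names `update_running_reward`, `episodes_until_stop`, `alpha`, and `reward_threshold` are all hypothetical, and the exponential moving average is just one common way to smooth per-episode returns.

```python
# Hypothetical sketch, not the CampX demo code: the running reward is an
# exponential moving average of the total reward collected in each episode.

def update_running_reward(running_reward, episode_reward, alpha=0.05):
    """Blend the latest episode's total reward into the running average."""
    return alpha * episode_reward + (1 - alpha) * running_reward


def episodes_until_stop(episode_rewards, reward_threshold, alpha=0.05):
    """Return (episode_index, running_reward) at the first episode where the
    running reward exceeds the threshold, or (None, running_reward) if the
    threshold is never reached. Episode indices are 1-based."""
    running_reward = 0.0
    for i, episode_reward in enumerate(episode_rewards, start=1):
        running_reward = update_running_reward(
            running_reward, episode_reward, alpha
        )
        if running_reward > reward_threshold:
            return i, running_reward
    return None, running_reward
```

With a condition like this, training only stops once the agent's recent episodes have actually been earning reward. An agent that never collects any reward would never trigger the stop.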