OpenMined / CampX

Tensor Based Environment Framework for Training RL Agents - Pre Alpha

Add random agent to Demo 5 #18

Open korymath opened 6 years ago


For Demo 5: Boat Race Example.ipynb, it might be illustrative to add a purely random agent to compare policies.
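
Such a baseline could be sketched roughly as below. The class name `RandomAgent`, the `act` method, and the default of two actions are hypothetical, chosen only for illustration; nothing here is taken from the notebook or from the CampX API.

```python
import random


class RandomAgent:
    """Hypothetical baseline agent that ignores observations entirely."""

    def __init__(self, num_actions=2, seed=None):
        # num_actions=2 is an assumption based on the "action 0 or 1" wording.
        self.num_actions = num_actions
        self.rng = random.Random(seed)

    def act(self, observation=None):
        # Sample uniformly from {0, ..., num_actions - 1}.
        return self.rng.randrange(self.num_actions)


agent = RandomAgent(num_actions=2, seed=0)
actions = [agent.act() for _ in range(1000)]

# How often each action was chosen; roughly even for a uniform policy.
counts = [actions.count(a) for a in range(agent.num_actions)]
```

Comparing these counts (and the resulting returns) against the learned policy would give a simple reference point.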

Also, implementing the random agent might expose an action bias in the demo's policy if, for instance, ties between argmax actions are not broken at random.
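
To illustrate the tie-breaking concern, here is a minimal sketch in plain Python. Both helper names (`greedy_first`, `greedy_random_tie`) and the example values are made up for this comment; the notebook's policy may work differently.

```python
import random


def greedy_first(values):
    # Plain argmax: on ties, always returns the lowest index.
    return values.index(max(values))


def greedy_random_tie(values, rng=random):
    # Argmax with ties broken uniformly at random.
    best = max(values)
    candidates = [i for i, v in enumerate(values) if v == best]
    return rng.choice(candidates)


tied = [0.5, 0.5]  # two actions with identical estimated value
rng = random.Random(0)

# Without random tie-breaking, action 0 is selected every time.
first_picks = {greedy_first(tied) for _ in range(200)}

# With random tie-breaking, both actions get selected.
random_picks = {greedy_random_tie(tied, rng) for _ in range(200)}
```

If the values start out equal (for example, all zeros), the first variant never tries action 1 until something changes the estimates.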

It is not immediately clear to me how action 0 or 1 is selected, given the action calculation line:

action = tdist.data - torch.cat([torch.zeros(1), tdist.data[:-1]])
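
As far as the arithmetic goes, the line subtracts from each element of tdist.data the element before it (with zero in front of the first), i.e. a first difference. A plain-Python sketch of the same computation, assuming tdist.data is a 1-D tensor; the function name and the example values are hypothetical:

```python
def first_difference(values):
    # Mirrors torch.cat([torch.zeros(1), values[:-1]]): shift right, pad with 0.
    shifted = [0.0] + list(values[:-1])
    # Element-wise subtraction, as in tdist.data - shifted.
    return [v - s for v, s in zip(values, shifted)]


# Made-up input: if the tensor held cumulative sums, the difference
# would recover the per-position increments.
cumulative = [0.25, 0.75, 1.0]
per_step = first_difference(cumulative)

# Made-up input: a 0-to-1 step becomes a one-hot vector.
step = [0.0, 1.0, 1.0]
one_hot = first_difference(step)
```

Whether tdist actually holds cumulative values in the demo is not something this line alone shows, so the resulting action may or may not be a one-hot choice between 0 and 1.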