zapper-95 / Coup-RL

Models trained to play the card game Coup

Feature - Set up tests for desired behaviour #29

Closed zapper-95 closed 5 months ago

zapper-95 commented 6 months ago

Currently, the easiest way to test a model's performance is to play it against a random agent. A high win rate there does not necessarily indicate a better model, since a policy that does nothing but challenge the random agent would also score well.

Instead, it would be good to check analytically whether the agent performs well in particular circumstances. For certain states, there is an action that guarantees a win. It would be useful to have a single value showing how often the agent picks these actions across many scenarios.
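
A rough sketch of what such a metric could look like, assuming the trained model can be wrapped as a function from a state to an action. Everything named here (`Scenario`, `winning_action_rate`, the dict-based state encoding, the action strings) is hypothetical and not taken from the repo. The example scenarios rely on the rule that a Coup costs 7 coins and cannot be blocked or challenged, so in a two-player game it wins outright against an opponent with one card left.

```python
from dataclasses import dataclass
from typing import Any, Callable, Iterable


@dataclass(frozen=True)
class Scenario:
    """A hand-built state with the set of actions known to guarantee a win."""
    name: str
    state: Any                   # hypothetical state encoding, not the repo's
    winning_actions: frozenset   # actions that win from this state


def winning_action_rate(policy: Callable[[Any], Any],
                        scenarios: Iterable[Scenario]) -> float:
    """Fraction of scenarios in which the policy picks a guaranteed-win action."""
    scenarios = list(scenarios)
    if not scenarios:
        raise ValueError("at least one scenario is required")
    hits = sum(policy(s.state) in s.winning_actions for s in scenarios)
    return hits / len(scenarios)


# Illustrative two-player scenarios: the opponent has one card left and the
# agent can afford a Coup, which cannot be blocked or challenged.
EXAMPLE_SCENARIOS = [
    Scenario("coup_at_7_coins",
             {"coins": 7, "opponent_cards": 1}, frozenset({"coup"})),
    Scenario("coup_at_9_coins",
             {"coins": 9, "opponent_cards": 1}, frozenset({"coup"})),
]


def always_coup(state):
    """Stand-in policy; a real test would query the trained model instead."""
    return "coup"


def always_income(state):
    """Stand-in policy that never takes the winning action."""
    return "income"


rate_coup = winning_action_rate(always_coup, EXAMPLE_SCENARIOS)
rate_income = winning_action_rate(always_income, EXAMPLE_SCENARIOS)
```

The stand-in policies would be replaced by a wrapper around the trained model, and the score could be tracked across checkpoints alongside the win rate against the random agent.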