matheusportela / Multiagent-RL

Multiagent reinforcement learning simulation framework - Undergraduate thesis in Mechatronics Engineering at the University of Brasília
67 stars 16 forks source link

Weights structure must map actions to feature and then to weights. #46

Closed matheusportela closed 8 years ago

matheusportela commented 8 years ago

Currently, weights is a dictionary that maps actions (or behaviors) to weights array, one for each feature, i.e. action -> [feature].

Instead, it must map an action to a feature and, then, to the weight, i.e. action -> feature -> weight. Then, should the features order change from a previous game to another one, where the policy is loaded, it'll map correctly the action to its weight.