mal-lang / mal-simulator

Apache License 2.0
2 stars 1 forks source link

Experiment with the features provided in the observation #18

Open andrewbwm opened 7 months ago

andrewbwm commented 7 months ago

If I understood it correctly Jakob suggested that we add the C, I, A score as three values in the observation for each attack step. Mathias wanted to have the reward itself(or the value from which it is derived the C, I, or A score) be added to each attack step in the observation.