duyminh1998 / pycmo

Command Modern Operations Learning Environment
Other
51 stars 23 forks source link

Create custom rewards handler #52

Open duyminh1998 opened 9 months ago

duyminh1998 commented 9 months ago

Why

As a

user of pyCMO

I want

to be able to specify different reward models for my scenarios

So that

I can train RL agents

Acceptance Criteria

Given

we currently only export the player's side's total score as the reward

When

we implement a way for users to specify a reward model

Then

we get closer to being able to train RL agents

Notes

One idea is to create a custom RewardHandler class that gets passed into CMOEnv that can calculate the reward based on the current observation

duyminh1998 commented 8 months ago

gymnasium provides reward wrappers