duyminh1998 / pycmo

Command Modern Operations Learning Environment
Other
58 stars 24 forks source link

Train an agent to solve floridistan using PyCMO Gym #60

Open duyminh1998 opened 11 months ago

duyminh1998 commented 11 months ago

Why

As a

user of PyCMO

I want

to train an RL agent to solve the floridistan scenario using the PyCMO Gym environment provided in update 1.4.0

So that

I finally have RL agents in CMO

Acceptance Criteria

Given

we currently only have a scripted agent solving floridistan

When

we refine gym_demo.py and train an RL agent to solve floridistan

Then

we will prove that PyCMO can be used to train RL agents for CMO

Notes

duyminh1998 commented 11 months ago

PettingZoo baselines

duyminh1998 commented 11 months ago

Training using REINFORCE for Mujoco