duyminh1998 / pycmo

Command Modern Operations Learning Environment
Other
49 stars 23 forks source link

Train an agent to solve floridistan using PyCMO Gym #60

Open duyminh1998 opened 9 months ago

duyminh1998 commented 9 months ago

Why

As a

user of PyCMO

I want

to train an RL agent to solve the floridistan scenario using the PyCMO Gym environment provided in update 1.4.0

So that

I finally have RL agents in CMO

Acceptance Criteria

Given

we currently only have a scripted agent solving floridistan

When

we refine gym_demo.py and train an RL agent to solve floridistan

Then

we will prove that PyCMO can be used to train RL agents for CMO

Notes

duyminh1998 commented 9 months ago

PettingZoo baselines

duyminh1998 commented 9 months ago

Training using REINFORCE for Mujoco