Open duyminh1998 opened 11 months ago
PettingZoo baselines
Training using REINFORCE for Mujoco
Why
As a
user of PyCMO
I want
to train an RL agent to solve the floridistan scenario using the PyCMO Gym environment provided in update 1.4.0
So that
I finally have RL agents in CMO
Acceptance Criteria
Given
we currently only have a scripted agent solving floridistan
When
we refine gym_demo.py and train an RL agent to solve floridistan
Then
we will prove that PyCMO can be used to train RL agents for CMO
Notes
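As a starting point for the training loop, here is a minimal REINFORCE sketch with a tabular softmax policy. It does not use the PyCMO API: `ToyCorridorEnv` is a hypothetical stand-in environment, and its `reset()` / `step()` return shapes follow the Gymnasium convention, which is only an assumption about what the PyCMO Gym environment exposes. The names `softmax` and `train_reinforce` and all hyperparameters are likewise illustrative.

```python
import math
import random


class ToyCorridorEnv:
    """Hypothetical stand-in for a Gym-style environment (NOT the PyCMO API).

    The agent starts at cell 0 and must reach the last cell of a corridor.
    """

    def __init__(self, length=4, max_steps=20):
        self.length = length
        self.max_steps = max_steps
        self.n_actions = 2  # 0 = move left, 1 = move right
        self.pos = 0
        self.steps = 0

    def reset(self):
        self.pos = 0
        self.steps = 0
        return self.pos, {}

    def step(self, action):
        self.steps += 1
        move = 1 if action == 1 else -1
        # Clamp the position so the agent cannot leave the corridor.
        self.pos = max(0, min(self.length - 1, self.pos + move))
        terminated = self.pos == self.length - 1
        truncated = (not terminated) and self.steps >= self.max_steps
        reward = 1.0 if terminated else -0.1
        return self.pos, reward, terminated, truncated, {}


def softmax(prefs):
    """Numerically stable softmax over a list of action preferences."""
    m = max(prefs)
    exps = [math.exp(p - m) for p in prefs]
    total = sum(exps)
    return [e / total for e in exps]


def train_reinforce(env, episodes=1000, lr=0.1, gamma=0.99, seed=0):
    """Train a tabular softmax policy with Monte Carlo policy gradients."""
    rng = random.Random(seed)
    theta = [[0.0] * env.n_actions for _ in range(env.length)]
    for _ in range(episodes):
        # 1. Roll out one full episode with the current policy.
        obs, _ = env.reset()
        trajectory = []
        done = False
        while not done:
            probs = softmax(theta[obs])
            action = rng.choices(range(env.n_actions), weights=probs)[0]
            next_obs, reward, terminated, truncated, _ = env.step(action)
            trajectory.append((obs, action, reward))
            obs = next_obs
            done = terminated or truncated

        # 2. Accumulate G_t * grad log pi(a_t | s_t), walking backwards
        #    so the discounted return G_t can be built incrementally.
        grads = [[0.0] * env.n_actions for _ in range(env.length)]
        g = 0.0
        for state, action, reward in reversed(trajectory):
            g = reward + gamma * g
            probs = softmax(theta[state])
            for a in range(env.n_actions):
                indicator = 1.0 if a == action else 0.0
                grads[state][a] += g * (indicator - probs[a])

        # 3. Apply one gradient-ascent step per episode.
        for s in range(env.length):
            for a in range(env.n_actions):
                theta[s][a] += lr * grads[s][a]
    return theta
```

Calling `train_reinforce(ToyCorridorEnv())` returns the learned preference table; `softmax(theta[s])` gives the action probabilities in state `s`. For floridistan, the tabular policy would presumably need to be replaced by a function approximator over whatever observations the PyCMO environment actually returns.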