CityBrainChallenge / KDDCup2021-CityBrainChallenge-starter-kit

77 stars 40 forks source link

Policy gradient method #42

Open john9636 opened 3 years ago

john9636 commented 3 years ago

DQN was provided as an example. Why was the policy gradient method not provided? Does it work on these problems?

Kanstarry9T commented 3 years ago

Even we provide a set of benchmark solutions (agent_DQN.py, agent_MP.py), but you are encouraged to use any other RL or non-RL methods.