Open lafmdp opened 4 years ago
We package HFO environment as a gym-style env and the implementation is as follow: https://github.com/lafmdp/hfo_rl_env/blob/master/utils/env_wrapper.py
Reward function is drew lessons from https://github.com/LARG/HFO/blob/master/example/sarsa_offense/high_level_sarsa_agent.py
Thanks for the pointer!
We package HFO environment as a gym-style env and the implementation is as follow: https://github.com/lafmdp/hfo_rl_env/blob/master/utils/env_wrapper.py
Reward function is drew lessons from https://github.com/LARG/HFO/blob/master/example/sarsa_offense/high_level_sarsa_agent.py