Hello, thank you very much for open sourcing such a great project. I am running the code:
python experiments.py evaluate configs/IntersectionEnv/env.json using the command \
configs/IntersectionEnv/agents/DQNAgent/baseline.json \
--train --episodes=4000 --name-from-config,
the reward graph I get is unstable. I hope to get your help, thanks a lot!
Hello, thank you very much for open sourcing such a great project. I am running the code: python experiments.py evaluate configs/IntersectionEnv/env.json using the command \ configs/IntersectionEnv/agents/DQNAgent/baseline.json \ --train --episodes=4000 --name-from-config, the reward graph I get is unstable. I hope to get your help, thanks a lot!