Closed stchau4work closed 4 years ago
The configuration parameters are shown below:
I had modified the code to add-in TensorBoard support (v2.1.0) and trained in co-lab with GPU
https://github.com/stchau4work/GO-Bot-DRL/commit/7a4aaf761902632ff84fce6725f5aa33b551e821
However, from the chart, it looks like the agent is not able to get positive rewards and the average success rate is kept at zero all the time.
Could you kindly have a review and see if I am missing something?
What is the epsilon init you suggest to use?
The configuration parameters are shown below:
I had modified the code to add-in TensorBoard support (v2.1.0) and trained in co-lab with GPU
https://github.com/stchau4work/GO-Bot-DRL/commit/7a4aaf761902632ff84fce6725f5aa33b551e821
However, from the chart, it looks like the agent is not able to get positive rewards and the average success rate is kept at zero all the time.
Could you kindly have a review and see if I am missing something?