Closed chingandy closed 5 years ago
Thank you for sharing your ppo implementation on this repository.
However, I have tried to run your code ppo_continuous.py and I figured that the average reward was not increasing at all. Doesn't that mean the model is not learning?
ppo_continuous.py
Hey, I updated the repo with a bug fix. Try again and let me know.
Now the model seems to work now. Thank you for your update.
Thank you for sharing your ppo implementation on this repository.
However, I have tried to run your code
ppo_continuous.py
and I figured that the average reward was not increasing at all. Doesn't that mean the model is not learning?