Closed Ahmed-Radwan094 closed 5 months ago
If code there is, it is minimal and working
Closing because the minimum requirements for seeking help are not met.
This also look like tech support, which we don't do.
Unfortunately, I cannot share the code. However, thank you for your support on other tickets.
❓ Question
I implemented a custom environment in Carla (discuss and verified working in previous ticket) and trying to train PPO agent in it. I noticed that the policy gradient loss and explained variance are always very small, while the value loss can have very high peaks (maximum is around 200). The final agent performance is bad (almost random sampling). Can you maybe guide me what could be the reasons behind such behavior and how I can overcome it?
Hyperparameters used:
Checklist