Open asmith26 opened 11 months ago
Hmm, maybe I should write up a FAQ at some point since this is a very common question.
There's no easy answer, RL is still more like an art rather than science.
Normalizations typically help, especially when reward or observation scales are not tuned correctly. But it can hurt in some cases. Try disabling all normalization.
You might also just not be training enough. What is your framerate and how long are your training sessions? Some environments take hundreds of millions of steps to get the learning going.
It's hard to say more without knowing more about your environment or config. If you can share some details maybe I can help. Sharing your config would help too.
Hi, I've created a new environment, but I'm struggling to determine if the RL agent is learning correctly. It feels like it isn't improving much, thus am wondering if I have implemented the environment correctly.
Just wondering if you have any tips regarding how I might best check everything is implemented correctly? E.g
Many thanks for any help, and for this amazing lib! :)