Open heping103 opened 2 months ago
Hi, thanks for reaching out. This can be many several reasons off the top of my head, but I cannot say much unless I know more about the task you want to train on. I know it's been a few weeks since you've posted this but if you still have questions on this, feel free to email me!
I customized an environment and trained it with the PPO algorithm,Why does my strategy suddenly collapse as the model is trained? Is this a problem with my environment? Or is it a common problem in reinforcement learning? How do I fix it?Thank you for your teaching and look forward to receiving a response。