Return and cost are constant as the number of training rounds increases

liuzuxin / OSRL

🤖 Elegant implementations of offline safe RL algorithms in PyTorch

https://offline-saferl.org

Apache License 2.0

178 stars 12 forks source link

Closed ZhengyeHan closed 1 year ago

ZhengyeHan commented 1 year ago

After running train__cdt.py successfully with my own env，results are as follows:

it's strange that return and cost are constant as the number of training rounds increases.May you tell me what cause this problem?

ZhengyeHan commented 1 year ago

Sorry, I solved this issue by adjusting it in the env reset（） function, please delete this issue