liuzuxin / OSRL

🤖 Elegant implementations of offline safe RL algorithms in PyTorch
https://offline-saferl.org
Apache License 2.0
178 stars 12 forks source link

Return and cost are constant as the number of training rounds increases #21

Closed ZhengyeHan closed 1 year ago

ZhengyeHan commented 1 year ago

After running train__cdt.py successfully with my own env,results are as follows: image image image

it's strange that return and cost are constant as the number of training rounds increases.May you tell me what cause this problem?

ZhengyeHan commented 1 year ago

Sorry, I solved this issue by adjusting it in the env reset() function, please delete this issue