Closed ZhengyeHan closed 1 year ago
After running train__cdt.py successfully with my own env, the results are as follows:
It's strange that the return and cost stay constant as the number of training rounds increases. Could you tell me what causes this problem?
Sorry, I solved this by adjusting my env's reset() function. Please delete this issue.
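For anyone who lands here with the same symptom: the thread does not say what was changed in reset(), so the following is only a sketch of one common cause, not the actual fix. It assumes a Gymnasium-style interface (reset() returns `(obs, info)`, step() returns a 5-tuple) and that cost is reported in `info["cost"]`. `ToyEnv`, `buggy_reset`, and `rollout` are hypothetical names made up for the illustration. If reset() leaves per-episode state such as a step counter untouched, every episode after the first ends on its first step, so evaluation metrics no longer depend on the policy.

```python
class ToyEnv:
    """Hypothetical Gymnasium-style env, used only to illustrate the symptom."""

    def __init__(self, max_steps=5, buggy_reset=False):
        self.max_steps = max_steps
        self.buggy_reset = buggy_reset
        self.t = 0  # per-episode step counter

    def reset(self, seed=None):
        # A correct reset() must clear ALL per-episode state.
        # The buggy variant forgets to reset the step counter.
        if not self.buggy_reset:
            self.t = 0
        return float(self.t), {}

    def step(self, action):
        self.t += 1
        reward = float(action)
        cost = 1.0 if action > 0.5 else 0.0  # assumption: cost lives in info
        terminated = False
        truncated = self.t >= self.max_steps
        return float(self.t), reward, terminated, truncated, {"cost": cost}


def rollout(env, policy, episodes):
    """Run a few episodes and collect (return, cost, length) per episode."""
    results = []
    for _ in range(episodes):
        obs, info = env.reset()
        ep_ret, ep_cost, ep_len, done = 0.0, 0.0, 0, False
        while not done:
            obs, reward, terminated, truncated, info = env.step(policy(obs))
            ep_ret += reward
            ep_cost += info["cost"]
            ep_len += 1
            done = terminated or truncated
        results.append((ep_ret, ep_cost, ep_len))
    return results


if __name__ == "__main__":
    policy = lambda obs: 1.0  # stand-in for a trained policy
    for buggy in (False, True):
        lengths = [r[2] for r in rollout(ToyEnv(buggy_reset=buggy), policy, 3)]
        print("buggy_reset =", buggy, "episode lengths:", lengths)
```

Logging the episode length next to return and cost is a quick check: if the length collapses to 1 after the first episode, reset() is a likely suspect.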