zkzfor closed 1 year ago
I think the reason is that the step function should return obs, state, rewards, dones, infos, None instead of obs, state, rewards, dones, info, None. Can you check it?
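To make the suggestion concrete, here is a minimal sketch of a random-sampling environment whose step function returns the six values in that order. It is not the code from this issue: the class name `RandomSamplingEnv`, its constructor parameters, and the contents of `infos` are hypothetical, and the global state is simply assumed to be the concatenation of all agents' observations. The last element is left as None as in the message above, since its meaning depends on the framework.

```python
import random


class RandomSamplingEnv:
    """Hypothetical toy multi-agent environment (illustration only)."""

    def __init__(self, n_agents=2, obs_dim=4, episode_limit=5, seed=0):
        self.n_agents = n_agents
        self.obs_dim = obs_dim
        self.episode_limit = episode_limit
        self.rng = random.Random(seed)
        self.t = 0

    def _sample_obs(self):
        # One random observation vector per agent.
        return [
            [self.rng.random() for _ in range(self.obs_dim)]
            for _ in range(self.n_agents)
        ]

    def reset(self):
        self.t = 0
        return self._sample_obs()

    def step(self, actions):
        self.t += 1
        obs = self._sample_obs()
        # Assumption: the global state is all observations flattened together.
        state = [x for agent_obs in obs for x in agent_obs]
        rewards = [self.rng.random() for _ in range(self.n_agents)]
        done = self.t >= self.episode_limit
        dones = [done] * self.n_agents
        infos = {"step": self.t}
        # Return the name that was actually assigned above: `infos`, not `info`.
        return obs, state, rewards, dones, infos, None


env = RandomSamplingEnv()
env.reset()
obs, state, rewards, dones, infos, extra = env.step([0, 0])
```

If the body of step builds a variable called `infos` but the return statement names `info`, Python raises a NameError at the return line (unless some other `info` happens to be in scope, in which case the wrong object is returned silently), so it is worth checking that the two names match.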
God, I was so stupid. I'm sorry for wasting your valuable time on such a stupid problem. After the modification, the test environment has run for two rounds and now runs stably. Thank you very much for your help. If you hadn't found my problem, I might have had to give up.
Don't worry at all! I'm really glad to hear that the issue has been resolved, and your test environment is running smoothly now. If you ever have any more questions or run into any other issues, please feel free to reach out.
I need your help again. I am currently creating a multi-agent environment. To test the interface, I have written a random sampling environment to confirm whether my understanding of the interface is correct. However, my test code runs into an error. Here is my environment code:
The error message is as follows:
The configuration is as follows:
Perhaps my question is very basic, but I have been debugging for a long time and still haven't found the problem :(