when using async_vector_env, the info after termination of one episode is wrong, it returns the info from reset() instead of step()
Code example
in gymnasium/vector/async_vector_env.py 642:646
if terminated or truncated:
old_observation, old_info = observation, info
observation, info = env.reset()
info["final_observation"] = old_observation
info["final_info"] = old_info
I think there should be a deepcopy for the old_observation and old_info here.
System info
gymnasium==0.29.1
Additional context
No response
Checklist
[X] I have checked that there is no similar issue in the repo
Describe the bug
when using async_vector_env, the info after termination of one episode is wrong, it returns the info from reset() instead of step()
Code example
System info
gymnasium==0.29.1
Additional context
No response
Checklist