something about submit and reset

stanfordnmbl / osim-rl

Reinforcement learning environments with musculoskeletal models

MIT License

894 stars 249 forks source link

There is something confused me about the example of sbmit.py.

` while True:

print(observation)

[observation, reward, done, info] = client.env_step(env.action_space.sample().tolist())

if done:

    observation = client.env_reset()

    if not observation:

        break`

why should be env_reset when the condition is done? should‘t it be

` while True:

print(observation)

[observation, reward, done, info] = client.env_step(env.action_space.sample().tolist())

if done:

        break`

if then, the video of submition will only be once

stanfordnmbl / osim-rl

something about submit and reset #161