How is the "done" variable defined in the exploration task?

Hello, I found the following code in habitat_extensions/exploration_demo.py, and I am confused about how the value of "done" is computed.

    while True:
        obs, reward, done, info = env.step(action)
        if done:
            obs = env.reset()

Apart from the maximum number of steps per episode, is there any other condition under which an agent finishes an episode? For example, if the agent has fully explored the house before reaching the maximum number of steps allowed per episode, will the episode end and env.reset() be called?
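To make the question concrete, one way to check this empirically would be to count the steps until "done" first becomes true and compare that count with the configured step limit. The snippet below is only a minimal sketch of that idea: `ToyEnv`, `early_stop_at`, `episode_length`, and `ended_before_limit` are hypothetical names made up for illustration and are not part of Habitat or OccupancyAnticipation. The only assumption carried over from the demo is that `env.step(action)` returns `(obs, reward, done, info)`.

```python
class ToyEnv:
    """Hypothetical stand-in environment (NOT the Habitat API).

    It ends an episode at max_steps, or earlier if early_stop_at is set,
    so both possible behaviours can be told apart.
    """

    def __init__(self, max_steps=10, early_stop_at=None):
        self.max_steps = max_steps
        self.early_stop_at = early_stop_at
        self.t = 0

    def reset(self):
        self.t = 0
        return {"step": 0}

    def step(self, action):
        self.t += 1
        hit_limit = self.t >= self.max_steps
        stopped_early = (
            self.early_stop_at is not None and self.t >= self.early_stop_at
        )
        done = hit_limit or stopped_early
        return {"step": self.t}, 0.0, done, {}


def episode_length(env, policy):
    """Run one episode and return the number of steps until done is True."""
    obs = env.reset()
    steps = 0
    while True:
        obs, reward, done, info = env.step(policy(obs))
        steps += 1
        if done:
            return steps


def ended_before_limit(env, policy, max_steps):
    """True if the episode finished in fewer steps than the step limit."""
    return episode_length(env, policy) < max_steps
```

If `ended_before_limit` never returns True over many episodes on the real environment, that would suggest the step limit is the only termination condition in practice. It would not prove it, so reading the environment's own episode-over logic is still the reliable answer.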
