Farama-Foundation / Miniworld

Simple and easily configurable 3D FPS-game-like environments for reinforcement learning
http://miniworld.farama.org/
Apache License 2.0
700 stars 131 forks source link

episode_rewards in the RL demo #17

Closed yding5 closed 5 years ago

yding5 commented 5 years ago

A quick question. When I'm using the code to do some preliminary RL experiment, I found these in the main.py.

            # FIXME: works only for environments with sparse rewards
            for idx, eps_done in enumerate(done):
                if eps_done:
                    episode_rewards.append(reward[idx])

Does this only used to collect episode reward to show statistics and have nothing to do with the training part?

Does the FIXME means it intends to collect the accumulated reward of the whole episode just ended but currently it assume the reward at last step is the accumulated reward?

Thanks!

maximecb commented 5 years ago

This only affects success rate statistics

yding5 commented 5 years ago

Great, thanks! Close this now.