Closed: Kelvinson closed this 5 years ago
This seems likely to be correct, but when is this used? Is ale.lives
ever set?
I don't know about the coinenv, but for a normal gym environment with no aux_rew, self.long_aux_rewards[i,0] is accessed at the end of the game even though self.long_aux_rewards was created with a zero-size shape at https://github.com/Kelvinson/coinrun/blob/8ad3339286e486f4258761a7311814b7e8d00665/coinrun/wrappers.py#L54
NVM, maybe it's not a bug for coinenv.
When the game is over, game_over_rew can read from long_aux_rewards, which may have size zero along axis 1 when use_aux is False. We can first check whether use_aux is True and only then access long_aux_rewards; otherwise we simply set game_over_rew to zero.
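A minimal sketch of the guard described above, in case it helps the discussion. This is not the actual code in wrappers.py: the helper name `game_over_reward`, the standalone-function form, and the array shapes are assumptions made for illustration only.

```python
import numpy as np


def game_over_reward(long_aux_rewards, i, use_aux):
    """Hypothetical helper: aux reward for env i at game over, or 0.0 if aux is off."""
    # Only index axis 1 when aux rewards are enabled and the axis is non-empty.
    if use_aux and long_aux_rewards.shape[1] > 0:
        return float(long_aux_rewards[i, 0])
    return 0.0


num_envs = 4

# Assumed shape when use_aux is False: zero columns, so [i, 0] would raise IndexError.
no_aux = np.zeros((num_envs, 0))

# Assumed shape when use_aux is True: at least one aux column per env.
with_aux = np.arange(8.0).reshape(num_envs, 2)

print(game_over_reward(no_aux, 2, use_aux=False))
print(game_over_reward(with_aux, 2, use_aux=True))
```

The extra `shape[1] > 0` check is defensive: it keeps the access safe even if use_aux and the array shape ever disagree.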