Closed weiguowilliam closed 2 years ago
@weiguowilliam Thanks for reporting this issue! When stack=True
, the first 3 observations (frame_stack=4) has the black channels to fill the frames since there is no past observations.
https://github.com/takuseno/d4rl-atari/blob/8d1d3ff621d822a65adc2227441d7c220324f445/d4rl_atari/offline_env.py#L57
There are two ideas to change this behavior
stack=True
Do you have any thoughts on this?
When I set
stack = True/False
for the same environment and get the first observation&reward&action: forstack
case, the first observation isdataset_s['observations'][0][0,:]
) forunstack
case, the first observation isdataset['observations'][0,:]
).The question is, in both cases, the reward list and the action list is same. But the observation list in stack/unstack cases are different. I attached the first observation in stack/unstack case. I wonder what the reason is? Could you please explain it? Thanks in advance.
stacked case
unstacked case
here's the code: