Open Ericonaldo opened 1 year ago
Seems the rlunplugged dataset is using clipped reward
Hi I met the same question. Do you know how to scale the clip reward to real reward? thanks!
Not to my knowledge. No.
Sorry for the super response. But, yes, the rewards are clipped. Also ,let me redirect you to this publication since this repository is simply relying on the dataset provided by Google. https://arxiv.org/abs/1907.04543
I find that the expert dataset has some problems. For example, for game 'asterix', I use terminal to split the trajectory, and the maximum return is only round 260. Can you please check the problem?