Closed houmaolin closed 2 years ago
thanks for applying. In d4rl, I found the way for computing the normalized score. but how could I get the expert score and random score used in the picture? Is there a certain number for every task, or I should run SAC and random policy in every task to get the certain score?
Here it is. https://github.com/rail-berkeley/d4rl/blob/master/d4rl/gym_mujoco/__init__.py#L23
Thanks a lot!
Here it is. https://github.com/rail-berkeley/d4rl/blob/master/d4rl/gym_mujoco/__init__.py#L23
Are those scores same between the v2 or v0?
Currently, I'm focusing on producing benchmark results for
-v0
with 10 random seeds each to make them available for journal submission. Once we get it accepted, we would work on-v2
benchmarking.