Calculate sample efficiency the way they do with the leaderboard

cesare-spinoso / HopperMujoco


Closed cesare-spinoso closed 2 years ago

cesare-spinoso commented 2 years ago

So they use a very dumb way of calculating sample efficiency: the average of the rewards observed during training, not the AUC of the reward curve.

@sabina-elkins we might need to update the evaluation script to add this

sabina-elkins commented 2 years ago

oof dummies. I will add this to my todo list

sabina-elkins commented 2 years ago

wait, don't we already calculate this? it's just the mean reward during training
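
For reference, a rough sketch of the two metrics being compared, assuming the per-episode training rewards are available as a plain list. The function names and the sample values are hypothetical; this is not the leaderboard's code or the repo's evaluation script.

```python
from statistics import fmean


def mean_training_reward(rewards):
    """Average of the per-episode rewards observed during training."""
    return fmean(rewards)


def auc_training_reward(rewards, dx=1.0):
    """Area under the reward curve (trapezoidal rule, spacing dx between episodes)."""
    return sum((a + b) * dx / 2.0 for a, b in zip(rewards, rewards[1:]))


# Hypothetical reward curve, for illustration only.
rewards = [0.0, 10.0, 20.0, 30.0]
print(mean_training_reward(rewards))  # 15.0
print(auc_training_reward(rewards))   # 45.0
```

With unit spacing the trapezoidal AUC equals the sum of the rewards minus half of the first and last values, so for runs of the same length the two numbers differ mostly by a scale factor.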