Closed luisenp closed 2 years ago
From my first runs, the new SAC library improves results in cheetah and inverted pendulum, but not in the other domains yet. See plots below and compare with our results in the paper (still lower than orig. MBPO, but better than before).
I didn't individually tune for all domains, do that's the next thing I'll try.
Types of changes
Investigates the performance of MBPO when using this SAC implementation.
Motivation and Context / Related issue
As mentioned in #138, this library is reported to have better results than the one used previously.
How Has This Been Tested (if it applies)
Checklist