facebookresearch / mbrl-lib

Library for Model Based RL
MIT License
952 stars 154 forks source link

Change SAC library to use the implementation written by pranz24 #142

Closed luisenp closed 2 years ago

luisenp commented 2 years ago

Types of changes

Investigates the performance of MBPO when using this SAC implementation.

Motivation and Context / Related issue

As mentioned in #138, this library is reported to have better results than the one used previously.

How Has This Been Tested (if it applies)

Checklist

luisenp commented 2 years ago

From my first runs, the new SAC library improves results in cheetah and inverted pendulum, but not in the other domains yet. See plots below and compare with our results in the paper (still lower than orig. MBPO, but better than before).

imageimage

I didn't individually tune for all domains, do that's the next thing I'll try.