rail-berkeley / rlkit

Collection of reinforcement learning algorithms
MIT License
2.52k stars 553 forks source link

AWAC implementation #115

Closed anair13 closed 4 years ago

anair13 commented 4 years ago

Everything is running now, but still testing AWAC and skewfit as a regression test.

vitchyr commented 4 years ago

Also, there's a bunch of files here that we could probably delete since we never use

anair13 commented 4 years ago

Here are the results of this branch on AWAC, SkewFit, and SAC: https://drive.google.com/file/d/1Qy5SYIGNwdeTHAGNjbRfuP5pSiRw8JzJ/view?usp=sharing

I went through and did some more cleanup as well. I think this is ready to merge now.