jcwleo / awr-pytorch

Advantage-Weighted Regression
MIT License
10 stars 2 forks source link

Status of this project #6

Open ManifoldFR opened 4 years ago

ManifoldFR commented 4 years ago

Hello, I am reimplementing AWR in my own PyTorch codebase, and I haven't been able to get it to work as well as in the paper (on PyBullet Gym environments). I think this is the only other implementation of AWR I have found apart from the initial TF code, but it doesn't seem to work.

Have you been able to get it to train policies well?

jcwleo commented 4 years ago

@ManifoldFR Hi, I only tested in the basic Gym environment, not in the Pybullet environment. It was well learned in the basic gym environment.