google / brax

Massively parallel rigidbody physics simulation on accelerator hardware.
Apache License 2.0

Short-horizon actor-critic #247

Open EelcoHoogendoorn opened 2 years ago

EelcoHoogendoorn commented 2 years ago

It occurred to me that this recent paper would be an interesting one to implement inside brax.

One of the cool things about brax is its differentiability, but as I understand it, attempts to leverage that on the type of environments that brax includes have not been very fruitful thus far. This paper seems to bridge that gap between differentiable physics and RL quite nicely: it makes a lot of sense to truncate trajectories with a learned critic.
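To make the idea of truncating trajectories with a learned critic concrete, here is a rough JAX sketch. It is not the paper's implementation and does not use the brax API: `toy_step`, `policy`, `critic`, and all parameter values are hypothetical stand-ins chosen only so the example runs. The point is just the shape of the loss: backpropagate through a short differentiable rollout and let a critic account for everything after the cutoff.

```python
import jax
import jax.numpy as jnp


def toy_step(state, action, dt=0.1):
    # Hypothetical differentiable dynamics (not brax): a 1D point mass,
    # state = [position, velocity], reward for staying near the origin.
    vel = state[1] + dt * action
    pos = state[0] + dt * vel
    reward = -(pos ** 2) - 0.01 * action ** 2
    return jnp.array([pos, vel]), reward


def policy(policy_params, state):
    # Hypothetical linear policy with a tanh squash.
    return jnp.tanh(jnp.dot(policy_params, state))


def critic(critic_params, state):
    # Hypothetical quadratic value estimate; in practice this would be a
    # separately trained network.
    return -jnp.dot(critic_params, state ** 2)


def short_horizon_loss(policy_params, critic_params, state0, horizon=8, gamma=0.99):
    def body(carry, _):
        state, discount = carry
        action = policy(policy_params, state)
        next_state, reward = toy_step(state, action)
        return (next_state, discount * gamma), discount * reward

    # Differentiable rollout of only `horizon` steps.
    (final_state, discount), rewards = jax.lax.scan(
        body, (state0, jnp.ones(())), None, length=horizon
    )
    # The critic stands in for the return beyond the truncation point.
    bootstrap = discount * critic(critic_params, final_state)
    return -(jnp.sum(rewards) + bootstrap)


policy_params = jnp.array([-1.0, -0.5])
critic_params = jnp.array([1.0, 0.1])
state0 = jnp.array([1.0, 0.0])

# Gradient w.r.t. the policy parameters only, through the simulated steps.
loss, grad = jax.value_and_grad(short_horizon_loss)(policy_params, critic_params, state0)
```

Keeping the horizon short bounds how many simulation steps the gradient has to pass through, while the critic term keeps the objective from being myopic. Training the critic itself is left out of this sketch.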

EelcoHoogendoorn commented 2 years ago

Didn't see it mentioned in the paper, and Google failed me at first as well, but the project page actually does link to a PyTorch implementation.