Eclectic-Sheep / sheeprl

Distributed Reinforcement Learning accelerated by Lightning Fabric
https://eclecticsheep.ai
Apache License 2.0
313 stars 33 forks source link

SAC-AE #21

Closed belerico closed 1 year ago

belerico commented 1 year ago

This PR implements a SAC algorithm working with images. It is inspired by https://arxiv.org/abs/1910.01741

belerico commented 1 year ago

I've trained a Walker-Walk agent with the following:

PYOPENGL_PLATFORM="" MUJOCO_GL=osmesa CUDA_VISIBLE_DEVICES="2,3" lightning run model --devices=2 --accelerator=gpu sheeprl.py sac_pixel_continuous --env_id=dmc_walker_walk --action_repeat=2 --capture_video --sample_next_obs --checkpoint_every=50000 --seed=1

obtaining: image image