SAC-AE - Githubissues

Eclectic-Sheep / sheeprl

Distributed Reinforcement Learning accelerated by Lightning Fabric

https://eclecticsheep.ai

Apache License 2.0

313 stars 33 forks source link

SAC-AE #21

Closed belerico closed 1 year ago

belerico commented 1 year ago

This PR implements a SAC algorithm working with images. It is inspired by https://arxiv.org/abs/1910.01741

[x] Main algo
[x] Sample next obs without saving them
[x] MemoryMappedReplayBuffer
[x] Custom CNN and DeCNN
[x] Reconstruction loss with Variational AutoEncoder

belerico commented 1 year ago

I've trained a Walker-Walk agent with the following:

PYOPENGL_PLATFORM="" MUJOCO_GL=osmesa CUDA_VISIBLE_DEVICES="2,3" lightning run model --devices=2 --accelerator=gpu sheeprl.py sac_pixel_continuous --env_id=dmc_walker_walk --action_repeat=2 --capture_video --sample_next_obs --checkpoint_every=50000 --seed=1

obtaining: