cesare-spinoso / HopperMujoco

1 stars 0 forks source link

Create Udemy SAC agent #37

Closed j-c-carr closed 2 years ago

j-c-carr commented 2 years ago

[Closes #36] Creates the soft-actor critic from a Udemy course. This course goes over one way to implement the original soft actor-critic algorithm. The references to (Haarnoja, 2018) reference this paper.

To test the agent, you should be run the python script (in a virtual environment) like so:

python3 train_agent.py --group udemy_sac_agent

This should automatically start training the agent and save the best model parameters every 1000 timesteps. You should be able to run the agent with pretrained weights like so:

python3 train_agent.py --group udemy_sac_agent --load /path/to/model_filename

Note that the /path/to/model_filename should not include the .pth.tar extension.

The hyperparameters are not yet tuned but after ~200k steps I was able to get a consistent score around 2700.