Closed danielhavir closed 1 year ago
The entropy maximizing term is inspired by this torch implementation of SAC: https://github.com/pranz24/pytorch-soft-actor-critic/blob/master/sac.py
The entropy maximizing term is inspired by this torch implementation of SAC: https://github.com/pranz24/pytorch-soft-actor-critic/blob/master/sac.py