pranz24 / pytorch-soft-actor-critic

PyTorch implementation of soft actor critic
MIT License
810 stars, 180 forks

Question: Why optimize loss_alpha? #47

Open DefinitlyEvil opened 8 months ago

DefinitlyEvil commented 8 months ago

Hi, I'm fairly new to RL. I recently studied DDPG and am now trying to understand SAC. I see that the code optimizes log_alpha instead of alpha directly. Is there a reason for that?

Thanks so much for your explanation. Toby.
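
For context on the question, here is a minimal sketch of one commonly cited reason for optimizing log_alpha: alpha = exp(log_alpha) stays positive whatever the gradient steps do, whereas a plain gradient step on alpha itself can push it below zero. This is plain Python rather than PyTorch so it runs on its own, and it is not taken from this repository. The function names, the learning rate, and the constant `entropy_gap` (standing in for the mean of `-log_pi - target_entropy`) are all hypothetical, and the loss is assumed to be linear in whichever variable is being optimized.

```python
import math


def step_alpha_direct(alpha, entropy_gap, lr):
    """One gradient-descent step on alpha itself.

    Assumed loss: J(alpha) = alpha * entropy_gap, so dJ/dalpha = entropy_gap.
    Nothing here keeps alpha positive.
    """
    return alpha - lr * entropy_gap


def step_log_alpha(log_alpha, entropy_gap, lr):
    """One gradient-descent step on log_alpha.

    Assumed loss: J(log_alpha) = log_alpha * entropy_gap,
    so dJ/dlog_alpha = entropy_gap. log_alpha may be any real number.
    """
    return log_alpha - lr * entropy_gap


def run(steps, alpha0=0.2, entropy_gap=0.5, lr=0.3):
    """Apply both update rules for `steps` steps and return the final alphas."""
    alpha = alpha0
    log_alpha = math.log(alpha0)
    for _ in range(steps):
        alpha = step_alpha_direct(alpha, entropy_gap, lr)
        log_alpha = step_log_alpha(log_alpha, entropy_gap, lr)
    # exp() maps any real log_alpha back to a strictly positive alpha.
    return alpha, math.exp(log_alpha)


direct_alpha, exp_alpha = run(steps=2)
print("alpha optimized directly:", direct_alpha)
print("alpha from log_alpha:    ", exp_alpha)
```

With these hypothetical settings, the directly optimized alpha goes negative after two steps, while the value recovered from log_alpha only shrinks toward zero. A negative temperature would flip the sign of the entropy term in the SAC objective, which is presumably what the log parameterization is meant to rule out.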