Question: Why optimize loss_alpha?

pranz24 / pytorch-soft-actor-critic

PyTorch implementation of soft actor critic

MIT License

823 stars 182 forks source link

Question: Why optimize loss_alpha? #47

Open DefinitlyEvil opened 10 months ago

DefinitlyEvil commented 10 months ago

Hi, I'm fairly new to RL and recently I studied DDPG then trying to understand SAC. I see code you optimized log_alpha instead of alpha directly, is there a reason for that?

Thanks so much for you explanation. Toby.