pranz24 / pytorch-soft-actor-critic

PyTorch implementation of soft actor critic
MIT License
810 stars 180 forks source link

Doubts about Regularization in policy loss #39

Open Marxvans opened 2 years ago

Marxvans commented 2 years ago

Thank you for your contribution. However I'm confused about reg_loss = 0.001 * (mean.pow(2).mean() + log_std.pow(2).mean())# Regularization Loss in the code. Can you explain it, any help will be grateful.