pranz24 / pytorch-soft-actor-critic

PyTorch implementation of soft actor critic
MIT License

question about q_loss and alpha_loss #40

Status: Closed (xxxkxin closed this issue 2 years ago)

xxxkxin commented 2 years ago

In sac.update_parameters(), I can see how "qf_loss" relates to its optimizer: it is built from F.mse_loss(qf1, next_q_value) and F.mse_loss(qf2, next_q_value). But for policy_loss and alpha_loss, there seems to be no association between the loss function and the optimizer. Did I miss something?
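For context, here is a minimal sketch of how a loss and an optimizer are usually connected in PyTorch. I am assuming the repo follows this standard pattern. The networks, names, and loss expressions below are hypothetical stand-ins, not the repo's actual code. The point is that the link is never made through the loss itself: each optimizer is given its parameters when it is constructed, loss.backward() writes gradients into the .grad of every parameter the loss depends on, and optimizer.step() updates only the parameters that optimizer holds.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Hypothetical stand-ins for the repo's networks (illustrative names only).
policy = nn.Linear(3, 1)
critic = nn.Linear(3, 1)
log_alpha = torch.zeros(1, requires_grad=True)

# The association happens here: each optimizer stores references
# to the parameters it is responsible for.
policy_optim = torch.optim.Adam(policy.parameters(), lr=1e-2)
alpha_optim = torch.optim.Adam([log_alpha], lr=1e-2)

state = torch.randn(8, 3)

policy_before = policy.weight.detach().clone()
critic_before = critic.weight.detach().clone()
alpha_before = log_alpha.detach().clone()

# Toy "policy loss" that depends on both the policy and the critic.
policy_loss = (policy(state) - critic(state)).pow(2).mean()
policy_optim.zero_grad()
policy_loss.backward()  # fills .grad on every parameter in the graph
policy_optim.step()     # updates only the parameters held by policy_optim

# Toy "alpha loss" that depends only on log_alpha.
alpha_loss = -(log_alpha * 1.5).mean()
alpha_optim.zero_grad()
alpha_loss.backward()
alpha_optim.step()

policy_changed = not torch.equal(policy.weight, policy_before)
critic_changed = not torch.equal(critic.weight, critic_before)
alpha_changed = not torch.equal(log_alpha, alpha_before)
print(policy_changed, critic_changed, alpha_changed)
```

In this sketch the critic receives gradients from policy_loss.backward(), but its weights stay untouched because no optimizer holding them was stepped. If the repo's policy and alpha optimizers are constructed the same way, that would explain why no explicit link between those losses and their optimizers appears inside update_parameters().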