Closed sql-hkr closed 1 year ago
修正前
wandb.log( { "q1_pred": q1_pred.mean(), "q_target": q_target.mean(), "q1_ood_pred": q1_ood_pred.mean(), "pesudo_q_target": pesudo_q_target.mean(), "actor_q": q_new_actions.mean(), "q1_loss": qf1_loss, "q2_loss": qf2_loss, "policy_loss": policy_loss, "vae_loss": vae_loss, }, step=self._current_epoch, )
修正後
wandb.log( { "q1_pred": q1_pred.mean(), "q_target": q_target.mean(), "q1_ood_pred": q1_ood_pred.mean(), "pesudo_q_target": pesudo_q_target.mean(), "actor_q": q_new_actions.mean(), "loss/q1": qf1_loss, "loss/q2": qf2_loss, "loss/policy": policy_loss, "loss/vae": vae_loss, }, step=self._current_epoch, )
修正前
修正後