sql-hkr / offline-rl-research

A framework for offline reinforcement learning, with implementations of SCQL and SCQL+D
MIT License

Fix WandB.log #1

Closed sql-hkr closed 1 year ago

sql-hkr commented 1 year ago

Before

wandb.log(
    {
        "q1_pred": q1_pred.mean(),
        "q_target": q_target.mean(),
        "q1_ood_pred": q1_ood_pred.mean(),
        "pesudo_q_target": pesudo_q_target.mean(),
        "actor_q": q_new_actions.mean(),
        "q1_loss": qf1_loss,
        "q2_loss": qf2_loss,
        "policy_loss": policy_loss,
        "vae_loss": vae_loss,
    },
    step=self._current_epoch,
)

After

wandb.log(
    {
        "q1_pred": q1_pred.mean(),
        "q_target": q_target.mean(),
        "q1_ood_pred": q1_ood_pred.mean(),
        "pesudo_q_target": pesudo_q_target.mean(),
        "actor_q": q_new_actions.mean(),
        "loss/q1": qf1_loss,
        "loss/q2": qf2_loss,
        "loss/policy": policy_loss,
        "loss/vae": vae_loss,
    },
    step=self._current_epoch,
)
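
The only change is to the four loss keys: `q1_loss`, `q2_loss`, `policy_loss`, and `vae_loss` become `loss/q1`, `loss/q2`, `loss/policy`, and `loss/vae`. As far as I understand, W&B uses the part of a metric name before `/` to group charts into a section, so the losses should end up together under `loss`. If more losses get added later, the renaming could be done by a small helper instead of by hand. The sketch below is only an illustration: `group_metrics` is a hypothetical function, not something in this repository, and the numbers are placeholders rather than real training values.

```python
def group_metrics(metrics, prefix="loss", suffix="_loss"):
    """Rename keys ending in `suffix` to `<prefix>/<name>`; keep other keys as-is.

    Hypothetical helper for illustration; it is not part of this repository.
    """
    grouped = {}
    for key, value in metrics.items():
        if key.endswith(suffix):
            # "policy_loss" -> "loss/policy"
            name = key[: -len(suffix)]
            grouped[f"{prefix}/{name}"] = value
        else:
            grouped[key] = value
    return grouped


# Placeholder values standing in for the tensors logged in the trainer.
metrics = {
    "q1_pred": 0.5,
    "actor_q": 0.7,
    "q1_loss": 0.1,
    "q2_loss": 0.2,
    "policy_loss": 0.3,
    "vae_loss": 0.4,
}
grouped = group_metrics(metrics)
print(sorted(grouped))

# In the trainer, the result would be passed on unchanged, e.g.
# wandb.log(grouped, step=self._current_epoch)
```

Keys that do not end in `_loss` pass through untouched, so the Q-value metrics keep their current names.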