Closed gglsmm closed 1 week ago
Probably a duplicate of https://github.com/DLR-RM/stable-baselines3/issues/568 and https://github.com/DLR-RM/stable-baselines3/issues/308
Thank you for your response. in #568, I assume it is for after training.
My question is I want to log Q-values during the training process. For example, we observe loss or episode reward mean during the training on Tensorboard and I want to see Q-values if possible. See attached it is for Pong game. I am just not sure if this code is right.
I assume it is for after training.
why?
See attached it is for Pong game. I am just not sure if this code is right.
it looks fine, although you should use the code example I linked and you might want to call logger.dump()
if the values are not logged often enough for you.
I assume it is for after training.
why?
I collect Q-values during training, so my callback is in model.learn(), but in this code #568, I assumed it collects Q-values after loading a trained model, if not it is my misunderstanding.
Thank you for your feedback.
❓ Question
Hello,
I am trying to log Q-values using custom callback, but I am new in this field and not sure the code below is the correct way to do it.
Checklist