scott-yj-yang closed this issue 3 months ago
Before we look further, can you try the same but without the `.index` at the end of the checkpoint name?
Thank you so much for your quick reply! That fixed the issue! I can't believe the problem was the checkpoint file name, lol. I guess I'm not that familiar with TF's checkpoint naming system. Anyway, thanks for your help!
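
For anyone who hits the same error: as far as I understand it, TensorFlow writes each checkpoint as several files that share a common prefix (for example `ckpt-4.index` plus one or more shards such as `ckpt-4.data-00000-of-00001`), and restore paths are expected to be that prefix, `ckpt-4`, rather than any single file. Below is a minimal sketch of normalizing a path before assigning it to `DMPOConfig.checkpoint_to_load`. Note that `checkpoint_prefix` is a hypothetical helper written for illustration, not part of the library or of TensorFlow, and `my_run` is a placeholder for the run name.

```python
import re

# File suffixes TensorFlow appends to a checkpoint prefix on disk, e.g.
#   ckpt-4.index
#   ckpt-4.data-00000-of-00001
_CHECKPOINT_FILE_SUFFIX = re.compile(r"\.(index|data-\d{5}-of-\d{5})$")


def checkpoint_prefix(path: str) -> str:
    """Strip a trailing .index / .data-XXXXX-of-XXXXX suffix, if present.

    Hypothetical helper: it only rewrites the string and does not check
    that the checkpoint files actually exist.
    """
    return _CHECKPOINT_FILE_SUFFIX.sub("", path)


# "my_run" stands in for the actual run name.
path = checkpoint_prefix("my_run/checkpoints/dmpo_learner/ckpt-4.index")
print(path)
```

The resulting prefix is the value to pass as `checkpoint_to_load`; a path that is already a bare prefix is returned unchanged.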
Hi Team,
When I tried to load a checkpoint to continue training by setting `DMPOConfig.checkpoint_to_load = {run_name}/checkpoints/dmpo_learner/ckpt-4.index`, it raised an `AssertionError` inside `create_dual_variables_once`.

This error message from `tf.Variable` is not particularly helpful, since I cannot find any actual assert that tells me what to fix in my code. However, training runs fine without loading the checkpoint.

Looking deeper into the checkpoint loading logic, I don't quite understand the logic at this line, where the variable `_checkpoint` is created and restored but never used.

Could you help me with this issue? Any clarification or guidance is greatly appreciated!
Scott Yang.