Closed helizabeth1103 closed 3 months ago
Hi @helizabeth1103, the logic that decides whether a saved model is used is in https://github.com/google/deepvariant/blob/r1.6.1/scripts/run_deepvariant.py#L529-L559
Can you double check that you have this file:
/90daydata/pbarc/haley.arnold/AI_Model_Training/Samples/deepvariant_fulltest/output/modeltrainout/2fullindividualmodel/checkpoints/ckpt-14902/saved_model.pb
?
If you have that file, then this should be true:
    use_saved_model = tf.io.gfile.exists(
        _CUSTOMIZED_MODEL.value
    ) and tf.io.gfile.exists(f'{_CUSTOMIZED_MODEL.value}/saved_model.pb')
And then:
    if use_saved_model:
        logging.info('Using saved model: %s', str(use_saved_model))
You should then be able to see the "Using saved model" message in the log.
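For a quick check outside DeepVariant, the same condition can be reproduced with the standard library. This is only a minimal sketch, assuming the model sits on a local filesystem: `looks_like_saved_model` is a hypothetical helper name, not part of DeepVariant, and it uses `os.path` in place of `tf.io.gfile`.

```python
import os


def looks_like_saved_model(model_path: str) -> bool:
    """Mirror the check quoted above using os.path (local files only).

    Hypothetical helper for illustration; not a DeepVariant function.
    """
    # The path itself must exist and must contain a saved_model.pb file.
    return os.path.exists(model_path) and os.path.exists(
        os.path.join(model_path, 'saved_model.pb')
    )
```

Calling it with the path you pass as the customized model should return `True` if the saved-model branch is expected to be taken.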
@helizabeth1103, closing this due to no activity. Please feel free to reopen if you need further help.
It looks like you have both checkpoints:
/90daydata/pbarc/haley.arnold/AI_Model_Training/Samples/deepvariant_fulltest/output/modeltrainout/2fullindividualmodel/checkpoints/
and saved models. I'm just trying to understand which one you are trying to use. Please reply with the outputs so we can understand the issue better.
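To see which kind of artifact each entry in the checkpoints folder is, a listing along these lines may help. It is a sketch under stated assumptions: `classify_entries` is a hypothetical helper, a saved model is taken to be a directory containing `saved_model.pb`, and a TensorFlow checkpoint is taken to be a prefix that has a matching `.index` file.

```python
import os


def classify_entries(checkpoint_dir: str) -> dict:
    """Map each entry name to the artifact kinds found for it.

    Hypothetical helper for illustration; not a DeepVariant function.
    """
    kinds = {}
    for name in sorted(os.listdir(checkpoint_dir)):
        full_path = os.path.join(checkpoint_dir, name)
        if os.path.isdir(full_path) and os.path.isfile(
            os.path.join(full_path, 'saved_model.pb')
        ):
            # Assumption: a directory holding saved_model.pb is a saved model.
            kinds.setdefault(name, []).append('saved_model')
        elif os.path.isfile(full_path) and name.endswith('.index'):
            # Assumption: "<prefix>.index" marks a checkpoint named <prefix>.
            prefix = name[: -len('.index')]
            kinds.setdefault(prefix, []).append('checkpoint')
    return kinds
```

Printing the result for the checkpoints folder and posting it in a reply would show whether `ckpt-14902` is present as a saved model, a checkpoint, or both.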
Hello, I trained a customized model, and am now trying to test it. However, when I try to run it, it says that the model files in the checkpoint do not exist.
Here is the command I tried to run:
Here are the contents of the checkpoints folder for this training:
and finally, here are the contents of ckpt-14902:
Here is the error log file:
Can someone please help me figure out what's going on? The link provided showed a different set of files than the ones I have. Am I missing files? Is something upstream not functioning properly? I have trained models before, with the same kinds of output, and have been able to test them. What am I missing?
Thank you for your help!
Best, Haley Arnold