aws / amazon-sagemaker-examples

Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.
https://sagemaker-examples.readthedocs.io
Apache License 2.0
10.03k stars 6.75k forks source link

hpo_pytorch_mnist.ipynb notebook throws an error when creating endpoint #1544

Open aws-rbs opened 4 years ago

aws-rbs commented 4 years ago

Error message Exception: Best training job not available for tuning job: pytorch-training-200919-0048

wiltonwu commented 3 years ago

HI @aws-rbs, I was just able to run through the PyTorch HPO notebook successfully. Could you try rerunning, and if that still fails, could you include the Cloudwatch logs for the tuning job and the following details about your environment?

Studio or Regular Notebook Instance? Kernel Version

jjput1 commented 2 years ago

I'm having the same type of issue with my project. Cloud Watch: image I'm doing this in sagemaker studio with data science 1.0.