ultralytics / hub

Ultralytics HUB tutorials and support
https://hub.ultralytics.com
GNU Affero General Public License v3.0
107 stars 11 forks source link

Hub freezes up with "Preparing your cloud instance..." #633

Closed oliverlibaw closed 3 weeks ago

oliverlibaw commented 1 month ago

Search before asking

HUB Component

Training

Bug

I'm unable to use Ultralytics Cloud to train a YOLO model on a custom dataset using the Hub.

The Hub gets stuck on "Preparing your cloud instance... hang tight!" After about 10 minutes, it simply refreshes to the previous screen. There's no error message.

I have a Pro account with a few $ in training credits. Screenshot 2024-04-10 at 1 16 01 PM

Environment

Ultralytics HUB Version v0.1.39 Client User Agent Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/123.0.0.0 Safari/537.36 Operating System MacIntel Browser Window Size 789 x 846 Server Timestamp 1712779695

Minimal Reproducible Example

My model training URL is https://hub.ultralytics.com/models/Ck8CJvrXbVNklZtRA0uA

My dataset is : https://hub.ultralytics.com/integrations/roboflow/datasets/oliver-libaw%2Fcats-squirrels

Thanks for any help you can provide!

Additional

No response

github-actions[bot] commented 1 month ago

👋 Hello @oliverlibaw, thank you for raising an issue about Ultralytics HUB 🚀! Please visit our HUB Docs to learn more:

If this is a 🐛 Bug Report, please provide screenshots and steps to reproduce your problem to help us get started working on a fix.

If this is a ❓ Question, please provide as much information as possible, including dataset, model, environment details etc. so that we might provide the most helpful response.

We try to respond to all issues as promptly as possible. Thank you for your patience!

sergiuwaxmann commented 1 month ago

Hello @oliverlibaw! Thank you for reaching out and bringing this to our attention. The message you're encountering is indeed expected behavior as part of the initialization process for your Cloud Training instance. This process involves spinning up a dedicated instance equipped with GPU resources, which can sometimes take a while depending on the current demand and availability of GPU resources.

sergiuwaxmann commented 3 weeks ago

@oliverlibaw

We just discovered that we have an issue when training Roboflow datasets using our Cloud Training (see #645). Our team is investigating this issue, and we'll update you as soon as we implement a solution.

We appreciate your patience and understanding.

sergiuwaxmann commented 3 weeks ago

Hello @oliverlibaw! We just released a new version that fixes the issue you had.

oliverlibaw commented 3 weeks ago

awesome! Thanks for the update @sergiuwaxmann