Open tlong123 opened 1 month ago
@tlong123 hi there! 👋 It sounds like you've done everything correctly by running check_dataset
, and it's surprising that the labels aren't recognized after the upload to the Ultralytics HUB. This issue might involve a subtle detail in the dataset structure or labeling format that's not caught by the preliminary checks.
A few things to consider:
train
, val
).Since the check_dataset tool indicates successful validation but the upload process is still failing, it might also be a good idea to re-zip and re-upload your dataset, ensuring no files are missing or corrupted in the process.
If you continue to face issues, please provide more details about your dataset's structure and any steps you've taken since encountering the error. For further guidance, the Ultralytics HUB Docs at https://docs.ultralytics.com/hub might offer additional insights into dataset preparation and troubleshooting.
Keep us posted! Your feedback helps us improve the HUB for everyone. 🚀
my suspicion is that it's failing maybe due to some expectation on the configuration of the path variable in the yaml file, but I also expected that that would have been picked up by the check_dataset function and the hub error doesn't give me enough information to debug the problem.
@tlong123 You're right that the check_dataset
function is designed to catch a wide range of common issues, but it might not cover every possible configuration or edge case, such as specific expectations around the YAML file's path configurations.
Your suspicion regarding the path variable in the YAML file could be a valid point of failure. The YAML file should accurately reflect the structure and location of your dataset's directories and files. If there's a discrepancy between the paths specified in the YAML and the actual dataset layout, it could lead to the issue you're experiencing.
Without altering the structure of your dataset, double-check the YAML file to ensure all paths accurately mirror your dataset's organization. This includes ensuring that the train
, val
, and if applicable, test
paths are correctly specified and point to the right directories.
If the problem persists, updating the YAML to reflect the exact structure of your dataset and re-uploading might help. If errors continue or the messaging remains unclear, we appreciate this feedback as we always aim to enhance the user experience and debugging capabilities on the HUB.
Thanks for sticking with us 🛠️, and please keep us posted on your progress!
@sergiuwaxmann any ideas how I could fix this?
Hello @tlong123!
In theory, the check_dataset
function should be enough but maybe you can take a look at our docs about uploading a dataset or at the Upload Dataset dialog (in Ultralytics HUB) where you can see an example of a dataset structure (based on the selected task) - see image below.
👋 Hello there! We wanted to give you a friendly reminder that this issue has not had any recent activity and may be closed soon, but don't worry - you can always reopen it if needed. If you still have any questions or concerns, please feel free to let us know how we can help.
For additional resources and information, please see the links below:
Feel free to inform us of any other issues you discover or feature requests that come to mind in the future. Pull Requests (PRs) are also always welcomed!
Thank you for your contributions to YOLO 🚀 and Vision AI ⭐
Search before asking
HUB Component
Datasets
Bug
I have created a new dataset based off of an existing one I have used before. After zipping it I ran the check_dataset function against it and recieved a "checks completed correctly" response, however when I upload it to the hub all I get is "Invalid The dataset labels cannot be located."
the output I get from check_dataset is as follows:
Environment
check_dataset ran on m1 mac pro
Minimal Reproducible Example
No response
Additional
No response