ultralytics / ultralytics

Ultralytics YOLO11 🚀
https://docs.ultralytics.com
GNU Affero General Public License v3.0
33.51k stars 6.44k forks source link

use multi datsets to train YOLO-World #15877

Open pixiaopi2018 opened 3 months ago

pixiaopi2018 commented 3 months ago

Search before asking

Question

Hello, I want to use my own datasets to train a YOLO-World detector. But there is a question, for example, dataset_A has car/person in images, but only 'car' is labeled; datset_B also has car/person in images, but only 'person' is labeled. Thus, how can I put dataset_A and B to train YOLO-World simultaneously, the unlabeled objects won't affect the learning ?

Additional

No response

Y-T-G commented 3 months ago

You need to have both labeled in all the datasets.

pixiaopi2018 commented 3 months ago

You need to have both labeled in all the datasets.

Thanks for reply! I agree with you when training normal detector, but this is a ovd detector, the distribution of its train data is very wide (like Flickr/GQA/Object365). For example, some categories in Flickr, must not be labeled in obj365. So I have this question.

glenn-jocher commented 3 months ago

@pixiaopi2018 thank you for the clarification. For YOLO-World, you can indeed train with datasets where not all objects are labeled. The model is designed to handle such scenarios effectively. Just ensure that your datasets are properly formatted and the labels are consistent across them.

pixiaopi2018 commented 3 months ago

@pixiaopi2018 thank you for the clarification. For YOLO-World, you can indeed train with datasets where not all objects are labeled. The model is designed to handle such scenarios effectively. Just ensure that your datasets are properly formatted and the labels are consistent across them.

Thanks! In yoloword, what design is made to do this (same category labeled/not labeled in diffrerent datasets, but datasets can be trained together)? And what "the labels are consistent across them" means? Thanks for your patience agian!

glenn-jocher commented 3 months ago

YOLO-World uses a "prompt-then-detect" strategy, allowing it to handle datasets with varying labeled categories. By "consistent labels," I mean ensuring that the same object categories have the same label names across all datasets. This helps the model understand and detect objects correctly during training.