Open HuAndrew opened 5 months ago
Hi @HuAndrew, thanks for your interest in YOLO-World! I'd like to know whether the noun phrases form a fixed set (like categories) or vary from case to case: for example, a limited set of 10 noun phrases versus an unlimited (open) set of noun phrases.
For the limited version, you can use the normal fine-tuning setting to fine-tune YOLO-World for your application; you need to prepare your data in the COCO format.
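For the fixed-category case, a minimal sketch of a COCO-style annotation file might look like the following. The file name, image size, and box coordinates are made up for illustration; the category names are simply the tags from the question, and `build_coco` is a hypothetical helper, not part of YOLO-World.

```python
import json

# Hypothetical fixed label set, reusing the tags from the question.
CATEGORIES = ["big_car_front", "car_reg"]


def build_coco(images, boxes):
    """Assemble a COCO-style detection dict.

    images: list of (file_name, width, height)
    boxes:  list of (image_index, category_name, [x, y, w, h])
    """
    cat_ids = {name: i + 1 for i, name in enumerate(CATEGORIES)}
    coco = {
        "images": [
            {"id": i + 1, "file_name": f, "width": w, "height": h}
            for i, (f, w, h) in enumerate(images)
        ],
        "categories": [{"id": cid, "name": name} for name, cid in cat_ids.items()],
        "annotations": [],
    }
    for ann_id, (img_idx, cat, bbox) in enumerate(boxes, start=1):
        x, y, w, h = bbox
        coco["annotations"].append({
            "id": ann_id,
            "image_id": img_idx + 1,
            "category_id": cat_ids[cat],
            "bbox": [x, y, w, h],  # COCO boxes are [x, y, width, height]
            "area": w * h,
            "iscrowd": 0,
        })
    return coco


coco = build_coco(
    images=[("car_0001.jpg", 1280, 720)],
    boxes=[
        (0, "big_car_front", [100, 200, 400, 300]),
        (0, "car_reg", [250, 420, 120, 40]),
    ],
)
print(json.dumps(coco, indent=2))
```

The resulting dict can be written to disk with `json.dump` and referenced from the fine-tuning config as the annotation file.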
For the unlimited/open version, you need to assign a text to each box annotation (replace the category with a text). I'll update a new dataset class for this case if you need it.
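As a rough sketch of what "replace the category with a text" could mean in practice, the snippet below copies a COCO-style dict and attaches a free-form phrase to every box. The `text` field name, the `attach_text` helper, and the `phrase_for` callback are all hypothetical; this is not the schema of any existing YOLO-World dataset class, so the output would still need to be adapted to whichever loader is used.

```python
import copy


def attach_text(coco, phrase_for):
    """Return a copy of `coco` where each annotation carries a `text` field.

    phrase_for(annotation, category_name) -> str is supplied by the caller,
    so the phrase may differ from box to box (the open-vocabulary case).
    """
    id_to_name = {c["id"]: c["name"] for c in coco["categories"]}
    out = copy.deepcopy(coco)  # leave the input untouched
    for ann in out["annotations"]:
        name = id_to_name[ann["category_id"]]
        ann["text"] = phrase_for(ann, name)  # hypothetical field name
    return out


# Toy input: two boxes that share one category but get different texts.
coco = {
    "categories": [{"id": 1, "name": "big_car_front"}],
    "annotations": [
        {"id": 1, "image_id": 1, "category_id": 1, "bbox": [100, 200, 400, 300]},
        {"id": 2, "image_id": 1, "category_id": 1, "bbox": [10, 20, 40, 30]},
        {"id": 3, "image_id": 1, "category_id": 1, "bbox": [5, 5, 50, 50]},
    ],
}

# Made-up per-box descriptions keyed by annotation id.
descriptions = {
    1: "front of a big car",
    2: "front of a big car, partly occluded",
}

texted = attach_text(
    coco,
    # Fall back to the raw category name when no description exists.
    lambda ann, name: descriptions.get(ann["id"], name),
)

for ann in texted["annotations"]:
    print(ann["id"], ann["text"])
```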
Thank you very much for your prompt response and assistance.
For the unlimited/open version, we need to assign a text to each box annotation (replacing the category with a text).
We appreciate your help and support.
Best regards.
Hi @HuAndrew, you're welcome. I'll update a dataset class, or you can try to use MixedGroundingDataset first.
Gratitude
Firstly, I would like to express my appreciation for the open-source work on yolo-world. It has been significantly impactful to the industry.
Description
I have a question regarding the labeling process when using yolo-world for fine-tuning or pre-training models. Specifically, I'm dealing with object detection tasks where the labels contain multiple noun phrases, such as "big_car_front" and "car_reg" (assuming "car_reg" might be a shorthand for "car_rear" or "car_registration").
Question
Does yolo-world support custom labels, and if so, how can I integrate them into the training process?
Additional Information
yolo-world version: [v2]
Actual Behavior
[Practice lab: Using the tag "big_car_reg" directly for fine-tuning or pre-training may result in the loss of semantic information of the nouns, which could lead to suboptimal performance.]
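One possible way to reduce that loss, sketched below, is to turn each underscore-joined tag into a plain phrase before it is used as text, so the words stay visible as separate tokens. The `tag_to_phrase` helper and the expansion table are hypothetical, and the expansion of "reg" is only a guess, since the issue itself is unsure what the abbreviation stands for.

```python
def tag_to_phrase(tag, expansions=None):
    """Convert an underscore-joined tag into a space-separated phrase.

    expansions: optional dict mapping an abbreviated word to its full form.
    """
    expansions = expansions or {}
    words = [w for w in tag.lower().split("_") if w]  # drop empty pieces
    return " ".join(expansions.get(w, w) for w in words)


# Assumed expansion; replace with whatever "reg" means in your data.
EXPANSIONS = {"reg": "registration plate"}

for tag in ["big_car_front", "car_reg", "big_car_reg"]:
    print(tag, "->", tag_to_phrase(tag, EXPANSIONS))
```

Whether this actually improves detection quality would need to be checked empirically on the target data.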
Steps to Reproduce (for bugs)
If applicable, please provide steps to reproduce the issue.
Expected Behavior
What I expect to happen is that yolo-world should be able to handle custom labels and allow me to train the model to detect specific parts of objects with those labels.