[X] I have searched the Supervision issues and found no similar feature requests.
Question
Hi,
I’m trying to detect objects held in hand. Do you know of any models or datasets that are well-suited for this task?
If I have to label the data myself, would it be better to use YOLO-World to generate the bounding boxes (bbox grounding)?
Additionally, there are a large number of product classes involved. I’m wondering if it would be better to only detect objects and handle class recognition through retrieval methods.
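To make the retrieval idea concrete, here is a rough sketch of what I have in mind: a class-agnostic detector produces crops, each crop is embedded, and the class comes from the nearest reference embedding in a product gallery. Everything here is an assumption for illustration: the embeddings are toy vectors (in practice they would come from some image encoder, which is not shown), and `retrieve_labels`, `min_similarity`, and the label names are hypothetical, not part of Supervision.

```python
import numpy as np


def l2_normalize(x: np.ndarray) -> np.ndarray:
    """Scale each row to unit length so a dot product equals cosine similarity."""
    norms = np.linalg.norm(x, axis=1, keepdims=True)
    return x / np.clip(norms, 1e-12, None)


def retrieve_labels(crop_embeddings, gallery_embeddings, gallery_labels, min_similarity=0.5):
    """Give each detected crop the label of its most similar gallery embedding.

    Crops whose best cosine similarity is below `min_similarity` are
    reported as "unknown" instead of being forced into a class.
    """
    queries = l2_normalize(np.asarray(crop_embeddings, dtype=np.float64))
    gallery = l2_normalize(np.asarray(gallery_embeddings, dtype=np.float64))
    similarity = queries @ gallery.T              # shape: (num_crops, num_gallery)
    best = similarity.argmax(axis=1)              # nearest gallery item per crop
    best_score = similarity[np.arange(len(queries)), best]
    return [
        gallery_labels[i] if score >= min_similarity else "unknown"
        for i, score in zip(best, best_score)
    ]


# Toy data: two reference products and two detected crops (hypothetical values).
gallery_embeddings = np.array([[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]])
gallery_labels = ["cola_can", "chips_bag"]
crop_embeddings = np.array([[0.9, 0.1, 0.0], [0.0, 0.0, 1.0]])

print(retrieve_labels(crop_embeddings, gallery_embeddings, gallery_labels))
# prints ['cola_can', 'unknown']
```

With this split, adding a new product would only mean adding reference embeddings to the gallery, without retraining the detector.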
Thank you!