AILab-CVC / YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection
https://www.yoloworld.cc
GNU General Public License v3.0

Try OV-DINO, a more powerful open-vocabulary detector. #452

Open wanghao9610 opened 1 month ago

wanghao9610 commented 1 month ago

Thanks for the awesome YOLO-World. I'd like to share our recent work, 🦖OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion.

We have released the evaluation, fine-tuning, and demo code in our project; feel free to try our model in your application.

Project: https://wanghao9610.github.io/OV-DINO

Paper: https://arxiv.org/abs/2407.07844

Code: https://github.com/wanghao9610/OV-DINO

Demo: http://47.115.200.157:7860

Everyone is welcome to try our model, and feel free to raise an issue if you encounter any problems.

ForestWang commented 1 month ago

How does the inference time compare with YOLO-World?

wanghao9610 commented 1 month ago

@ForestWang We haven't tested the inference time systematically. Judging from the demo, inference is already fast, even without any deployment optimization.