AILab-CVC / YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection
https://www.yoloworld.cc
GNU General Public License v3.0
4.43k stars 430 forks source link

Under what environment were performance FPS tested? #162

Open liuhuiCNN opened 6 months ago

liuhuiCNN commented 6 months ago

Under what environment were performance FPS tested? What’s the CUDA cudnn tensorrt version? And what‘s the test methond or test script? Thank you!

taofuyu commented 6 months ago

maybe you should read the paper first

wondervictor commented 6 months ago

Hi @liuhuiCNN, we evaluate the FPS with V100. For LVIS, we do not use TensorRT or FP16. For COCO, we adopt TensorRT(8) (CUDA=11.3)

liuhuiCNN commented 6 months ago

Thanks. And is the FPS tested include NMS?

wondervictor commented 6 months ago

Considering that NMS can be optimized by reducing the pre_nms (reducing pre_nms to 3000 does not affect the accuracy) and downstream deployment, we exclude the NMS during FPS evaluation (LVIS).