AILab-CVC / YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection
https://www.yoloworld.cc
GNU General Public License v3.0
4.28k stars 416 forks source link

The phenomenon of confusion in predicting targets during inference occurs after continuing training on a self built dataset #426

Open Unicorn123455678 opened 1 month ago

Unicorn123455678 commented 1 month ago

Will the parameters of the CLIP text editor be updated when fine-tuning the self built COCO dataset? When I was using Simple_demo.exe in the demo folder and used/YOLO World master/tools/work-dirs/xxx/epoch-80. pth (fine-tuning model parameters from my self built dataset) as a checkpoint, I found that it still seemed to be the parameters loaded from the CLIP native text editor during debugging. This may be the fundamental reason for my model prediction confusion. I hope the author can provide an answer! image

lvke9529 commented 1 month ago

Have you solved this problem?