FoundationVision / GLEE

[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale
https://glee-vision.github.io/
MIT License
1.06k stars 82 forks source link

RVOS inference code #29

Open bio-mlhui opened 5 months ago

bio-mlhui commented 5 months ago

Hello, GLEE is a wonderful work. I saw that the RVOS code part is not finished yet. Can you update the GLEE.py for RVOS inference?

bio-mlhui commented 5 months ago

What object queries similarity measure you used when evaluating RVOS in your appendix? Can I use simple cos distance?

alsichcan commented 4 months ago

+1