FoundationVision / GLEE

[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale
https://glee-vision.github.io/
MIT License
1.09k stars 85 forks source link

Open-vocabulary segmentation Demo #30

Open yashbhalgat opened 6 months ago

yashbhalgat commented 6 months ago

Hi @wjf5203, thank you for this interesting work!

Can you please provide a "demo" script for open-vocabulary instance segmentation on videos? Currently, I only see a TEST.md file that describes evaluating on existing datasets.

I also find the code a bit hard to follow to be able to implement this on my own. So, it would be very helpful if you can provide such a demo script.

Thank you! :) Yash

wassimea commented 6 months ago

+1