-
Thank you for your excellent work. I have a few questions that I hope you can answer. I would like to apply TFVTG to my custom video dataset to test video temporal grounding. What sh…
-
We combine Grounding DINO, Grounding DINO 1.5 and SAM 2 for tracking any object in the input video and we've open-sourced our code here: [Grounded SAM 2](https://github.com/IDEA-Research/Grounded-SAM-…
-
Thank you for your selfless sharing. May I ask when the open-source test code for Video Temporal Grounding will be available? I look forward to your reply.
-
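**_Hello, I got an error when running video inference (demo/video_demo.py) with a model obtained by fine-tuning grounding-dino on my own dataset. The problem is as follows:_**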
(mmenv_new) (base) [zhoujianbang@ai mmdetection-dev-3.x]$ /ssd2/zhoujianbang/envs/mmenv_new/bin/python /ssd/zhoujianbang/proj…
-
🎉Fine-tuning (VQA/OCR/Grounding/Video) of the Qwen2-VL-Chat series models is now supported. Please check the documentation below for details:
# English
https://github.com/modelscope/ms-swift/blob/m…
-
Hi,
When I ran `python grounded_sam2_local_demo.py`,
the result was good with the prompt `text="car. road."`
![grounded_sam2_annotated_image_with_mask](https://github.com/user-attachments/assets/…
-
### Feature Request: Implementing Masked Video Segmentation with Object Detection - GroundingSAM with Overeasy
**Description:**
I would like to request the integration of masked segmentation from …
-
Hello, thank you for your work. I would like to ask why you think the task of synchronized subtitles is important. How can it help in action generation and action understanding?
-
Thank you for your excellent work.
For the Video Temporal Grounding task, I trained the model using the command you provided, `bash scripts/qvhl_pretrain_mamba.sh`. However, the metrics I obtained were…
-
As stated in the question, when the object I want to detect does not appear at the beginning of my video, the code raises an error at run time. What method should I use to eliminate this hidden …