Open IronmanVsThanos opened 1 year ago
how to inference a video?
You can extract each frame of the video and use Grounding-DINO to detect every frame, or you can try to use the open-world tracking model like DEVA to track objects on video data
how to inference a video?