I am writing code to apply video captioning models on a single input video. Can you show me how to apply your model on a single input video or single input image?
Is there a demo I can follow for the step by step approach?
I'm new to this area and trying to understand what is required for testing video captioning models.
Hi,
I am writing code to apply video captioning models on a single input video. Can you show me how to apply your model on a single input video or single input image? Is there a demo I can follow for the step by step approach? I'm new to this area and trying to understand what is required for testing video captioning models.