video-captioning-model Search Results

406 results
for video-captioning-model

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

microsoft/GenerativeImage2Text #54

How to increase sample frames number to more than 6?

Hi, thank you so much for the great works! I have questions about sampled frame number, in the paper mentioned > During inference, we uniformly sample 6 frames with center crop. I am keen to kn…

ee2110 updated 10 months ago
5
antoyang/VidChapters #9

tokenizer in demo_vid2seq.py

hello, as i want to use demo_vid2seq.py to get video captioning, there are many questions which i don't understand, first, when i run demo_vid2seq.py, there is an error: load Vid2Seq model Traceba…

tickm updated 11 months ago
4
google-research/scenic #857

Request for access to Vid2Seq inference code for educationa…

Hi @xingyizhou,@a-nagrani and @antoyang, I'm writing to you because I'm interested in using the Vid2Seq model for dense captioning and video captioning on a few educational videos which are MP4 fil…

ChukwumaChukwuma updated 1 year ago
3
ioccc-src/temp-test-ioccc #5

Question: Issues that aren’t really major issues but are sti…

# TODO * [ ] Close this issue when the **great fork merge** happens. # Original comment Like the thread in the [other repo](https://github.com/ioccc-src/mkiocccentry/issues/171) this is to he…

xexyl updated 10 hours ago
1425
langchain-ai/langchain #11770

Video imagery to text (Closed Captioning)

### Feature request Implement a feature using Langchain's image_captions.py and audio_speech_to_text.py to produce .srt files. This system will provide both subtitles and visual scene descriptions, e…

A2113S updated 8 months ago
8
GX77/Dual-Stream-Transformer-for-Generic-Event-Boundary-Captioning #1

Combining Features of Swin Transformer and other Features

Hello, thank you for sharing your code. Can you help me in this scenario: For a video captioning model, I have sampled each video with 16 frames. I've employed a Video Swin Transformer to extract vid…

adeljalalyousif updated 1 year ago
2
tensorflow/tensorflow #24520

"ValueError: Cannot take the length of Shape with unknown ra…

**System information** - Have I written custom code (as opposed to using a stock example script provided in TensorFlow): yes - OS Platform and Distribution (e.g., Linux Ubuntu 16.04): "18.04.1 LTS…

dineshdharme updated 6 months ago
46
antoyang/VidChapters #7

inference without speech

hi! may i know how to do the inference without speech? I've set the --no_speech but so that the output is []. And when i do inference in activitynet and charades dataset, the output looks like it…

jli262 updated 11 months ago
3
comfyanonymous/ComfyUI #2708

Chrome pops up incessantly with, "http://127.0.0.1:8188/ say…

Was trying to load a workflow discussed [here ](https://www.youtube.com/watch?v=qW1I7in1WL0&t=236s )after fiddling w/ a few others when I started getting a pop-up error that comes right back if clicke…

CCpt5 updated 9 months ago
1
huggingface/transformers #27727

Streaming support in automatic-speech-recognition pipeline

### Feature request I'd like to request for the ability to stream back chunks of audio transcripts instead of having to wait for the entire audio to be processed. For real time use cases, it helps to…

CoderHam updated 11 months ago
2

上一页 1...18 19 20 21 22 23 24...41 下一页

406 results for video-captioning-model

406 results
for video-captioning-model