video-captioning-model Search Results

406 results
for video-captioning-model

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

usefulsensors/moonshine #19

How to achieve live transcription

The title of the paper https://arxiv.org/pdf/2410.15608 is > Moonshine: Speech Recognition for Live Transcription and Voice Commands However, the model is a non-streaming model, could you describe…

csukuangfj updated 1 week ago
26
w3c/webvtt #320

Live captioning - incremental cues review

**To address REQ2 of #318 , we are after an extension of the WebVTT file format.** The principle idea is that we map the TextTrack API calls from #319 to how we would archive them in a WebVTT file to…

silviapfeiffer updated 3 years ago
8
williamyang1991/Rerender_A_Video #26

module 'keras.backend' has no attribute 'is_tensor'

No module 'xformers'. Proceeding without it. ControlLDM: Running in eps-prediction mode DiffusionWrapper has 859.52 M params. making attention of type 'vanilla' with 512 in_channels Working with z…

pondloso updated 5 months ago
11
NVIDIA/DALI #741

Videos with various length and fps

Hi, Thanks for the nice library. I found DALI while looking for a video loader for action recognition. I found that DALI yet cannot handle various resolution as in the issue #725 which is necessary f…

kkjh0723 updated 4 years ago
14
vllm-project/vllm #7558

[RFC]: Support for video input

### Motivation. Currently models like `llava-hf/llava-next-video*` recognize image and video inputs with different tokens, and do different computations. Therefore vLLM should provide new APIs and …

TKONIY updated 3 days ago
17
boringresearch/Test_coursetemplate #1

Example of course

xihajun updated 1 year ago
1
jnhwkim/Pensees #5

StoryDALL-E: Adapting Pretrained Text-to-Image Transformers …

### StoryDALL-E: Adapting Pretrained Text-to-Image Transformers for Story Continuation **Maharana et al., ECCV 2022** > Recent advances in text-to-image synthesis have led to large pretrained tran…

KyonP updated 1 year ago
1
Cxbx-Reloaded/game-compatibility #254

BeatDown - Fists of Vengeance [CC-021] [1.02]

[Xbe.txt](https://github.com/Cxbx-Reloaded/game-compatibility/files/1258030/Xbe.txt) [CxbxDebug.txt](https://github.com/Cxbx-Reloaded/game-compatibility/files/1255977/CxbxDebug.txt) [KrnlDebug.txt…

fatjohnny118 updated 4 years ago
1
w3c/wcag #795

Revisiting imbalance between 1.2.4 Captions (Live) (AA) and …

This picks up something I already noted about two years ago, but could maybe be discussed in the context of WCAG 2.2/silver ... https://lists.w3.org/Archives/Public/w3c-wai-gl/2017JulSep/0052.html …

patrickhlauke updated 5 years ago
28
Rangozhang/VideoCaption #1

Error using eval.lua on a video

Hi I discovered your work on VideoCaption and neuraltalk2 while working on a documentary about respublica Tuva which is a small country, near to Mongolia, federated by Russia. The movie itself is abo…

oxmah updated 7 years ago
9

上一页 1...5 6 7 8 9 10 11...41 下一页

406 results for video-captioning-model

406 results
for video-captioning-model