video-captioning-model Search Results

406 results
for video-captioning-model

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

iburenko/multimodal-reading-group #12

[Paper Suggestion] PLLaVA : Parameter-free LLaVA Extension f…

Vision-language pre-training has significantly elevated performance across a wide range of image-language applications. Yet, the pre-training process for video-related tasks demands exceptionally larg…

iburenko updated 2 months ago
2
dajinstory/daily-arxiv-noti #102

New submissions for Wed, 12 May 21

## Keyword: super resolution There is no result ## Keyword: gan ### Towards Discovery and Attribution of Open-world GAN Generated Images - **Authors:** Sharath Girish, Saksham Suri, Saketh Rambhatla…

dajinstory updated 3 years ago
1
dajinstory/daily-arxiv-noti #34

New submissions for Fri, 4 Dec 20

## Keyword: detection ### Video Anomaly Detection by Estimating Likelihood of Representations - **Authors:** Yuqi Ouyang, Victor Sanchez - **Subjects:** Computer Vision and Pattern Recognition (cs.C…

dajinstory updated 3 years ago
1
livepeer/grants #193

[Video Disruptors Grant]:AI extended Livepeer SDK( player an…

### Please describe your project. Start with the need or problem you are trying to solve with this project. Describe why your solution is going to adequately solve this problem. ### Challenge: …

scapula07 updated 9 months ago
15
webmachinelearning/webnn #375

Support for transformers

While our [draft charter](https://www.w3.org/2023/03/proposed-webmachinelearning-charter.html) says that the group: > priority on building blocks required by well-known model architectures such as re…

dontcallmedom updated 1 week ago
35
bmaltais/kohya_ss #2701

Flux.1 LoRA training

Kohya has added preliminary support for Flux.1 LoRA to his SD3 branch. I have created a `sd3-flux.1` branch and updated to the latest sd-scripts sd3 branch code... No GUI integration yet... I will sta…

bmaltais updated 3 weeks ago
485
w3c/tt-reqs #8

Support 3D space (360°/VR/XR) as target presentation environ…

In the past years more and more applications show up that show media content in 3D space, like 360° videos (stereoscopic or not), VR experiences, etc.. Subtitles (if present) are mostly shown at the b…

pthopesch updated 5 years ago
14
ufal/whisper_streaming #121

Dealing with constant hallucinations

Using the large-v3 model to transcribe greek audio from a live stream, I am often met with continuous results writing "Υπότιτλοι AUTHORWAVE" It seems the model is bugged in a way that outputs that …

J-Korn updated 2 weeks ago
7
w3c/webvtt #503

Behavior with controls, particularly non-native controls, ov…

What should be the expected behavior of cues when controls or obscure the cues? According to the spec, cue rendering should be re-done when the native controls are shown (steps 4 and 5 of [the Proc…

gkatsev updated 1 year ago
23
mutonix/Vript #6

run the demo in A100 80G: CUDA out of memory

LAW1223 updated 3 months ago
5

上一页 1...10 11 12 13 14 15 16...41 下一页

406 results for video-captioning-model

406 results
for video-captioning-model