vision-and-audio Search Results

1000+ results
for vision-and-audio


huggingface/transformers #19865

Add VATT model

### Model description Hey, as discussed with @NielsRogge a few weeks back, I'd like to work on adding the "VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text"…

johko updated 9 months ago
8
facebookresearch/ImageBind #60

help with embedding arithmetic and image retrieval

Hi, Thanks for your great work. I am interested in the embedding arithmetic and image retrieval, as the example shown in Figure 4 of the paper. In the paper, the embedding arithmetic is described…

bakachan19 updated 5 months ago
3
sgl-project/sglang #1487

Development Roadmap (2024 Q4)

Here is the development roadmap for 2024 Q4. Contributions and feedback are welcome ([**Join Bi-weekly Development Meeting**](https://t.co/4BFjCLnVHq)). Previous 2024 Q3 roadmap can be found in #634. …

Ying1123 updated 10 hours ago
8
EliasKotlyar/Xiaomi-Dafang-Hacks #1866

support WYZE Cam v3

[[WYZE Cam v3 with Color Night Vision, Wired 1080p HD Indoor/Outdoor Video Camera, 2-Way Audio, Works with Alexa, Google Assistant, and IFTTT]](https://www.amazon.com/Vision-Indoor-Outdoor-Camera-Assi…

wp-coin updated 1 year ago
1
facebookresearch/ImageBind #19

Vision x Vision NOT what we want

![image](https://github.com/facebookresearch/ImageBind/assets/99708007/dd7f1421-7a84-44dc-925c-b7a60afaea7f) As you can see above, I use the original assets (text, image, audio) in the main branch, and fi…

zxyonaroll updated 1 year ago
3
rigaya/QSVEnc #216

Ubuntu 24.04 dynamic HDR10 hangs

Hi! I'm trying to use QSVEnc 7.70 with an Arc A380 to transcode my 4K HEVC HDR10Plus files to 4K AV1 HDR10Plus. I've extracted the HDR10Plus metadata to a file using [hdr10plus_tool](https://github.co…

supersnellehenk updated 2 days ago
19
DevProgress/maps-showcase #9

How are we handling accessibility?

There are a number of possible accessibility concerns for this project. The ones that came to mind include: Captioning for videos, and any other audio-only things should be transcribed. For blind …

suzannehillman updated 8 years ago
3
pytorch/pytorch #90560

[discussion, idea] Batched, vectorized base64 decoding / enc…

### 🚀 The feature, motivation and pitch Discussed in context of scriptable base64 decoding here: https://github.com/pytorch/vision/issues/6878#issuecomment-1343439120, http://www.alfredklomp.com/pr…

vadimkantorov updated 2 months ago
10
lobehub/lobe-chat #497

[Request] Autoplay TTS For the Agent

### 🥰 Feature Description Have an option for the agent to automatically convert every interaction to TTS. ### 🧐 Proposed Solution On the Agent Panel Creation, have a switch button tha…

lmsutools updated 2 months ago
2
firebase/firebase-ios-sdk #13000

[FR]: Add Vertex AI Vision & Audio Sample Code for iOS

### Description @andrewheard 1. As we upgraded to 1.5 Flash (https://github.com/firebase/firebase-ios-sdk/pull/12979), is it possible to achieve like [Project Astra](https://youtu.be/nXVvvRhiGjI) no…

1998code updated 5 months ago
3

