audio-visual-features Search Results

1000+ results
for audio-visual-features

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

EGO4D/audio-visual #12

Loading model for audio embeding

Hi, I am trying to extract the audio features from the clips. I've downloaded the clips and then I run run the code 'batch_audio_embedding.py'. (inside the folder audio-visual/active-speaker-detect…

emanuele-mincato updated 2 months ago
1
w3c/controller-document #23

Accessibility Self-Review of Controller Documents v1.0

The following issue contains the VCWG's Accessibility Self-Review of Controller Documents v1.0. The specification is a way of expressing identifiers and cryptographic material that is not exposed …

msporny updated 2 months ago
3
Gl0dny/hexapod #38

Issue 29: Music recognition

Gl0dny updated 1 week ago
2
ZebangCheng/Emotion-LLaMA #15

Transform my personal dataset into the MERR data format

Hello, great work!!! Could you please provide a script to transform my personal dataset into the MERR data format?

lucas0214 updated 4 weeks ago
5
v-iashin/video_features #121

Vggish feature vs i3d flow visual feature

Hi. I am trying to extract visual and audio features on raw video clips. For visual features, python main.py stack_size=24 step_size=8 extraction_fps=25 feature_type=i3d Eg. it gives 112x1024 dimens…

1980x updated 8 months ago
2
Nuvotion-Visuals/Harmony3 #45

Implement Audio-Visual Calls and Screen Sharing in Channels …

#### Overview We propose to implement audio-visual calls and screen sharing within our platform's channels using the WebRTC technology facilitated by the PeerJS client/server framework. This feature w…

tom-leamon updated 5 months ago
2
huggingface/diffusers #5760

DIFF-FOLEY: Synchronized Video-to-Audio Synthesis with Laten…

### Model/Pipeline/Scheduler description Video-to-Audio (V2A) models has recently gained attention for generating audio directly from silent videos, particularly in video/film production. However, pr…

clarencechen updated 11 months ago
2
hangzhaomit/Sound-of-Pixels #14

where is the pixelwise sound

Hi, I saw the func: forward_pixelwise in the code synthesizer, this is the one version of forward function that produce pixel-wise mask. However, throughout the code, and I found only the foward func …

TaoZheng9 updated 1 month ago
1
facebookresearch/av_hubert #85

Extraction of features with AV HuBERT

The tutorial mentioned for feature extraction. Are these the learned representations of AV-HuBERT or just extracting the features from input video file which needs to be passed to the AV HuBERT model…

shakeel608 updated 3 months ago
14
swarmauri/swarmauri-sdk #262

[Feature Request]: Speechify

## Feature Name Speechify ## Feature Description ## Overview of Speechify **Speechify** is a leading text-to-speech (TTS) platform designed to convert written text into natural-sounding sp…

abdulsamodazeez updated 1 month ago
1

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for audio-visual-features

1000+ results
for audio-visual-features