audio-visual-learning Search Results

VaibhavCodeClub/learn #75

New Feature: Learning from a video and learning from poems

**Is your feature request related to a problem? Please describe.** currently the learning experience in the app is text and audio based, some might find it challenging to grasp concepts. **Describ…

patelkiran185 updated 3 months ago

WordPress/Learn #2854

Introduction to WordPress Coding Standards

# Details - Content type (Online Workshop, Lesson, Course, Tutorial, or Lesson Plan): Lesson - Content title: Introduction to WordPress Coding Standards - Topic description: Explain to the ne…

jonathanbossenger updated 3 days ago

sterrettJD/gpLM-reading-group #3

some curriculum suggestions

Hey John! Here's the curriculum that I've worked on in the past. It's a bit less focused on language models as a sole topic, and more on modern ML from a broad perspective. - Essential Concepts of …

zmaas updated 1 week ago

tkuri/papers #229

Learning to Predict Salient Faces: A Novel Visual-Audio Sali…

## 論文概要映像中の音声が人間の注意力に影響を与えることを明らかにした研究。大規模な音声付アイトラッキング映像データベース(34名被験者が300の動画を視聴)を取得して傾向を分析、基本的に人の顔に注意が行くことを確認。映像中のSaliencyを予測するためのマルチモーダルNNを提案。 ![bib_20200920 00](https://user-images.githubuserco…

tkuri updated 3 years ago

UserModels2223/group-project-m-f-h #1

Comment on mfh by group edluma

Greetings, After being subjected to a very distracting stimulus, it might be hard to focus on a more subdued one. With that in mind, do you think that the interspersed audio+visual stimuli might ne…

No-IT-u-Love updated 1 year ago

yyf17/NavigationProject #8

CVPR 2022

CVPR 2022 # 格式 * **Paper Title** *Author(s)* CVPR, 2022. [[Paper]](link) [[Code]](link) [[Website]](link) 需要填充： 1）Paper Title 2） Author(s) 3） 3个“link” 4）两篇文章之间间隔一行 # agent Meta Ag…

yyf17 updated 2 years ago

hche11/Localizing-Visual-Sounds-the-Hard-Way #14

The reason why using theshold (0.5) in cal_CIoU

Hi! Why do you use the theshold (0.5) in cal_CIoU, although the training doesn't give any information about the 0.5? In other words, is it just from the hyp-param tunning, or reasoned from mathemat…

Sunjuhyeong updated 5 months ago

tamlhp/deepfake-benchmark #4

Papers on Audio Deepfake Detection

Every Breath You Don't Take: Deepfake Speech Detection Using Breath https://arxiv.org/abs/2404.15143

tamlhp updated 1 week ago

huggingface/transformers #28236

Add CAV-MAE audio-image encoder model

### Model description Contrastive Audio-Visual Masked Autoencoder (CAV-MAE) combines two major self-supervised learning frameworks: contrastive learning and masked data modeling, to learn a joint and…

rationalism updated 8 months ago

Nuvotion-Visuals/Harmony3 #45

Implement Audio-Visual Calls and Screen Sharing in Channels …

#### Overview We propose to implement audio-visual calls and screen sharing within our platform's channels using the WebRTC technology facilitated by the PeerJS client/server framework. This feature w…

tom-leamon updated 4 months ago

1000+ results for audio-visual-learning

1000+ results
for audio-visual-learning