audio-visual-speech-recognition Search Results

452 results
for audio-visual-speech-recognition

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

owickstrom/komposition #94

Feature: speech recognition for visual feedback on audio

Could wire up speech recognition on the audio chunks to: - auto-name the clips when importing - show the text an the audio block in the timeline Bonus: add keyword / topic extraction.

robinp updated 4 years ago
4
tszheichoi/awesome-sensor-logger #45

Headphone Motion Data Not Recorded When Using Both Headphone…

#### Environment - **Device**: iPhone 12 - **Earphones**: AirPods Pro (2nd Generation) - **Software Version**: iOS [Specify Version] #### Steps to Reproduce 1. Connect AirPods Pro to the iPhone…

supernaiter updated 4 weeks ago
4
kadirnar/ComfyUI-Transformers #12

ROADMAP of ComfyUI-Transformers

## Computer Vision: - [x] Add Depth Estimation pipeline - [ ] Add Image Classification pipeline - [ ] Add Image Segmentation pipeline - [ ] Add Mask Generation pipeline - [ ] Add Object Detecti…

kadirnar updated 3 months ago
1
dynamic-superb/dynamic-superb #113

[Task]Emoji-Grounded Speech Emotion Recognition

# Task Name Emoji-Grounded Speech Emotion Recognition ## Task Objective The primary goal of the Emoji-Grounded Speech Emotion Recognition (EG-SER) task is to develop a system that can accurat…

ericsunkuan updated 3 months ago
3
Hangz-nju-cuhk/Talking-Face-Generation-DAVS #15

Table 3: Audio-Visual Speech Recognition and 1:25000 audio-v…

Hi, after reading the paper, I am confused about the table 3. What is the meaning of visual acc, audio acc and combine acc? How did you calculate the result of 67.5%, 91.8%, 95.2%? ![default](http…

zzzzhuque updated 5 years ago
1
tamlhp/deepfake-benchmark #4

Papers on Audio Deepfake Detection

Every Breath You Don't Take: Deepfake Speech Detection Using Breath https://arxiv.org/abs/2404.15143

tamlhp updated 3 weeks ago
19
Sxjdwang/TalkLip #9

the face in output video is blurred

Hi, thanks for your great work! I tested talklip with my own video, but the generated face in output video is blurred and appear clear border with background. The resolution of my test video is 1600x9…

ZardYuan updated 10 months ago
3
huggingface/huggingface.js #174

Add inference demos

Add demos on https://huggingface.co/huggingfacejs (feel free to contribute demos, or to ask joining the organization) ### Natural Language processing - [ ] Fill mask - [ ] Summarization - [ ] …

coyotte508 updated 11 months ago
14
RobotStudyCompanion/Documentation #1

map out a simple system architecture

- [x] explore options for mapping out sys arch - [x] explore tools: mkdocs, markmap in VS Code - [x] prep Github Page for docs hosting - [x] visually represent the ideal RSC as a mindmap - [x] note &…

mbz4 updated 1 month ago
8
GasimV/Commercial_Projects #2

Speech Processing Models

`torchaudio` is an extension library for PyTorch, designed to facilitate audio processing using the same PyTorch paradigms familiar to users of its tensor library. It provides powerful tools for audio…

GasimV updated 3 months ago
16

上一页 1...1 2 3 4 5 6 7...46 下一页

452 results for audio-visual-speech-recognition

452 results
for audio-visual-speech-recognition