audio-recognition Search Results

1000+ results
for audio-recognition

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

avinashkranjan/Amazing-Python-Scripts #1539

[GSSOC'23] Voice Assistant AI Bot: Empowering Conversations …

# Aim The aim of the AI Bot project is to create a voice assistant that can understand user input, generate responses using the OpenAI API, and provide both textual and auditory feedback. # Detail…

slayerrr12 updated 12 months ago
1
Hangz-nju-cuhk/Talking-Face-Generation-DAVS #15

Table 3: Audio-Visual Speech Recognition and 1:25000 audio-v…

Hi, after reading the paper, I am confused about the table 3. What is the meaning of visual acc, audio acc and combine acc? How did you calculate the result of 67.5%, 91.8%, 95.2%? ![default](http…

zzzzhuque updated 5 years ago
1
IAHispano/Applio #698

[Feature]: Add RMSEnergyExtractor [audio feature extraction]…

### Description What Does RMSEnergyExtractor Do? Calculates RMS Energy: RMS energy is a measure of the power of an audio signal. It is computed as the square root of the average of the squared …

Mixomo updated 1 month ago
3
bambocher/pocketsphinx-python #42

RuntimeError: new_Decoder returned -1

hi i want speech recognition using sphinx but as the accuracy of sphinx is not good at all. so need to decode the wav file but getting some error into the file....can any one please help me on this. …

vipulsurya updated 4 years ago
2
SlapBot/stephanie-va #24

Error in loading Google_Cloud credentials

Hi, This is a bug or error, maybe. This [line](https://github.com/SlapBot/stephanie-va/blob/master/Stephanie/AudioManager/audio_recognizer.py#L50) passes the key as json file. And therefore [speec…

humanely updated 4 years ago
2
microsoft/table-transformer #169

Question on fine-tuning TATR with a proprietary dataset

Hi! I am trying to fine tune the TATR model with a proprietary dataset. I am currently trying to convert the dataset to the same format as FinTabNet and then using the script in this repository (s…

srivatsan-sridhar99 updated 4 months ago
3
k2-fsa/icefall #1674

[Help needed] Support https://huggingface.co/datasets/Alex-S…

[MSR-86K: An Evolving, Multilingual Corpus with 86,300 Hours of Transcribed Audio for Speech Recognition Research](https://arxiv.org/pdf/2406.18301) The above paper has just open-sourced a dataset fo…

csukuangfj updated 4 months ago
2
Uberi/speech_recognition #671

FLAC conversion utility not available

Steps to reproduce ------------------ 1. (How do you make the issue happen? Does it happen every time you try it?) 2. (Make sure to go into as much detail as needed to reproduce the issue. Postin…

Lvjinhong updated 6 months ago
3
memo/ofxMSATensorFlow #32

creating an example for sound gesture recognition with ofxMS…

Hi Memo Would be interesting to have a demo of audio gesture recognition with ofxMSATensorFlow. not necessarily only speech, but any sort of audio musical and non-musical gesture (both deterministi…

ghost updated 5 years ago
1
Amir231123/Meo #2

transcribe_audio.py.

Amir231123 updated 4 months ago
1

上一页 1...21 22 23 24 25 26 27...100 下一页

1000+ results for audio-recognition

1000+ results
for audio-recognition