audio-recognition Search Results

1000+ results
for audio-recognition

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

MarioRuggieri/Emotion-Recognition-from-Speech #6

IndexError: string index out of range

Loading data from berlin dataset... Error in readAudioFile(): Unknown file type! Traceback (most recent call last): File "emorecognition.py", line 41, in db = Dataset(path,db_type,decode=…

flydragon2018 updated 5 years ago
1
mozilla/DeepSpeech #2443

Issue: Missing initial frames causes deepspeech to skip firs…

For support and discussions, please use our [Discourse forums](https://discourse.mozilla.org/c/deep-speech). If you've found a bug, or have a feature request, then please create an issue with the f…

alokprasad updated 3 years ago
18
realpython/python-speech-recognition #4

changing the key fails

I got a new GCP API key, and I tried to use it but I keep getting broken connection: `Traceback (most recent call last): File "main.py", line 13, in speech=r.recognize_google(audio,key='…

jzoudavy updated 6 years ago
3
ebowwa/caringmind #26

Python & SWIFT: Diarization and Embeddings

type IsSpeaking bool type WhoIsSpeaking uuid known speakers [chat on diarization embeddings](https://chatgpt.com/share/6704175b-9184-800f-bc01-2076a8af85bf) [chat on running models locall…

ebowwa updated 1 month ago
2
mravanelli/pytorch-kaldi #249

Word transcription of TIMIT dataset

How can word-level instead of phoneme-level speech recognition be done with the TIMIT dataset? I build and train models. On the other hand, I have only phoneme transcription. I want word transcriptio…

shessam updated 3 years ago
1
Vaibhavs10/insanely-fast-whisper #192

is_flash_attn_2_available() returns False

I installed the repo without CLI on virtualized instances from Vast.ai with A100 40GB and 80 GB. is_flash_attn_2_available() is False. Does it mean flash-attn is not used by inference. does it advers…

Majdoddin updated 4 months ago
1
samgolding9/zataangstuff #71

Feature Request: Voice enabled Quickgold?

``` Obviously this is a non-trivial request/suggestion. Something like the Google Mobile Voice recognition. Workflow: Tap home/Lift phone to ear *audio prompt* Say the name of the application QG la…

GoogleCodeExporter updated 8 years ago
1
alphacep/vosk-api #809

hardware dependend recognition problems: M$ Surface versus L…

Different recording hardware seems to make a big difference in the overall accuracy. On an AMD Ryzen5 with a Logitech C925e as input device at ~75% loudness level, the accuracy of the word "carola…

Cyborgscode updated 2 years ago
5
RIOT-OS/RIOT #18791

add ESP32 Eye board

#### Description [ESP-EYE](https://www.espressif.com/en/products/devkits/esp-eye/overview) is a development board for image recognition and audio processing, which can be used in various AIoT appli…

donsez updated 1 year ago
1
dynamic-superb/dynamic-superb #108

[Task] Musical Style Transformation

# Task Name Musical Style Transformation ## Task Objective The goal of this task is to transform a given music piece from one musical genre to another, preserving the original melody and lyri…

qaz159qaz159 updated 5 months ago
3

上一页 1...28 29 30 31 32 33 34...100 下一页

1000+ results for audio-recognition

1000+ results
for audio-recognition