-
Hello,
After following the README, I got the Android and iOS permission modals, but after accepting them, neither the `result` nor the `speechstart` event fires. The microphone is working because when I use Audio…
-
[FEATURE] Feature that recognizes words and audio
-
We just added support for more file types when you attach/paste/drop them. We also have support for turning audio into text (see `src/lib/speech-recognition.ts`). Let's add support for importing audi…
-
### Problem Description
When I export a video with multichannel (5.1) audio from FCPX, the resulting file contains a 5.1-channel, 16-bit LPCM audio track that plays back just fine. If I import this …
-
Is the SenseVoice-Large model currently commercialized? If it is commercialized, where can I purchase the API? I would like to use the SenseVoice-Large model for audio event recognition.
-
### Component
Dasharo firmware
### Device
NovaCustom V56 14th Gen
### Dasharo version
v0.9.1-rc4
### Dasharo Tools Suite version
--
### Test case ID
AUD007.001
### Brief summary
AUD007 fail…
-
### Deep Learning Simplified Repository (Proposing new issue)
:red_circle: **Project Title** : Emotion Recognition from Audio using Deep Learning
:red_circle: **Aim** : To build a deep learning mode…
-
Hello,
I would like to inquire whether the training data for the qwen2-audio-instruction model includes the IEMOCAP dataset for fine-tuning in speech emotion recognition tasks. Any clarification on …
-
### System Info
```
None of PyTorch, TensorFlow >= 2.0, or Flax have been found. Models won't be available and only tokenizers, configuration and file/data utilities can be used.
None of PyTorch,…
-
```
path = r"D:\Project\Python_Project\FasterWhisper\large-v3"
model = WhisperModel(model_size_or_path=path, device="cuda", local_files_only=True)
segments, info = model.transcribe("audio.wav",…