-
### Feature Name
openai/whisper-large-v3
### Feature Description
- Research and implement whisper-larger-v3
- Whisper is a state-of-the-art model for automatic speech recognition (ASR) and speech …
-
> the turbo model is an optimized version of large-v3 that offers faster transcription speed with a minimal degradation in accuracy.
[openai/whisper: Robust Speech Recognition via Large-Scale Weak …
-
## 論文リンク
https://arxiv.org/abs/1903.10346
## 公開日(yyyy/mm/dd)
2019/03/22
## 概要
音声認識において人間に違いが知覚しづらく、かつ over-the-air (スピーカーで流したり、マイクを使って録音したり、残響効果がある中で音を流したり、という風に現実世界で使用する場合を想定したケース) でも誤認識させるよ…
-
# Taiwanese Hokkien Tone Recognition
This task aims to recognize "tones" in Taiwanese Hokkien. Taiwanese Hokkien is a tonal language with multiple tones that can change the meaning of words. Accura…
-
Currently, we're using the original Kinect API for speech recognition. Investigate the possibility of taking the audio signal and using the Bing Speech API instead - it's a far more accurate and robu…
-
Implement and integrate OpenAI's Whisper service for advanced speech-to-text capabilities.
Tasks:
- [ ] 1. Research and Documentation on Whisper API.
- [ ] 2. Integrate Whisper API for speech-to-…
-
Artificial Intelligence (AI) is a branch of computer science that focuses on building systems or machines capable of performing tasks that typically require human intelligence. These tasks include lea…
-
Multi-Domain Spoken Dialogue System with Extensibility and Robustness against Speech Recognition Errors. Komatani. 2006 SIGDIAL
http://www.aclweb.org/anthology/W06-1302
-
Problem:
The existing speech recognition function works well but has some limitations in terms of error handling and flexibility. Specifically:
1) Microphone access issues are not handled: If a m…
-
# Mr. Detective - Interrogate and Unravel Mysteries
![image](https://github.com/TusharAMD/SuperSpeechSaga/assets/59115865/e833f560-e012-4024-8179-896ed15963e6)
Welcome to Mr. Detective, a thrillin…