-
Proyectos que pueden ayudar.
# audio and text (aka forced alignment)
https://github.com/readbeyond/aeneas
# DeepSpeech
Modelos en español: Puntos de control previamente entrenados y modelos…
-
# Possible options to synchronize ASR in LFortran and LPython
A common solution to keep a library like `libasr` synchronized between multiple projects (`LFortran` and `LPython`) on GitHub is to use…
kp992 updated
1 month ago
-
### 🚀 The feature, motivation and pitch
As we all know, GPT-4o is an end2end multi-modal models, which support Speech to Text/Speech. I have some ideas about it:
1. Speech to Text: Can we have a t…
-
A cool feature that might be worth exploring would be allowing users to translate to any target language, rather than just english.
It is known that whisper was trained to take input language --> e…
-
**Is your feature request related to a problem? Please describe.**
I have a dataset with uncommon words that I cannot expect Whisper or any ASR model to be able transcribe accurately. The dataset is …
-
Hi,
I want to finetune this great model with my own data set. is it possible ? Is there any pre-trained model for this ?
Thanks in advance
-
@sanchit-gandhi
Where can I find faster-whisper model evaluation metrics? I don't see them on ASR leaderboard. Thanks!
-
**Describe the Bug**
Whenever using ES directly with the sectorfiles, the `Settings/VoiceChannels.txt` files added by GNG in the install package and the `version.txt` file added by ES when loading AS…
-
Can we turn this model to be used in real time streaming like what has been done in the Zipformer Model.
I saw some implementations for making the streamed Whisper https://github.com/ufal/whisper_str…
-
### Feature Name
openai/whisper-large-v3
### Feature Description
- Research and implement whisper-larger-v3
- Whisper is a state-of-the-art model for automatic speech recognition (ASR) and speech …