-
### Description
We aim to enhance our speech-to-text (STT) model by fine-tuning it using exclusive speaker-specific data combined with our existing base training data. We will use Low-Rank Adaptation…
-
Is it not possible to transcribe long audio files, around ~3 hours? I am trying to transcribe the 3-hour audio to Hindi, but it uses huge memory.
```
import torch
import nemo.collections.asr as …
-
I followed the steps to add a vosk model. When accesing over http://localhost:20741 it is loading and understands me. But on wake up it keeps loading forever. Its the 3gb model. All installed on windo…
-
**Description:**
As a 'Data Analyst', I would like to only view sections that I need to submit data files for so that I'm not having to waste time or worry about data files that do not apply to my ST…
-
### What features would you like to see added?
What about "Upcoming STT/TTS Enhancements: The Google Cloud STT/TTS and Deepgram services are being planned for future integration."
### More detai…
-
Hello
Thanks for the pipeline, i was looking for some better usage stats that what open-webi provides by default.
Using the pipeline with your script, i get some better usage infos, and the users …
-
Is there a local deployment alternative for these services: Agora, AZURE_STT,AZURE_TTS,OPENAI
Similar to how OpenAI can be replaced with Olama
-
enemy stt
-
When I tested the new dataset, which has large nunber of siginicands I met:
alp/benchmarks/bench_compression_ratio/bench_alp_compression_ratio.cpp(293): error: Expected equality of these values:
…
-
### Description
We need to train stt-wav2vec2 model on the new datasets that we have gained also because of the new departments data introduced.
### Completion Criteria
Stt wav2vec2 model with better…