-
**Is your feature request related to a problem? Please describe.**
As I only speak English, I cannot really use the cosyvoice gradio interface.
**Describe the solution you'd like**
Please conside…
-
Hi,
I specified the language model for transcription in params as follows, and prepared a wav file according to the model format.
```rb
params = {
'action' => "start",
…
-
Now that we have run a few participants, we can get an idea of the language model (the sentences and vocabulary) used in the test. Instead of using the Wall Street Journal language model from Sphinx we…
-
### Is your feature request related to a problem? Please describe.
Summary
I am writing to propose the addition of a content filter layer for both the input and output stages of the project's lang…
-
I don't know if this is a bug or just some kind of usage error, but I have tried both pocketsphinx and porcupine as wake word providers. Both work and have good recognition rates but the recognition aft…
-
## Adding a Dataset
- **Name:** VoxLingua107
- **Description:** VoxLingua107 is a speech dataset for training spoken language identification models. The dataset consists of short speech segments aut…
-
I have run into a weird issue whose cause I can't find (using the latest git version and the large-v3 model).
I generate translated subtitles from a Danish audio file. The word-level timings are pretty…
-
- Azure log file: https://drive.google.com/file/d/1vEO1PlaPGDX77Iru4DDD3XG15zEcK4UA/view?usp=sharing.
- Code to reproduce: https://codefile.io/f/HLanoKgTH6.
- Model file: https://drive.google.co…
-
I'm training the flowtron model from scratch on the LJSpeech dataset. It seems to run ok. However, after nearly three days, the attention matrix still has the following form and the resulting generate…
-
> [!NOTE]
> This document outlines the planned features and changes for version `1.0.0.0`. It is a development roadmap and **NOT** a final release note. Features and timelines may be subject to chang…