-
Hello,
I have made a lot of progress in getting this project to run. I am successfully training net1 to 70% accuracy, and net 2 to within 0.007 loss.
When I run eval2.py, I get the error descri…
-
Just ran into this midst of training. I assume maybe the epoch ended and it tried to save something to disk, which is only few MB in size however.
```
2023-05-01 13:03:54,309 INFO [trainer.py:757] E…
-
Hello. I need a quick and easy offline speech recognition on OrangePiPC (single-board computer, power-like raspberry pi 2/3) When using pocketsphinx_continuous recognition phrase occurs in 3-5 seconds…
-
I tried before the recent code update.
Now working fine, both tts and stt (ASR)
Do you have suggestion how to improve the speech recognition accuracy?
Do you know DeepSpeech (PyTorch)? I wonde…
-
I have a question,can you help me?
why you cut the cmudict, and only 36964 english words in cmu_lex_data_raw.c
I know the cmudict contains 130000 english words, and I test the cmu_lts_model, it was …
-
I just reached 1 million steps in training my Hebrew TTS model, and it sounds pretty good. However, I noticed it struggles to pronounce English. After investigating, I found that Hebrew in espeak-ng u…
-
The last time I've worked with this it was using [OpenCC](https://pypi.org/project/OpenCC/). It is much more up to date and seems to have an active community. Las release from hanziconv is from 2016.
-
For unseen F to seen M conversion, the resulting pitch is very close to the source speaker , especially if the source pitch is much higher than seen M pitch.
I've used SR-based data augmentation s…
-
Is it possible to support audio subtitle files in textgrid format?
-
**Is your feature request related to a problem or a limitation? Please describe...**
Export to subtitle file.
This is a file that can be uploaded to youtube or branded in a video. It shows a text…