-
My WAV file was converted to mono, 22050 Hz, 16-bit PCM beforehand. I got this error log:
----------
Existing language matches target language
Loading Whisper Model!
Discarding ID3 tags because m…
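
A quick way to confirm that a file really has the mono, 22050 Hz, 16-bit PCM layout described above is to read its header. This is a minimal sketch using only Python's standard `wave` module; the helper names `describe_wav` and `is_mono_22050_16bit` are hypothetical and are not part of the tool from the report.

```python
import wave


def describe_wav(path):
    """Return (channels, sample_rate_hz, bits_per_sample) for a PCM WAV file."""
    with wave.open(path, "rb") as wav:
        # getsampwidth() reports bytes per sample, so multiply by 8 for bits.
        return wav.getnchannels(), wav.getframerate(), wav.getsampwidth() * 8


def is_mono_22050_16bit(path):
    """Check for the format named in the report: mono, 22050 Hz, 16-bit."""
    return describe_wav(path) == (1, 22050, 16)
```

Calling `describe_wav("input.wav")` on the converted file (the file name is a placeholder) shows which of the three properties, if any, differs from what was intended.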
-
**Describe the bug**
I tried running STT on a system with 8 NVIDIA A100 GPUs. I found that running on up to 4 devices does not scale well. Each batch seems to be serialized across the GPUs. In t…
-
**LocalAI version:**
LocalAI v2.19.4
Docker Image ID: d99f62d40302 / TAG: latest-cpu
**Environment, CPU architecture, OS, and Version:**
uname -a: Linux XXX 6.8.0-39-generic #39~22.04.1-…
-
For TTS voices, I think you should give edge_tts a try.
It's free, and supports 80 different voices.
-
**🚀 Integrate [goruut](https://github.com/neurlang/goruut) phonemizer via the [pygoruut](https://github.com/neurlang/pygoruut) [wrapper](https://pypi.org/project/pygoruut/)**
I was frustrate…
-
As a Swiss German speaker, I would love to have text spoken in Swiss German. If possible, even in one of the various dialects.
There is a freely available dataset with 3 hours of high quality speech with transcript in…
-
Ever since I started training XTTS, the script has not saved the model as “best” even when the evaluation metrics in certain epochs are better.
It seems that it only saves the…
-
Does Piper support AMD GPU acceleration with ROCm?
-
Tested on Windows 10 64-bit with the Piper 2023.9.9-1 prerelease.
A ported cat was used.
When a text has around 6000 characters, Piper starts mumbling after reading for a while.
Command used was:
$ cat de…
-
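Environment:
Windows 11, using the downloadable release VideoLingo_v1.4
Problem description:
I have imported the video. After clicking to start subtitle processing, the command line shows the following output: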
🎬➡️🎵 Converting to audio with librosa ......
LLVM ERROR: Symbol not found: __svml_cosf8_ha