-
Hello!
I use this code:
```
from subtoaudio import SubToAudio
sub = SubToAudio(model_name="tts_models/multilingual/multi-dataset/xtts_v2")
subtitle = sub.subtitle("texts/1-1.srt")
sub.conv…
-
### What is the issue?
Hey amazing team! I’m experiencing an issue with the context window size when using the new Mistral Nemo model on Ollama version 0.2.8-rc2 on my Apple Mac Silicon M2 Pro. Accor…
-
## 🐛 Bug
onnx export failed when output size are not factor of input size for adaptive_avg_pool2d
## To Reproduce
Steps to reproduce the behavior:
1. Build a model with `nn.AvgPool2d(kerne…
ghost updated
7 months ago
-
Please check whether this paper is about 'Voice Conversion' or not.
## article info.
- title: **MSM-VC: High-fidelity Source Style Transfer for Non-Parallel Voice Conversion by Multi-scale Style Model…
-
I train single speaker model from scratch like as **Instructions to run** in README.MD but i get this error when start training
CUDA_LAUNCH_BLOCKING=1 python pflow/train.py experiment=ljspeech
…
-
# Intro
Hi, I followed AMI recipe and successfully trained a model (actually tuned p value which is 0.007). I got very similar dev/eval DER scores with reported ones. Then I tried to do inference wit…
-
Hello. Just discovered this.
Is there a way to set language? Maybe changing speaker?
I'd like to read Spanish epub.
-
Hi
Do you know if CPU and MPS support is on the roadmap?
Thanks!
-
# Raw string literal
* [x] Proposed
* [x] Prototype: No prototype needed.
* [x] Implementation: In: https://github.com/dotnet/roslyn/tree/features/RawStringLiterals
* [x] Specification: https://…
-
From the huggingface repo, the stated speed is 2.5% real-time:
```
Real-time factor
Real-time factor is around 2.5% using one Nvidia Tesla V100 SXM2 GPU (for the neural inference part) and one Inte…