-
Hello
I'm tiring to make new model on VCTK dateset first I'm make generate speaker embedding by using code python generate_embeddings.py, Now Traing model by using file train.py but i have proble…
-
## Issue Description
When I try to run diarize.py or the Jupyter Notebook version, I encounter a dimension mismatch issue during the MSDD step
## log
```
100% [............................…
-
from mpl_toolkits.axes_grid1 import make_axes_locatable
from matplotlib.animation import FuncAnimation
from resemblyzer.audio import sampling_rate
from matplotlib import cm
from time import sleep,…
-
# Hello,
I'm currently working with pyannote/speaker-diarization-3.1 for embedding, specifically using pyannote/wespeaker-voxceleb-resnet34-LM. I've been experimenting with different batch sizes (3…
-
Embedding a Meetup group link using the appropriate group works fine on the Speaker single view, but the CSS `style` tags are stripped from content of the block's `wp-block-embed__wrapper` element in …
-
I am trying to train a TTS but I am wondering about the style of the speakers? My dataset contains multiple speakers with different speaking styles. Does the model retain the style for each voice or i…
-
### Describe the bug
Intel Mac.
```
tts=TTS(model_name='multi-dataset/xtts_v2/en',progress_bar=True).to('mps')
tts.tts_to_file(text='testing', file_path='out.wav', speaker='Craig Gutsy', language=…
-
I run make_spect.py and make_metadata.py to prepreocess the dataset (I used all speakers in VCTK). And then I used pretrained model of Speaker Encoder to extract speaker embedding and train the model…
-
### Tested versions
- Reproduced in 3.1.0
### System information
Ubuntu 22.04, Lenovo P1 Gen 5 Workstation A4500
### Issue description
I wanted to improve my segmentation with Pyannote, since mos…
-
Hello. I want to add Turkish to the CosyVoice model. How do I add a new language? What should I do? I want to add a new language and use it by fine tuning it.