-
I am having a lot of trouble with speaker diarization across many different platforms and models.
```toml
[target.'cfg(any(windows, target_os = "linux"))'.dependencies]
sherpa-rs = { version =…
```
-
Hi, thank you for your very nice work! I have rerun this project, and it has run 90K steps; the loss_id_psnt is around 0.07. I tried to feed in an in-domain speaker's melspec and his speaker emb…
-
```python
import logging
import math
import random
from dataclasses import dataclass, field, fields
import dataclasses
from math import pi
from typing import Sequence, Tuple, Union, Optional
import tor…
```
-
Hi,
I want to use the **WavLM** model to extract speaker embeddings for a speaker verification task. In [the paper](https://arxiv.org/pdf/2110.13900.pdf) it is mentioned that for the task of speaker verificat…
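For reference, here is a minimal sketch of one common way to get such embeddings, assuming the Hugging Face `microsoft/wavlm-base-plus-sv` checkpoint (WavLM with an x-vector head fine-tuned for speaker verification); the audio file names are placeholders:

```python
import torch
import torchaudio
from transformers import Wav2Vec2FeatureExtractor, WavLMForXVector

# "microsoft/wavlm-base-plus-sv" pairs WavLM with an x-vector head;
# the audio file names below are placeholders.
feature_extractor = Wav2Vec2FeatureExtractor.from_pretrained("microsoft/wavlm-base-plus-sv")
model = WavLMForXVector.from_pretrained("microsoft/wavlm-base-plus-sv").eval()

def load_16k(path):
    # Load an utterance and resample to the 16 kHz rate WavLM expects.
    wav, sr = torchaudio.load(path)
    return torchaudio.functional.resample(wav, sr, 16000).mean(dim=0)

utterances = [load_16k("enroll.wav").numpy(), load_16k("test.wav").numpy()]
inputs = feature_extractor(utterances, sampling_rate=16000,
                           padding=True, return_tensors="pt")
with torch.no_grad():
    emb = model(**inputs).embeddings  # one speaker embedding per utterance

# Cosine similarity between the two embeddings scores same vs. different speaker.
score = torch.nn.functional.cosine_similarity(emb[0], emb[1], dim=-1)
print(f"similarity: {score.item():.3f}")
```

A threshold on this cosine score, tuned on a development set, then gives the accept/reject decision.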
-
Hi,
I am trying out OpenVoice (v1), and it mechanically worked, but the cloned voice is far from the reference speaker. Sometimes I gave a male reference speaker mp3 and got back a female voice…
-
```python
from pyannote.audio import Pipeline  # needed for Pipeline.from_pretrained
from pyannote.audio.pipelines.utils.hook import ProgressHook

pipeline = Pipeline.from_pretrained("pyannote/speaker-diarization")
```
On my Mac, default `pipeline.embedding_batch_size…
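For context, here is a minimal sketch of how these pieces typically fit together; the batch size value, audio path, and hook usage below are illustrative assumptions, not the poster's actual setup:

```python
from pyannote.audio import Pipeline
from pyannote.audio.pipelines.utils.hook import ProgressHook

pipeline = Pipeline.from_pretrained("pyannote/speaker-diarization")
# Smaller batches lower peak memory at the cost of throughput (value assumed).
pipeline.embedding_batch_size = 16

# ProgressHook prints per-stage progress (segmentation, embedding, ...).
with ProgressHook() as hook:
    diarization = pipeline("audio.wav", hook=hook)  # "audio.wav" is a placeholder

for turn, _, speaker in diarization.itertracks(yield_label=True):
    print(f"{turn.start:.1f}s-{turn.end:.1f}s: {speaker}")
```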
-
Theoretically, the original speaker embedding information is already contained in the spectrogram, and the network will automatically squeeze that information out after c…
-
Dear Developers!
In demo_part1.ipynb, it is written that
```
Obtain Tone Color Embedding
The source_se is the tone color embedding of the base speaker. It is an average of multiple senten…
```
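For orientation, a rough sketch of the flow that notebook cell describes; the checkpoint paths and audio file names below are assumptions for illustration, not taken from the notebook:

```python
import torch
from openvoice import se_extractor
from openvoice.api import ToneColorConverter

# Paths below are assumed; adjust to your checkpoint layout.
converter = ToneColorConverter('checkpoints/converter/config.json', device='cpu')
converter.load_ckpt('checkpoints/converter/checkpoint.pth')

# source_se: tone color embedding of the base speaker, an average over
# multiple sentences, shipped with the base-speaker checkpoints.
source_se = torch.load('checkpoints/base_speakers/EN/en_default_se.pth')

# target_se: tone color embedding extracted from the reference recording.
target_se, _ = se_extractor.get_se('reference.mp3', converter, vad=True)

# Re-color the base speaker's synthesized speech toward the reference voice.
converter.convert(audio_src_path='base_tts_output.wav',
                  src_se=source_se, tgt_se=target_se,
                  output_path='cloned.wav')
```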
-
Hi.
I trained Flowtron on two speakers for a total of 50 hours, 25 for each. After that, I wanted to fine-tune the model on 10 speakers with 20-30 minutes of data each, using the basic checkpoint of the mode…
-
Hello,
the provided vocoder checkpoint trained with mHuBERT does not support multi-speaker synthesis. Do you have a multi-speaker checkpoint?
`mhubert_vp_en_es_fr_it3_400k_layer11_km1000_lj`