-
Great repo! Ran some tests with it and it sounds good for speech, but the limited testing I did for singing didn't sound too great. Is this expected / is there a way to adapt it to work well with sing…
-
I'm curious if you can share any observations about using iSTFTNet with LibriTTS. The paper implies that the performance of iSTFTNet was insufficient for LibriTTS and so HiFiGAN was adopted, but I was…
-
**Describe the bug**
`time_stretch` does not return the same signal when `rate=1`.
**To Reproduce**
```
import librosa
import matplotlib.pyplot as plt
audio_signal, _ = librosa.load(librosa.ex('…
-
Please check whether this paper is about 'Voice Conversion' or not.
## article info.
- title: **An overview of text-to-speech systems and media applications**
- summary: Producing synthetic voice, s…
-
Thank you very much for the repository - do you have any usage examples for the different tasks such as continuation & editing? :-)
-
Sometimes HuBERT mishears words (phonetically?) and transcribes them incorrectly. Is there a way to manually specify what gets fed in when vocoding?
-
My parameter settings are the same as the ones you provided, but the training results are very different from those in the paper: the training FAD never drops below 1. I would lik…
-
I'm reading through the paper, and I'm wondering: at inference time, could you manipulate the duration predictor, or some other component, to allow controllable elongation of certain phonemes?
…
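For illustration, here is a hypothetical sketch of the general mechanism in FastSpeech-style models (not this repo's API): scale selected entries of the predicted durations before the length regulator repeats each phoneme's frames. All names (`length_regulate`, `stretch`, etc.) are invented for this sketch:

```python
import numpy as np

def length_regulate(frames, durations):
    # frames: (num_phonemes, dim) encoder outputs
    # durations: (num_phonemes,) integer frame counts
    # Repeat each phoneme's frame vector according to its duration.
    return np.repeat(frames, durations, axis=0)

phoneme_frames = np.random.randn(4, 8)   # 4 phonemes, 8-dim features
durations = np.array([3, 5, 2, 4])       # predicted frame counts

stretch = np.ones(4)
stretch[1] = 2.0                         # elongate phoneme 1 by 2x
new_durations = np.round(durations * stretch).astype(int)

out = length_regulate(phoneme_frames, new_durations)
print(out.shape)  # (3 + 10 + 2 + 4, 8) = (19, 8)
```

Whether this works in practice depends on the model exposing the predicted durations before expansion; models that predict durations implicitly would need a different hook.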
-
Speeding up inference only makes sense if it doesn't keep filling up RAM; only then would this be a production-level open-source library. What's the point if one has to delete t…
-
In your paper, you say:
> Recent work confirms that later layers give poorer predictions of pitch, prosody, and speaker identity. Based on these observations, we found that using a layer with high …