-
With so much noise, it's basically unusable. Google's was perfectly noise-free.
I hope I am looking at the correct samples:
https://soundcloud.com/mazzzystar/sets/speech-conversion-sample
-
Hi, I don't have a lot of experience with audio processing. What's unclear to me is how to map these features to the corresponding sound sequence.
I'm feeding these to an RNN along with the corr…
-
The generated audio contains some electrical-sounding noise, which noticeably degrades the listening quality.
In the original paper, a phase loss term is added to the loss. Do you have any thoughts on it?
-
I downloaded the 272976-iteration model and ran the notebook's `synthesis.py`, but got this error:
```
RuntimeError: Error(s) in loading state_dict for Tacotron:
Missing key(s) in state_dict: "encoder.cbhg.cbhg.conv…
```
-
Hello, I'm trying to design a neural network that predicts the f0, sp, and ap parameters for the WORLD vocoder from a Mel spectrogram. To design it, I need to know what the relation function of the firs…
-
link: https://deepmind.com/blog/high-fidelity-speech-synthesis-wavenet/
paper: https://deepmind.com/documents/131/Distilling_WaveNet.pdf
referenced from:
- https://twitter.com/heiga_zen/status/…
-
Hi there, I'm training a Polish speech model. The spectrograms look quite good, and even the sound in the /Log category is not bad for 35k steps. Even so, I can't synthesize any audio from a text file …
-
Hi,
I'm trying to train a model on a custom 16 kHz database. Tacotron 1 training finished successfully. When it came to WaveNet training, I got the following error:
[Condition x == y di…
-
Sorry if this is off-topic (Deep Voice vs. Tacotron), but it seems the Tacotron 2 paper has now been released.
The speech samples sound better than ever (I think):
https://google.github.io/tacotron/pub…