-
i got this message
melspectrogram() takes 0 positional arguments but 2 positional arguments (and 2 keyword-only arguments) were given
What should I do?
-
my config: n_mel=128,n_fft=2048,n_hop=256,sr=16000
it takes about 20 second for getting melspectrogram from 8 second long wav file!!!
Is it normal for this code?
-
Situation:
I am trying to do reconstruct the 80 dimensional mel-filter spectrogram from the output of the encoder (conformer), using the Fastspeech2 TTS decoder
Ideally I'd like to use the Fasts…
-
Preparing the encoder, the synthesizer and the vocoder...
Loaded encoder "encoder.pt" trained to step 1564501
Synthesizer using device: cuda
Building Wave-RNN
Trainable Parameters: 4.481M
Loading…
-
when I run the project in the colab,
i will show the error below:
Using cuda for inference.
Reading video frames...
Number of frames available for inference: 223
Traceback (most recent call l…
-
Great job on implementing paper!
Question: why did you use python_speech_features.fbank instead of librosa.feature.melspectrogram ?
Both transformations are the same, right?
-
Hi Corentin,
When I am testing demo.py I keep getting this error message
Traceback (most recent call last):
File "C:\Users\garne\real-time-voice-cloning\demo_cli.py", line 80, in
encode…
-
Hi,
I think there is an issue with [`backend.magnitude_to_decibel()`](https://github.com/keunwoochoi/kapre/blob/f41eb4b2ba2fd484500584169ec05ae5e12e09b7/kapre/backend.py#L70). This calculates the d…
-
Hi Jong,
Thank you for such a useful implementation! Sorry for a silly doubt, but I am a beginner in MIR and working with Onsets and Frames for a project of mine.
Looking at the STFT and Mel Spec…
-
Is there any further development planned? I find it interesting to have a reasonable audio augmentations / features generation library accelerated with JAX.
The most common library, [audiomentation…