CorentinJ / Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time
Other
52.05k stars 8.71k forks source link

Abort when python demo_toolbox.py #11

Closed Interfish closed 5 years ago

Interfish commented 5 years ago

Hi: I am trying to run your code on a centos server with X11 forwarding open. But when I try python demo_toolbox dataset , it prints

Arguments:
    datasets_root:    dataset
    enc_models_dir:   encoder/saved_models
    syn_models_dir:   synthesizer/saved_models
    voc_models_dir:   vocoder/saved_models
Aborted

I believe I installed all required packages. Looks like the error is not caused by python but some low level call. So is there any way to print more error message? Or is there any way to run without GUI ? (I think although I open X11 forward on this server but it still might not fit as good as a pure GUI machine).

Thanks!

jarar21 commented 5 months ago

Preparing the encoder, the synthesizer and the vocoder... Loaded encoder "encoder.pt" trained to step 1564501 Synthesizer using device: cuda Building Wave-RNN Trainable Parameters: 4.481M Loading model weights at saved_models\default\vocoder.pt Testing your configuration with small inputs. Testing the encoder... Traceback (most recent call last): File "C:\voice\demo_cli.py", line 83, in embedding = encoder.embed_utterance(audio_waveform) File "C:\voice\encoder\inference.py", line 144, in embed_utterance frames = audio.wav_to_mel_spectrogram(wav) File "C:\voice\encoder\audio.py", line 58, in wav_to_mel_spectrogram frames = librosa.feature.melspectrogram( TypeError: melspectrogram() takes 0 positional arguments but 2 positional arguments (and 2 keyword-only arguments) were given

Check my comment

jarar21 commented 5 months ago

when I try to record the audio I got an exception as

Traceback (most recent call last): File "F:\GenAi\Real-Time-Voice-Cloning\toolboxinit.py", line 163, in record wav = self.ui.record_one(encoder.sampling_rate, 5) File "F:\GenAi\Real-Time-Voice-Cloning\toolbox\ui.py", line 224, in record_one self.set_loading(i, duration) File "F:\GenAi\Real-Time-Voice-Cloning\toolbox\ui.py", line 384, in set_loading self.loading_bar.setValue(value * 100) TypeError: setValue(self, value: int): argument 1 has unexpected type 'numpy.float64'

use this int(value 100)) self.loading_bar.setValue(int(value 100))

andresgomesz commented 5 months ago

Hi,

I was able to fix the issue using an older versions of Python and another libraries. When I installed the software I did it with the latest versions, and this was the problem to me.

I wish this can help to your problem.

Best.

dam23333 commented 5 months ago

help please Capture d’écran 2024-04-08 à 06 08 44

jarar21 commented 5 months ago

help please Capture d’écran 2024-04-08 à 06 08 44

python demo_toolbox.py -d "J:/AI-Voice-App/"(Use you own path where your dataset is saved) -m saved_models

Shappysenpai commented 3 months ago

Preparing the encoder, the synthesizer and the vocoder... Loaded encoder "encoder.pt" trained to step 1564501 Synthesizer using device: cuda Building Wave-RNN Trainable Parameters: 4.481M Loading model weights at saved_models\default\vocoder.pt Testing your configuration with small inputs. Testing the encoder... Traceback (most recent call last): File "C:\voice\demo_cli.py", line 83, in embedding = encoder.embed_utterance(audio_waveform) File "C:\voice\encoder\inference.py", line 144, in embed_utterance frames = audio.wav_to_mel_spectrogram(wav) File "C:\voice\encoder\audio.py", line 58, in wav_to_mel_spectrogram frames = librosa.feature.melspectrogram( TypeError: melspectrogram() takes 0 positional arguments but 2 positional arguments (and 2 keyword-only arguments) were given

were you able to get a solution??