w-okada / voice-changer

リアルタイムボイスチェンジャー Realtime Voice Changer
Other
15.34k stars 1.65k forks source link

[ISSUE]: Inputted Audio Reduced Speed & Frequency #1058

Open Sunwarrior312 opened 6 months ago

Sunwarrior312 commented 6 months ago

Voice Changer Version

MMVCServerSIO_win_onnxgpu-cuda_v.1.5.3.5a.zip

Operational System

Windows 11

GPU

NVIDIA RTX 3060

Read carefully and check the options

Model Type

RVC

Issue Description

I've tried to use RVC but for some reason every time I tired to record, the input kept playing my voice at 50% speed and frequency (or even less). I reset my settings and used the default model, but I can't seem to get it to work. My screenshot shown is with chunk and extra at minimum settings and every else at default.

Application Screenshot

image

Logs on console

[Voice Changer] Prepare done. [Voice Changer] Switching model.. [Voice Changer] Switching model..done [Voice Changer] Switching model.. [Voice Changer] Switching model..done Switch Model Type: <voice_changer.RVC.RVC.RVC object at 0x000001E740100610> [Voice Changer] update configuration: dstId 1 [Voice Changer] update configuration: crossFadeEndRate 1 [Voice Changer] update configuration: crossFadeOverlapSize 1024 [Voice Changer] update configuration: framework PyTorch [Voice Changer] update configuration: onnxExecutionProvider CPUExecutionProvider [Voice Changer] update configuration: f0Factor 1 [Voice Changer] update configuration: f0Detector harvest [Voice Changer] update configuration: serverOutputAudioSampleRate 48000 [Voice Changer] update configuration: serverInputAudioSampleRate 48000 [Voice Changer] update configuration: serverInputAudioBufferSize 24576 [Voice Changer] update configuration: serverOutputAudioBufferSize 24576 [Voice Changer] update configuration: serverInputDeviceId -1 [Voice Changer] update configuration: serverOutputDeviceId -1 [Voice Changer] update configuration: serverReadChunkSize 256 [Voice Changer] update configuration: serverInputAudioGain 1 [Voice Changer] update configuration: serverOutputAudioGain 1 [Voice Changer] update configuration: protect 0.5 [Voice Changer] update configuration: tran 10 [Voice Changer] update configuration: extraConvertSize 4096 [Voice Changer] update configuration: modelSamplingRate 48000 [Voice Changer] update configuration: silenceFront 1 [Voice Changer] update configuration: useDiff 1 [Voice Changer] update configuration: diffMethod dpm-solver [Voice Changer] update configuration: diffSpkId 1 [Voice Changer] update configuration: diffAcc 20 [Voice Changer] update configuration: kStep 120 [Voice Changer] update configuration: threshold -45 [Voice Changer] update configuration: inputSampleRate 48000 [Voice Changer] update configuration: modelSlotIndex 1704132087005 [Voice Changer] Prepare Model of slot: 5 inferencerTypeinferencerTypeinferencerTypeinferencerType pyTorchRVC gin_channels: 256 self.spk_embed_dim: 109 [Voice Changer] embeder type is not match EnumEmbedderTypes.hubert hubert_base [Voice Changer] generate new embedder. (not match) [Voice Changer] update configuration: modelSlotIndex 1704132089005 [Voice Changer] Prepare Model of slot: 5 inferencerTypeinferencerTypeinferencerTypeinferencerType pyTorchRVC [Voice Changer] Loading index... Try loading... model_dir\5\added_IVF1225_Flat_nprobe_1.index GENERATE INFERENCER <voice_changer.RVC.inferencer.RVCInferencer.RVCInferencer object at 0x000001E7400C02B0> GENERATE EMBEDDER <voice_changer.RVC.embedder.FairseqHubert.FairseqHubert object at 0x000001E74BE965C0> GENERATE PITCH EXTRACTOR <voice_changer.RVC.pitchExtractor.HarvestPitchExtractor.HarvestPitchExtractor object at 0x000001E74BE96CB0> [Voice Changer] Prepare done. [Voice Changer] Switching model.. [Voice Changer] Switching model..done gin_channels: 256 self.spk_embed_dim: 109 [Voice Changer] embeder type is not match EnumEmbedderTypes.hubert hubert_base [Voice Changer] generate new embedder. (not match) [Voice Changer] Loading index... Try loading... model_dir\5\added_IVF1225_Flat_nprobe_1.index GENERATE INFERENCER <voice_changer.RVC.inferencer.RVCInferencer.RVCInferencer object at 0x000001E7400C1AB0> GENERATE EMBEDDER <voice_changer.RVC.embedder.FairseqHubert.FairseqHubert object at 0x000001E7401C0D00> GENERATE PITCH EXTRACTOR <voice_changer.RVC.pitchExtractor.HarvestPitchExtractor.HarvestPitchExtractor object at 0x000001E7401C0A90> [Voice Changer] Prepare done. [Voice Changer] Switching model.. [Voice Changer] Switching model..done [Voice Changer] update configuration: serverReadChunkSize 2048 [Voice Changer] update configuration: recordIO 1 [IORecorder] delete old analyze file. C:\Users\LIMHEL~1\AppData\Local\Temp\tmpbqnxo3be\tmp_dir\in.wav [IORecorder] delete old analyze file. C:\Users\LIMHEL~1\AppData\Local\Temp\tmpbqnxo3be\tmp_dir\out.wav [Voice Changer] update configuration: recordIO 0 [Voice Changer] update configuration: modelSlotIndex 1704132209000 [Voice Changer] Prepare Model of slot: 0 inferencerTypeinferencerTypeinferencerTypeinferencerType onnxRVC [Voice Changer] embeder type is not match EnumEmbedderTypes.hubert hubert_base [Voice Changer] generate new embedder. (not match) [Voice Changer] Loading index... GENERATE INFERENCER <voice_changer.RVC.inferencer.OnnxRVCInferencer.OnnxRVCInferencer object at 0x000001E74014FD30> GENERATE EMBEDDER <voice_changer.RVC.embedder.FairseqHubert.FairseqHubert object at 0x000001E74014CA00> GENERATE PITCH EXTRACTOR <voice_changer.RVC.pitchExtractor.HarvestPitchExtractor.HarvestPitchExtractor object at 0x000001E74014CBB0> [Voice Changer] Prepare done. [Voice Changer] Switching model.. [Voice Changer] Switching model..done [Voice Changer] update configuration: recordIO 1 [IORecorder] delete old analyze file. C:\Users\LIMHEL~1\AppData\Local\Temp\tmpbqnxo3be\tmp_dir\in.wav [IORecorder] delete old analyze file. C:\Users\LIMHEL~1\AppData\Local\Temp\tmpbqnxo3be\tmp_dir\out.wav [Voice Changer] update configuration: recordIO 0