w-okada / voice-changer

リアルタイムボイスチェンジャー Realtime Voice Changer
Other
16.15k stars 1.76k forks source link

[Question] No Audio output on Linux with Pipewire. Problem on my end? #758

Closed veprogames closed 1 year ago

veprogames commented 1 year ago

Issue Type

Question

vc client version number

Installed manually

OS

Linux 6.4.11-arch2-1 x86_64 GNU/Linux

GPU

NVIDIA GeForce RTX 2060 SUPER (8GB)

Clear setting

yes

Sample model

yes

Input chunk num

yes

Wait for a while

The GUI successfully launched.

read tutorial

yes

Extract files to a new folder.

yes

Voice Changer type

RVC

Model type

pyTorch (pth)

Situation

Setup

(mamba is based on libconda so should be no problem) directory is repo/server directory

mamba create -n voicechanger -c conda-forge python=3.10
mamba activate voicechanger
pip install -U pip numpy wheel
pip install -r requirements.txt --no-build-isolation
python MMVCServerSIO.py -p 18888 --https true \\
    --content_vec_500 pretrain/checkpoint_best_legacy_500.pt  \\
    --content_vec_500_onnx pretrain/content_vec_500.onnx \\
    --content_vec_500_onnx_on true \\
    --hubert_base pretrain/hubert_base.pt \\
    --hubert_base_jp pretrain/rinna_hubert_base_jp.pt \\
    --hubert_soft pretrain/hubert/hubert-soft-0d54a1f4.pt \\
    --nsf_hifigan pretrain/nsf_hifigan/model \\
    --crepe_onnx_full pretrain/crepe_onnx_full.onnx \\
    --crepe_onnx_tiny pretrain/crepe_onnx_tiny.onnx \\
    --rmvpe pretrain/rmvpe.pt \\
    --model_dir model_dir \\
    --samples samples.json

pactl info:

Server String: /run/user/1000/pulse/native
Library Protocol Version: 35
Server Protocol Version: 35
Is Local: yes
Client Index: 1111
Tile Size: 65472
[...]
Server Name: PulseAudio (on PipeWire 0.3.77)
Server Version: 15.0.0
Default Sample Specification: float32le 2ch 48000Hz
Default Channel Map: front-left,front-right
Default Sink: alsa_output.usb-ROCCAT_ROCCAT_Khan_AIMO-00.analog-stereo
Default Source: alsa_input.usb-ROCCAT_ROCCAT_Khan_AIMO-00.analog-stereo
Cookie: [...]

pacman -Qs pipewire

local/helvum 0.4.1-1
    GTK patchbay for PipeWire
local/libpipewire 1:0.3.77-2
    Low-latency audio/video router and processor - client library
local/libwireplumber 0.4.14-1
    Session / policy manager implementation for PipeWire - client library
local/pipewire 1:0.3.77-2
    Low-latency audio/video router and processor
local/pipewire-alsa 1:0.3.77-2
    Low-latency audio/video router and processor - ALSA configuration
local/pipewire-audio 1:0.3.77-2
    Low-latency audio/video router and processor - Audio support
local/pipewire-jack 1:0.3.77-2
    Low-latency audio/video router and processor - JACK support
local/pipewire-pulse 1:0.3.77-2
    Low-latency audio/video router and processor - PulseAudio replacement
local/wireplumber 0.4.14-1
    Session / policy manager implementation for PipeWire

application window capture

No response

logs on terminal

    Booting PHASE :__main__
    PYTHON:3.10.12 | packaged by conda-forge | (main, Jun 23 2023, 22:40:32) [GCC 12.3.0]
    Activating the Voice Changer.
[Voice Changer] download sample catalog. samples_0003_t2.json
[Voice Changer] download sample catalog. samples_0003_o2.json
[Voice Changer] download sample catalog. samples_0003_d2.json
[Voice Changer] model_dir is already exists. skip download samples.
    Internal_Port:18888
    protocol: HTTPS(self-signed), key:keys/20230825_201649.key, cert:keys/20230825_201649.cert
    -- ---- -- 
    Please open the following URL in your browser.
    https://<IP>:<PORT>/
    In many cases, it will launch when you access any of the following URLs.
    https://127.0.0.1:18888/
    https://192.[...]:18888/
    Booting PHASE :__mp_main__
    The server process is starting up.
    Booting PHASE :MMVCServerSIO
[Voice Changer] model slot is changed -1 -> 3
................RVC
[Voice Changer] [RVCr2] Creating instance 
VoiceChangerV2 Initialized (GPU_NUM(cuda):1, mps_enabled:False, onnx_device:GPU)
[Voice Changer][RVC]: update_settings gpu:0
[Voice Changer][RVCr2] Initializing... 
2023-08-25 20:16:52.344220650 [W:onnxruntime:, session_state.cc:1030 VerifyEachNodeIsAssignedToAnEp] Some nodes were not assigned to the preferred execution providers which may or may not have an negative impact on performance. e.g. ORT explicitly assigns shape related ops to CPU to improve perf.
2023-08-25 20:16:52.344235139 [W:onnxruntime:, session_state.cc:1032 VerifyEachNodeIsAssignedToAnEp] Rerunning with verbose output on a non-minimal build will show node assignments.
[Voice Changer] generate new embedder. (no embedder)
[Voice Changer] use torch contentvec Not implemented
[Voice Changer] Loading index...
Try loading... model_dir/3/added_IVF2736_Flat_nprobe_1_v2.index.bin
GENERATE INFERENCER<voice_changer.RVC.inferencer.OnnxRVCInferencer.OnnxRVCInferencer object at 0x7efcc02b4ca0>
GENERATE EMBEDDER<voice_changer.RVC.embedder.FairseqHubert.FairseqHubert object at 0x7efcc02b4dc0>
GENERATE PITCH EXTRACTOR<voice_changer.RVC.pitchExtractor.HarvestPitchExtractor.HarvestPitchExtractor object at 0x7efcc02b4e80>
[Voice Changer] [RVC] Initializing... done
[Voice Changer][RVC]: update_settings enableServerAudio:1
[Voice Changer][RVC]: update_settings serverInputDeviceId:7
[Voice Changer][RVC]: update_settings serverOutputDeviceId:13
[Voice Changer][RVC]: update_settings f0Detector:rmvpe
[Voice Changer][RVC]: update_settings serverMonitorDeviceId:-1
[Voice Changer][RVC]: update_settings serverAudioSampleRate:48000
[Voice Changer][RVC]: update_settings serverOutputAudioGain:0.9
[Voice Changer][RVC]: update_settings serverInputAudioGain:3.5
[Voice Changer][RVC]: update_settings serverReadChunkSize:16
[Voice Changer][RVC]: update_settings extraConvertSize:8192
[Voice Changer][RVC]: update_settings modelSlotIndex:1692986920003
[2023-08-25 20:17:00] connet sid : iM50N9SYVviAm37sAAAB
[2023-08-25 20:17:00] connet sid : n6Flw6mjOEnWTGdkAAAD
[Voice Changer] update configuration: serverOutputDeviceId 7
[Voice Changer][RVC]: update_settings serverOutputDeviceId:7
[Voice Changer] update configuration: serverAudioStated 1
[Voice Changer][RVC]: update_settings serverAudioStated:1
Devices:
  [Input]: ServerAudioDevice(kind='audioinput', index=7, name='ROCCAT Khan AIMO Analog Stereo', hostAPI='JACK Audio Connection Kit', maxInputChannels=4, maxOutputChannels=2, default_samplerate=48000.0, available_samplerates=[]) None
  [Output]: ServerAudioDevice(kind='audiooutput', index=7, name='ROCCAT Khan AIMO Analog Stereo', hostAPI='JACK Audio Connection Kit', maxInputChannels=4, maxOutputChannels=2, default_samplerate=48000.0, available_samplerates=[]) None
  [Monitor]: None None
Sample Rate:
  [Model]: 40000
  [Input]: 48000 -> True
  [Output]: 48000 -> True
[Voice Changer] server audio performance []
                status: started:1, model_sr:40000, chunk:16
                input  : id:7, sr:48000, ch:4
                output : id:7, sr:48000, ch:2
[Voice Changer] server audio performance []
                status: started:1, model_sr:40000, chunk:16
                input  : id:7, sr:48000, ch:4
                output : id:7, sr:48000, ch:2
[Voice Changer] server audio performance []
                status: started:1, model_sr:40000, chunk:16
                input  : id:7, sr:48000, ch:4
                output : id:7, sr:48000, ch:2
[Voice Changer] server audio performance []
                status: started:1, model_sr:40000, chunk:16
                input  : id:7, sr:48000, ch:4
                output : id:7, sr:48000, ch:2
[Voice Changer] server audio performance []
                status: started:1, model_sr:40000, chunk:16
                input  : id:7, sr:48000, ch:4
                output : id:7, sr:48000, ch:2
[Voice Changer] update configuration: serverAudioStated 0
[Voice Changer][RVC]: update_settings serverAudioStated:0
[Voice Changer] server audio performance []
                status: started:0, model_sr:40000, chunk:16
                input  : id:7, sr:48000, ch:4
                output : id:7, sr:48000, ch:2
serverAudioStarted Changed: 0
w-okada commented 1 year ago

I have no idea. try https://github.com/w-okada/voice-changer/blob/master/tutorials/trouble_shoot_communication_ja.md

veprogames commented 1 year ago

Sorry for confusion. Turns out something silently fails when using Firefox based browser. Works fine with Ungoogled Chromium.

Closing.