w-okada / voice-changer

リアルタイムボイスチェンジャー Realtime Voice Changer
Other
16.63k stars 1.81k forks source link

Add a possiblity to lower monitor gain even lower to 0.01 from current 0.1. #699

Closed Smokyyy closed 1 year ago

Smokyyy commented 1 year ago

Issue Type

Feature Request

vc client version number

MMVCServerSIO_win_onnxgpu-cuda_v.1.5.3.12a

OS

Windows 10 Pro, 22H2, 19045.3208

GPU

GTX 1080

Clear setting

no

Sample model

yes

Input chunk num

yes

Wait for a while

The GUI successfully launched.

read tutorial

yes

Extract files to a new folder.

no

Voice Changer type

RVC

Model type

pyTorchRVCv2, f0

Situation

Create a slider within the server's audio mode to independently adjust the volume of the monitor output audio. I want a built-in way to output my voice-changed audio through my headphones at a low volume, in order to hear myself speak without distractions.

application window capture

No response

logs on terminal

D:\Program Files\Voice Changer\MMVCServerSIO>MMVCServerSIO.exe -p 18888 --https false --content_vec_500 pretrain/checkpoint_best_legacy_500.pt --content_vec_500_onnx pretrain/content_vec_500.onnx --content_vec_500_onnx_on true --hubert_base pretrain/hubert_base.pt --hubert_base_jp pretrain/rinna_hubert_base_jp.pt --hubert_soft pretrain/hubert/hubert-soft-0d54a1f4.pt --nsf_hifigan pretrain/nsf_hifigan/model --crepe_onnx_full pretrain/crepe_onnx_full.onnx --crepe_onnx_tiny pretrain/crepe_onnx_tiny.onnx --rmvpe pretrain/rmvpe.pt --model_dir model_dir --samples samples.json Booting PHASE :main PYTHON:3.10.11 (tags/v3.10.11:7d4cc5a, Apr 5 2023, 00:38:17) [MSC v.1929 64 bit (AMD64)] Activating the Voice Changer. [Voice Changer] download sample catalog. samples_0003_t2.json [Voice Changer] download sample catalog. samples_0003_o2.json [Voice Changer] download sample catalog. samples_0003_d2.json [Voice Changer] model_dir is already exists. skip download samples. Internal_Port:18888 protocol: HTTP


Please open the following URL in your browser.
http://<IP>:<PORT>/
In many cases, it will launch when you access any of the following URLs.
http://127.0.0.1:18888/

[VCClient] Access http://127.0.0.1:18888/ [VCClient] wait web server...0 http://127.0.0.1:18888/ Booting PHASE :main Booting PHASE :MMVCServerSIO [Voice Changer] model slot is changed -1 -> 8 ................RVC [Voice Changer] [RVCr2] Creating instance VoiceChangerV2 Initialized (GPU_NUM(cuda):1, mps_enabled:False, onnx_device:GPU) [Voice Changer][RVC]: update_settings gpu:0 [Voice Changer][RVCr2] Initializing... gin_channels: 256 self.spk_embed_dim: 109 [Voice Changer] generate new embedder. (no embedder) 2023-08-12 21:54:51.2896296 [W:onnxruntime:, session_state.cc:1030 onnxruntime::VerifyEachNodeIsAssignedToAnEp] Some nodes were not assigned to the preferred execution providers which may or may not have an negative impact on performance. e.g. ORT explicitly assigns shape related ops to CPU to improve perf. 2023-08-12 21:54:51.2980570 [W:onnxruntime:, session_state.cc:1032 onnxruntime::VerifyEachNodeIsAssignedToAnEp] Rerunning with verbose output on a non-minimal build will show node assignments. [Voice Changer] Loading index... Try loading... model_dir\8\added_IVF1970_Flat_nprobe_1_EvaElfie_v2.index GENERATE INFERENCER<voice_changer.RVC.inferencer.RVCInferencerv2.RVCInferencerv2 object at 0x000001945C7EFCD0> GENERATE EMBEDDER<voice_changer.RVC.embedder.OnnxContentvec.OnnxContentvec object at 0x000001945E7F3B80> GENERATE PITCH EXTRACTOR<voice_changer.RVC.pitchExtractor.HarvestPitchExtractor.HarvestPitchExtractor object at 0x000001945E7F3BB0> [Voice Changer] [RVC] Initializing... done [Voice Changer][RVC]: update_settings serverReadChunkSize:192 [Voice Changer][RVC]: update_settings f0Detector:rmvpe [VCClient] wait web server...10 http://127.0.0.1:18888/ [VCClient] wait web server...20 http://127.0.0.1:18888/ [Voice Changer][RVC]: update_settings silentThreshold:0.00015 [Voice Changer][RVC]: update_settings enableServerAudio:1 [Voice Changer][RVC]: update_settings serverAudioSampleRate:48000 [Voice Changer][RVC]: update_settings serverInputDeviceId:14 [Voice Changer][RVC]: update_settings serverOutputDeviceId:13 [Voice Changer][RVC]: update_settings extraConvertSize:32768 [Voice Changer][RVC]: update_settings serverOutputAudioGain:3.3 [Voice Changer][RVC]: update_settings serverInputAudioGain:2.6 [Voice Changer][RVC]: update_settings serverMonitorDeviceId:12 [Voice Changer][RVC]: update_settings modelSlotIndex:1691866298008 [VCClient] wait web server... done 200 [2023-08-12 21:55:13] connet sid : jp7mI95NrmNNTNBNAAAC [2023-08-12 21:55:13] connet sid : s5CxD9Ki_vKhG7yXAAAD

Smokyyy commented 1 year ago

Actually nvm, I'm stupid, the gain below monitor output does what I want.