Anjok07 / ultimatevocalremovergui

GUI for a Vocal Remover that uses Deep Neural Networks.
MIT License
16.46k stars 1.24k forks source link

Setting to use GPU conversion #1438

Open khoatvd opened 2 days ago

khoatvd commented 2 days ago

Everytime I try to use GPU Conversion, the program says error. Pls see the info here: Model: MDX23C_D1581 Segment size: 2848 - Overlap: 2 CPU: 13th Gen Intel(R) Core(TM) i7-13650HX 2.60 GHz GPU: RTX 4060 8GB GDDR6 RAM: 16GB DDR5 4800MHz

The error is sth related to GPU memory. From the read me file I think my GPU can meet the recommend but somehow it keep showing error as below. Pls show me how I can use GPU for faster conversion.

Process: MDX-Net

The application was unable to allocate enough GPU memory to use this model. Please close any GPU intensive applications and try again. If the error persists, your GPU might not be supported.

Raw Error Details:

OutOfMemoryError: "CUDA out of memory. Tried to allocate 2.78 GiB (GPU 0; 8.00 GiB total capacity; 8.99 GiB already allocated; 0 bytes free; 11.54 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF" Traceback Error: " File "UVR.py", line 6638, in process_start File "separate.py", line 652, in seperate File "separate.py", line 771, in demix File "torch\nn\modules\module.py", line 1190, in _call_impl File "lib_v5\tfc_tdf_v3.py", line 235, in forward File "torch\nn\modules\module.py", line 1190, in _call_impl File "lib_v5\tfc_tdf_v3.py", line 142, in forward File "torch\nn\modules\module.py", line 1190, in _call_impl File "torch\nn\modules\container.py", line 204, in forward File "torch\nn\modules\module.py", line 1190, in _call_impl File "torch\nn\modules\activation.py", line 684, in forward "

Error Time Stamp [2024-07-01 10:20:01]

Full Application Settings:

vr_model: UVR-DeEcho-DeReverb aggression_setting: 6 window_size: 1024 mdx_segment_size: 2848 batch_size: Default crop_size: 256 is_tta: False is_output_image: False is_post_process: False is_high_end_process: False post_process_threshold: 0.2 vr_voc_inst_secondary_model: No Model Selected vr_other_secondary_model: No Model Selected vr_bass_secondary_model: No Model Selected vr_drums_secondary_model: No Model Selected vr_is_secondary_model_activate: False vr_voc_inst_secondary_model_scale: 0.9 vr_other_secondary_model_scale: 0.7 vr_bass_secondary_model_scale: 0.5 vr_drums_secondary_model_scale: 0.5 demucs_model: Choose Model segment: Default overlap: 0.25 overlap_mdx: Default overlap_mdx23: 2 shifts: 2 chunks_demucs: Auto margin_demucs: 44100 is_chunk_demucs: False is_chunk_mdxnet: False is_primary_stem_only_Demucs: False is_secondary_stem_only_Demucs: False is_split_mode: True is_demucs_combine_stems: True is_mdx23_combine_stems: True demucs_voc_inst_secondary_model: No Model Selected demucs_other_secondary_model: No Model Selected demucs_bass_secondary_model: No Model Selected demucs_drums_secondary_model: No Model Selected demucs_is_secondary_model_activate: False demucs_voc_inst_secondary_model_scale: 0.9 demucs_other_secondary_model_scale: 0.7 demucs_bass_secondary_model_scale: 0.5 demucs_drums_secondary_model_scale: 0.5 demucs_pre_proc_model: No Model Selected is_demucs_pre_proc_model_activate: False is_demucs_pre_proc_model_inst_mix: False mdx_net_model: MDX23C-InstVoc D1581 chunks: Auto margin: 44100 compensate: Auto denoise_option: None is_match_frequency_pitch: True phase_option: Automatic phase_shifts: None is_save_align: False is_match_silence: True is_spec_match: False is_mdx_c_seg_def: False is_invert_spec: False is_deverb_vocals: False deverb_vocal_opt: Main Vocals Only voc_split_save_opt: Lead Only is_mixer_mode: False mdx_batch_size: Default mdx_voc_inst_secondary_model: No Model Selected mdx_other_secondary_model: No Model Selected mdx_bass_secondary_model: No Model Selected mdx_drums_secondary_model: No Model Selected mdx_is_secondary_model_activate: False mdx_voc_inst_secondary_model_scale: 0.9 mdx_other_secondary_model_scale: 0.7 mdx_bass_secondary_model_scale: 0.5 mdx_drums_secondary_model_scale: 0.5 is_save_all_outputs_ensemble: True is_append_ensemble_name: False chosen_audio_tool: Manual Ensemble choose_algorithm: Min Spec time_stretch_rate: 2.0 pitch_rate: 2.0 is_time_correction: True is_gpu_conversion: True is_primary_stem_only: False is_secondary_stem_only: True is_testing_audio: False is_auto_update_model_params: True is_add_model_name: False is_accept_any_input: False is_task_complete: False is_normalization: False is_use_opencl: False is_wav_ensemble: False is_create_model_folder: False mp3_bit_set: 320k semitone_shift: 0 save_format: WAV wav_type_set: PCM_32 device_set: NVIDIA GeForce RTX 4060 Laptop GPU:0 help_hints_var: True set_vocal_splitter: No Model Selected is_set_vocal_splitter: False is_save_inst_set_vocal_splitter: False model_sample_mode: False model_sample_mode_duration: 30 demucs_stems: All Stems mdx_stems: Vocals

ChrisTG742 commented 2 days ago

I think the underlying problem is the memory consuption of the VR-models. This seems to exceed the 8GB RAM your graphics-card offers. The currently processed data (block) must fit into memory completely. This is valid for CPU and for GPU-based processing. You can try to lower the "window-size", but I doubt that settings below 256 will give a satisfying result. I believe this is nothing the devs of this project could solve. This must be solved by maintainers of the used libraries like PyTorch: https://github.com/pytorch/pytorch

ReneRu commented 17 hours ago

I have the exact same issue, and my system runs on RTX 4090. I mean, I can do anything hardcore demanding, but for some reason, it cannot handle that.

_Last Error Received:

Process: Ensemble Mode

The application was unable to allocate enough GPU memory to use this model. Please close any GPU intensive applications and try again. If the error persists, your GPU might not be supported.

Raw Error Details:

OutOfMemoryError: "CUDA out of memory. Tried to allocate 35.38 GiB (GPU 0; 23.99 GiB total capacity; 38.07 GiB already allocated; 0 bytes free; 38.68 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF" Traceback Error: " File "UVR.py", line 6638, in process_start File "separate.py", line 856, in seperate File "separate.py", line 1001, in demix_demucs File "demucs\apply.py", line 196, in apply_model File "demucs\apply.py", line 222, in apply_model File "demucs\apply.py", line 256, in apply_model File "demucs\utils.py", line 490, in result File "demucs\apply.py", line 271, in apply_model File "torch\nn\modules\module.py", line 1190, in _call_impl File "demucs\htdemucs.py", line 593, in forward File "torch\nn\modules\module.py", line 1190, in _call_impl File "demucs\transformer.py", line 667, in forward File "torch\nn\modules\module.py", line 1190, in _call_impl File "demucs\transformer.py", line 365, in forward File "torch\nn\modules\transformer.py", line 546, in _sa_block File "torch\nn\modules\module.py", line 1190, in _call_impl File "torch\nn\modules\activation.py", line 1167, in forward File "torch\nn\functional.py", line 5161, in multi_head_attention_forward File "torch\nn\functional.py", line 1841, in softmax "

Error Time Stamp [2024-07-02 21:11:02]

Full Application Settings:

vr_model: Choose Model aggression_setting: 5 window_size: 512 mdx_segment_size: 256 batch_size: Default crop_size: 256 is_tta: False is_output_image: False is_post_process: False is_high_end_process: False post_process_threshold: 0.2 vr_voc_inst_secondary_model: No Model Selected vr_other_secondary_model: No Model Selected vr_bass_secondary_model: No Model Selected vr_drums_secondary_model: No Model Selected vr_is_secondary_model_activate: False vr_voc_inst_secondary_model_scale: 0.9 vr_other_secondary_model_scale: 0.7 vr_bass_secondary_model_scale: 0.5 vr_drums_secondary_model_scale: 0.5 demucs_model: Choose Model segment: 100 overlap: 0.25 overlap_mdx: Default overlap_mdx23: 50 shifts: 2 chunks_demucs: Auto margin_demucs: 44100 is_chunk_demucs: False is_chunk_mdxnet: False is_primary_stem_only_Demucs: False is_secondary_stem_only_Demucs: False is_split_mode: True is_demucs_combine_stems: True is_mdx23_combine_stems: True demucs_voc_inst_secondary_model: No Model Selected demucs_other_secondary_model: No Model Selected demucs_bass_secondary_model: No Model Selected demucs_drums_secondary_model: No Model Selected demucs_is_secondary_model_activate: False demucs_voc_inst_secondary_model_scale: 0.9 demucs_other_secondary_model_scale: 0.7 demucs_bass_secondary_model_scale: 0.5 demucs_drums_secondary_model_scale: 0.5 demucs_pre_proc_model: No Model Selected is_demucs_pre_proc_model_activate: False is_demucs_pre_proc_model_inst_mix: False mdx_net_model: UVR-MDX-NET Inst HQ 3 chunks: Auto margin: 44100 compensate: Auto denoise_option: None is_match_frequency_pitch: True phase_option: Automatic phase_shifts: None is_save_align: False is_match_silence: True is_spec_match: False is_mdx_c_seg_def: False is_invert_spec: False is_deverb_vocals: False deverb_vocal_opt: Main Vocals Only voc_split_save_opt: Lead Only is_mixer_mode: False mdx_batch_size: Default mdx_voc_inst_secondary_model: No Model Selected mdx_other_secondary_model: No Model Selected mdx_bass_secondary_model: No Model Selected mdx_drums_secondary_model: No Model Selected mdx_is_secondary_model_activate: False mdx_voc_inst_secondary_model_scale: 0.9 mdx_other_secondary_model_scale: 0.7 mdx_bass_secondary_model_scale: 0.5 mdx_drums_secondary_model_scale: 0.5 is_save_all_outputs_ensemble: True is_append_ensemble_name: False chosen_audio_tool: Manual Ensemble choose_algorithm: Min Spec time_stretch_rate: 2.0 pitch_rate: 2.0 is_time_correction: True is_gpu_conversion: True is_primary_stem_only: False is_secondary_stem_only: True is_testing_audio: False is_auto_update_model_params: True is_add_model_name: False is_accept_any_input: False is_task_complete: False is_normalization: False is_use_opencl: False is_wav_ensemble: False is_create_model_folder: False mp3_bit_set: 320k semitone_shift: 0 save_format: WAV wav_type_set: 64-bit Float device_set: NVIDIA GeForce RTX 4090:0 help_hints_var: True set_vocal_splitter: No Model Selected is_set_vocal_splitter: False is_save_inst_set_vocal_splitter: False model_sample_mode: False model_sample_mode_duration: 30 demucs_stems: All Stems mdxstems: Vocals