CarlGao4 / Demucs-Gui

A GUI for music separation AI demucs
GNU General Public License v3.0
499 stars 37 forks source link

Bug: Status failed #41

Closed jvolker closed 9 months ago

jvolker commented 9 months ago

happen

  1. I was on version 0.1 for a while. Everything was working
  2. I updated to version 1.0
  3. I drag and drop an mp3 file into the queue and press "start separation" using the default settings. The status of the file changes to "failed".

I'm on a Mac Book Pro M1 Silicon.

What is the platform you are using?

macOS

log

I've tried to open the log from the menu bar, but nothing happens or opens up.

CarlGao4 commented 9 months ago

Please attach your logs. You can find them in ~/.config/demucs-gui/logs (You can open the path by pressing Command+Shift+g in Finder)

jvolker commented 9 months ago

Thanks for getting back to me.

2023-11-27 13:25:00,297 (GuiMain.py) (Line 1071) [INFO] : Python version: 3.11.5 (main, Sep 11 2023, 08:27:25) [Clang 14.0.6 ]
2023-11-27 13:25:00,299 (GuiMain.py) (Line 1072) [INFO] : Demucs GUI version: 1.0.1
2023-11-27 13:25:00,334 (GuiMain.py) (Line 1073) [INFO] : System: macOS-10.16-x86_64-i386-64bit
2023-11-27 13:25:00,350 (GuiMain.py) (Line 1074) [INFO] : Architecture: 64bit
2023-11-27 13:25:00,358 (GuiMain.py) (Line 1075) [INFO] : CPU: i386
2023-11-27 13:25:00,360 (GuiMain.py) (Line 1076) [INFO] : CPU count: 10
2023-11-27 13:25:00,361 (GuiMain.py) (Line 1077) [INFO] : System memory: 68719476736 (64.0GB)
2023-11-27 13:25:00,363 (GuiMain.py) (Line 1080) [INFO] : System free memory: 2271621120 (2.116GB)
2023-11-27 13:25:00,364 (GuiMain.py) (Line 1083) [INFO] : System swap memory: 7516192768 (7.0GB)
2023-11-27 13:25:01,525 (shared.py) (Line 156) [INFO] : [1] Thread starter (separator.py) starts
2023-11-27 13:25:01,548 (GuiMain.py) (Line 1088) [DEBUG] : Supported styles: macOS, Windows, Fusion
2023-11-27 13:25:01,558 (GuiMain.py) (Line 1092) [DEBUG] : Current style: macos
2023-11-27 13:25:23,231 (audio.py) (Line 34) [INFO] : Soundfile version: 0.12.1
2023-11-27 13:25:23,247 (audio.py) (Line 35) [INFO] : libsndfile version: 1.1.0
2023-11-27 13:25:23,249 (audio.py) (Line 36) [INFO] : SoXR version: 0.3.7
2023-11-27 13:25:23,252 (audio.py) (Line 37) [INFO] : libsoxr version: 0.1.3-7-gacc0dac
2023-11-27 13:25:23,258 (separator.py) (Line 44) [INFO] : Demucs version: 4.1.0a2
2023-11-27 13:25:23,260 (separator.py) (Line 45) [INFO] : PyTorch version: 2.0.1
2023-11-27 13:25:29,090 (audio.py) (Line 49) [INFO] : ffmpeg -version output:
ffmpeg version n6.0-tessus Copyright (c) 2000-2023 the FFmpeg developers
built with Apple clang version 14.0.0 (clang-1400.0.29.202)
configuration: --cc=/usr/bin/clang --prefix=/opt/ffmpeg --extra-version=tessus --enable-avisynth --enable-fontconfig --enable-gpl --enable-libaom --enable-libass --enable-libbluray --enable-libdav1d --enable-libfreetype --enable-libgsm --enable-libmodplug --enable-libmp3lame --enable-libmysofa --enable-libopencore-amrnb --enable-libopencore-amrwb --enable-libopenh264 --enable-libopenjpeg --enable-libopus --enable-librubberband --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvmaf --enable-libvorbis --enable-libvpx --enable-libwebp --enable-libx264 --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzimg --enable-libzmq --enable-libzvbi --enable-version3 --pkg-config-flags=--static --disable-ffplay --enable-ffmpeg --enable-ffprobe
libavutil      58.  2.100 / 58.  2.100
libavcodec     60.  3.100 / 60.  3.100
libavformat    60.  3.100 / 60.  3.100
libavdevice    60.  1.100 / 60.  1.100
libavfilter     9.  3.100 /  9.  3.100
libswscale      7.  1.100 /  7.  1.100
libswresample   4. 10.100 /  4. 10.100
libpostproc    57.  1.100 / 57.  1.100

2023-11-27 13:25:30,749 (audio.py) (Line 54) [INFO] : Using ffmpeg from /Applications/Demucs-GUI.app/Contents/Frameworks/ffmpeg/ffmpeg
2023-11-27 13:25:30,754 (audio.py) (Line 55) [INFO] : ffprobe -version output:
ffprobe version n6.0-tessus Copyright (c) 2007-2023 the FFmpeg developers
built with Apple clang version 14.0.0 (clang-1400.0.29.202)
configuration: --cc=/usr/bin/clang --prefix=/opt/ffmpeg --extra-version=tessus --enable-avisynth --enable-fontconfig --enable-gpl --enable-libaom --enable-libass --enable-libbluray --enable-libdav1d --enable-libfreetype --enable-libgsm --enable-libmodplug --enable-libmp3lame --enable-libmysofa --enable-libopencore-amrnb --enable-libopencore-amrwb --enable-libopenh264 --enable-libopenjpeg --enable-libopus --enable-librubberband --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvmaf --enable-libvorbis --enable-libvpx --enable-libwebp --enable-libx264 --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzimg --enable-libzmq --enable-libzvbi --enable-version3 --pkg-config-flags=--static --disable-ffplay --enable-ffmpeg --enable-ffprobe
libavutil      58.  2.100 / 58.  2.100
libavcodec     60.  3.100 / 60.  3.100
libavformat    60.  3.100 / 60.  3.100
libavdevice    60.  1.100 / 60.  1.100
libavfilter     9.  3.100 /  9.  3.100
libswscale      7.  1.100 /  7.  1.100
libswresample   4. 10.100 /  4. 10.100
libpostproc    57.  1.100 / 57.  1.100

2023-11-27 13:25:31,803 (shared.py) (Line 162) [INFO] : [1] Thread starter (separator.py) ends
2023-11-27 13:25:39,759 (shared.py) (Line 156) [INFO] : [2] Thread loadModel (GuiMain.py) starts
2023-11-27 13:25:39,766 (GuiMain.py) (Line 368) [INFO] : Loading model htdemucs from repo /Applications/Demucs-GUI.app/Contents/Frameworks/pretrained
2023-11-27 13:25:40,409 (GuiMain.py) (Line 390) [INFO] : Model loaded within 0.6371s
2023-11-27 13:25:40,412 (GuiMain.py) (Line 391) [INFO] : Model: htdemucs
Repo: /Applications/Demucs-GUI.app/Contents/Frameworks/pretrained
Type: Bag of models
Audio channels: 2
Sample rate: 44100
Sources: drums, bass, other, vocals

Model 0:
    Type: HTDemucs
    Default segment: 7.8
    Weight: [1.0, 1.0, 1.0, 1.0]
2023-11-27 13:25:40,413 (separator.py) (Line 78) [INFO] : MPS backend is available
2023-11-27 13:25:40,414 (shared.py) (Line 162) [INFO] : [2] Thread loadModel (GuiMain.py) ends
2023-11-27 13:28:31,466 (shared.py) (Line 156) [INFO] : [3] Thread separate (separator.py) starts
2023-11-27 13:28:31,477 (separator.py) (Line 267) [INFO] : Start separating audio: mysong.mp3
2023-11-27 13:28:31,483 (separator.py) (Line 268) [INFO] : Parameters: segment=7.80 overlap=0.25 shifts=0
2023-11-27 13:28:31,490 (separator.py) (Line 269) [INFO] : Device: mps
2023-11-27 13:28:31,493 (audio.py) (Line 66) [DEBUG] : Reading audio with soundfile: /Users/myusername/Downloads/mysong.mp3
2023-11-27 13:28:31,778 (audio.py) (Line 82) [INFO] : Read audio /Users/myusername/Downloads/mysong.mp3: samplerate=48000 shape=(0, 2)
2023-11-27 13:28:31,780 (shared.py) (Line 162) [INFO] : [3] Thread separate (separator.py) ends
2023-11-27 14:48:32,543 (shared.py) (Line 156) [INFO] : [4] Thread separate (separator.py) starts
2023-11-27 14:48:32,549 (separator.py) (Line 267) [INFO] : Start separating audio: mysong.mp3
2023-11-27 14:48:32,557 (separator.py) (Line 268) [INFO] : Parameters: segment=7.80 overlap=0.25 shifts=0
2023-11-27 14:48:32,560 (separator.py) (Line 269) [INFO] : Device: cpu
2023-11-27 14:48:32,562 (audio.py) (Line 66) [DEBUG] : Reading audio with soundfile: /Users/myusername/Downloads/mysong.mp3
2023-11-27 14:48:32,774 (audio.py) (Line 82) [INFO] : Read audio /Users/myusername/Downloads/mysong.mp3: samplerate=48000 shape=(0, 2)
2023-11-27 14:48:32,778 (shared.py) (Line 162) [INFO] : [4] Thread separate (separator.py) ends
CarlGao4 commented 9 months ago

I'll fix it in the next version. Please convert your audio into flac format for now.

jvolker commented 9 months ago

Thank you!

CarlGao4 commented 9 months ago

Besides, can you provide logs about opening log folder? I'm quite curious about why opening logs could fail

jvolker commented 9 months ago

Unfortunately, this was all that I could find in the logs.

CarlGao4 commented 9 months ago

Please try whether the latest version 1.0.2 fixes the issues above. If so, please close the issue.

jvolker commented 9 months ago

Thanks for trying to fix this so quickly. Unfortunately, seperation it's now stuck with status "Reading". Opening logs works now.

2023-11-28 13:46:22,166 (GuiMain.py) (Line 1071) [INFO] : Python version: 3.11.5 (main, Sep 11 2023, 08:27:25) [Clang 14.0.6 ]
2023-11-28 13:46:22,167 (GuiMain.py) (Line 1072) [INFO] : Demucs GUI version: 1.0.2
2023-11-28 13:46:22,196 (GuiMain.py) (Line 1073) [INFO] : System: macOS-10.16-x86_64-i386-64bit
2023-11-28 13:46:22,209 (GuiMain.py) (Line 1074) [INFO] : Architecture: 64bit
2023-11-28 13:46:22,212 (GuiMain.py) (Line 1075) [INFO] : CPU: i386
2023-11-28 13:46:22,215 (GuiMain.py) (Line 1076) [INFO] : CPU count: 10
2023-11-28 13:46:22,216 (GuiMain.py) (Line 1077) [INFO] : System memory: 68719476736 (64.0GB)
2023-11-28 13:46:22,222 (GuiMain.py) (Line 1080) [INFO] : System free memory: 3218182144 (2.997GB)
2023-11-28 13:46:22,223 (GuiMain.py) (Line 1083) [INFO] : System swap memory: 6442450944 (6.0GB)
2023-11-28 13:46:23,183 (shared.py) (Line 161) [INFO] : [1] Thread starter (separator.py) starts
2023-11-28 13:46:23,205 (GuiMain.py) (Line 1088) [DEBUG] : Supported styles: macOS, Windows, Fusion
2023-11-28 13:46:23,209 (GuiMain.py) (Line 1092) [DEBUG] : Current style: macos
2023-11-28 13:46:37,575 (audio.py) (Line 34) [INFO] : Soundfile version: 0.12.1
2023-11-28 13:46:37,596 (audio.py) (Line 35) [INFO] : libsndfile version: 1.1.0
2023-11-28 13:46:37,598 (audio.py) (Line 36) [INFO] : SoXR version: 0.3.7
2023-11-28 13:46:37,600 (audio.py) (Line 37) [INFO] : libsoxr version: 0.1.3-7-gacc0dac
2023-11-28 13:46:37,602 (separator.py) (Line 45) [INFO] : Demucs version: 4.1.0a2
2023-11-28 13:46:37,604 (separator.py) (Line 46) [INFO] : PyTorch version: 2.0.1
2023-11-28 13:46:42,151 (audio.py) (Line 49) [INFO] : ffmpeg -version output:
ffmpeg version n6.0-tessus Copyright (c) 2000-2023 the FFmpeg developers
built with Apple clang version 14.0.0 (clang-1400.0.29.202)
configuration: --cc=/usr/bin/clang --prefix=/opt/ffmpeg --extra-version=tessus --enable-avisynth --enable-fontconfig --enable-gpl --enable-libaom --enable-libass --enable-libbluray --enable-libdav1d --enable-libfreetype --enable-libgsm --enable-libmodplug --enable-libmp3lame --enable-libmysofa --enable-libopencore-amrnb --enable-libopencore-amrwb --enable-libopenh264 --enable-libopenjpeg --enable-libopus --enable-librubberband --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvmaf --enable-libvorbis --enable-libvpx --enable-libwebp --enable-libx264 --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzimg --enable-libzmq --enable-libzvbi --enable-version3 --pkg-config-flags=--static --disable-ffplay --enable-ffmpeg --enable-ffprobe
libavutil      58.  2.100 / 58.  2.100
libavcodec     60.  3.100 / 60.  3.100
libavformat    60.  3.100 / 60.  3.100
libavdevice    60.  1.100 / 60.  1.100
libavfilter     9.  3.100 /  9.  3.100
libswscale      7.  1.100 /  7.  1.100
libswresample   4. 10.100 /  4. 10.100
libpostproc    57.  1.100 / 57.  1.100

2023-11-28 13:46:43,449 (audio.py) (Line 54) [INFO] : Using ffmpeg from /Applications/Demucs-GUI.app/Contents/Frameworks/ffmpeg/ffmpeg
2023-11-28 13:46:43,453 (audio.py) (Line 55) [INFO] : ffprobe -version output:
ffprobe version n6.0-tessus Copyright (c) 2007-2023 the FFmpeg developers
built with Apple clang version 14.0.0 (clang-1400.0.29.202)
configuration: --cc=/usr/bin/clang --prefix=/opt/ffmpeg --extra-version=tessus --enable-avisynth --enable-fontconfig --enable-gpl --enable-libaom --enable-libass --enable-libbluray --enable-libdav1d --enable-libfreetype --enable-libgsm --enable-libmodplug --enable-libmp3lame --enable-libmysofa --enable-libopencore-amrnb --enable-libopencore-amrwb --enable-libopenh264 --enable-libopenjpeg --enable-libopus --enable-librubberband --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvmaf --enable-libvorbis --enable-libvpx --enable-libwebp --enable-libx264 --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzimg --enable-libzmq --enable-libzvbi --enable-version3 --pkg-config-flags=--static --disable-ffplay --enable-ffmpeg --enable-ffprobe
libavutil      58.  2.100 / 58.  2.100
libavcodec     60.  3.100 / 60.  3.100
libavformat    60.  3.100 / 60.  3.100
libavdevice    60.  1.100 / 60.  1.100
libavfilter     9.  3.100 /  9.  3.100
libswscale      7.  1.100 /  7.  1.100
libswresample   4. 10.100 /  4. 10.100
libpostproc    57.  1.100 / 57.  1.100

2023-11-28 13:46:44,465 (shared.py) (Line 167) [INFO] : [1] Thread starter (separator.py) ends
2023-11-28 13:47:01,847 (shared.py) (Line 161) [INFO] : [2] Thread loadModel (GuiMain.py) starts
2023-11-28 13:47:01,851 (GuiMain.py) (Line 368) [INFO] : Loading model htdemucs from repo /Applications/Demucs-GUI.app/Contents/Frameworks/pretrained
2023-11-28 13:47:02,382 (GuiMain.py) (Line 390) [INFO] : Model loaded within 0.5048s
2023-11-28 13:47:02,386 (separator.py) (Line 79) [INFO] : MPS backend is available
2023-11-28 13:47:02,387 (GuiMain.py) (Line 391) [INFO] : Model: htdemucs
Repo: /Applications/Demucs-GUI.app/Contents/Frameworks/pretrained
Type: Bag of models
Audio channels: 2
Sample rate: 44100
Sources: drums, bass, other, vocals

Model 0:
 Type: HTDemucs
 Default segment: 7.8
 Weight: [1.0, 1.0, 1.0, 1.0]
2023-11-28 13:47:02,399 (shared.py) (Line 167) [INFO] : [2] Thread loadModel (GuiMain.py) ends
2023-11-28 13:47:26,218 (shared.py) (Line 161) [INFO] : [3] Thread separate (separator.py) starts
2023-11-28 13:47:26,226 (separator.py) (Line 284) [INFO] : Start separating audio: mysong.mp3
2023-11-28 13:47:26,228 (separator.py) (Line 285) [INFO] : Parameters: segment=7.80 overlap=0.25 shifts=0
2023-11-28 13:47:26,230 (separator.py) (Line 286) [INFO] : Device: mps
2023-11-28 13:47:26,232 (audio.py) (Line 66) [DEBUG] : Reading audio with soundfile: /Users/myusername/Downloads/mysong.mp3
2023-11-28 13:47:26,450 (audio.py) (Line 82) [INFO] : Read audio /Users/myusername/Downloads/mysong.mp3: samplerate=48000 shape=(0, 2)
2023-11-28 13:47:26,454 (audio.py) (Line 70) [ERROR] : Failed to read with soundfile:
Traceback (most recent call last):
  File "audio.py", line 68, in read_audio
  File "audio.py", line 83, in read_audio_soundfile
AssertionError: Audio is empty

2023-11-28 13:47:26,458 (audio.py) (Line 71) [DEBUG] : Reading audio with ffmpeg: /Users/myusername/Downloads/mysong.mp3
2023-11-28 13:47:26,478 (audio.py) (Line 98) [DEBUG] : ffprobe command: ffprobe -v level+warning -of xml -show_streams -show_format '/Users/myusername/Downloads/mysong.mp3'
2023-11-28 13:47:26,532 (audio.py) (Line 99) [INFO] : <?xml version="1.0" encoding="UTF-8"?>
<ffprobe>
    <streams>
        <stream index="0" codec_name="mp3" codec_long_name="MP3 (MPEG audio layer 3)" codec_type="audio" codec_tag_string="[0][0][0][0]" codec_tag="0x0000" sample_fmt="fltp" sample_rate="48000" channels="2" channel_layout="stereo" bits_per_sample="0" initial_padding="0" r_frame_rate="0/0" avg_frame_rate="0/0" time_base="1/14112000" start_pts="0" start_time="0.000000" duration_ts="2624154624" duration="185.952000" bit_rate="320000">
            <disposition default="0" dub="0" original="0" comment="0" lyrics="0" karaoke="0" forced="0" hearing_impaired="0" visual_impaired="0" clean_effects="0" attached_pic="0" timed_thumbnails="0" captions="0" descriptions="0" metadata="0" dependent="0" still_image="0"/>
        </stream>
        <stream index="1" codec_name="mjpeg" codec_long_name="Motion JPEG" profile="Baseline" codec_type="video" codec_tag_string="[0][0][0][0]" codec_tag="0x0000" width="640" height="640" coded_width="640" coded_height="640" closed_captions="0" film_grain="0" has_b_frames="0" sample_aspect_ratio="1:1" display_aspect_ratio="1:1" pix_fmt="yuvj444p" level="-99" color_range="pc" color_space="bt470bg" chroma_location="center" refs="1" r_frame_rate="90000/1" avg_frame_rate="0/0" time_base="1/90000" duration_ts="16735680" duration="185.952000" bits_per_raw_sample="8">
            <disposition default="0" dub="0" original="0" comment="0" lyrics="0" karaoke="0" forced="0" hearing_impaired="0" visual_impaired="0" clean_effects="0" attached_pic="1" timed_thumbnails="0" captions="0" descriptions="0" metadata="0" dependent="0" still_image="0"/>
            <tag key="title" value="[...]"/>
            <tag key="comment" value="Cover (front)"/>
        </stream>
    </streams>

    <format filename="/Users/myusername/Downloads/mysong.mp3" nb_streams="2" nb_programs="0" format_name="mp3" format_long_name="MP2/3 (MPEG audio layer 2/3)" start_time="0.000000" duration="185.952000" size="7564451" bit_rate="325436" probe_score="51">
        <tag key="title" value="[...]"/>
        <tag key="artist" value="[...]"/>
        <tag key="album" value="[...]"/>
        <tag key="n" value="[...]"/>
        <tag key="date" value="1970"/>
    </format>
</ffprobe>

2023-11-28 13:47:26,549 (audio.py) (Line 107) [DEBUG] : ffmpeg command: ffmpeg -v level+warning -i '/Users/myusername/Downloads/mysong.mp3' -map a:0 -ar 44100 -c:a pcm_f32le -f wav -
CarlGao4 commented 9 months ago

What about 1.0.2.1?

jvolker commented 9 months ago

Thank you so much! That worked. 👍