ina-foss / inaSpeechSegmenter

CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
MIT License
736 stars 126 forks source link

problem with ffmpg #78

Closed felixbur closed 1 year ago

felixbur commented 1 year ago

Please fill out this template for a bug report.

Make sure you ran the unit tests before submitting an issue and tell us if and where they fail.

System information

System information required if using GPU acceleration

Expected Behavior

Current Behavior

File ~/research/tmp/inaspeech_segmenter/venv/lib/python3.8/site-packages/inaSpeechSegmenter/segmenter.py:278, in Segmenter.call(self, medianame, tmpdir, start_sec, stop_sec) ... ---> 94 assert p.returncode == 0, error 96 # Get Mel Power Spectrogram and Energy 97 return _wav2feats(tmpwav)

AssertionError: b'ffmpeg version 4.2.7-0ubuntu0.1 Copyright (c) 2000-2022 the FFmpeg developers\n built with gcc 9 (Ubuntu 9.4.0-1ubuntu1~20.04.1)\n configuration: --prefix=/usr --extra-version=0ubuntu0.1 --toolchain=hardened --libdir=/usr/lib/x86_64-linux-gnu --incdir=/usr/include/x86_64-linux-gnu --arch=amd64 --enable-gpl --disable-stripping --enable-avresample --disable-filter=resample --enable-avisynth --enable-gnutls --enable-ladspa --enable-libaom --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio --enable-libcodec2 --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libgme --enable-libgsm --enable-libjack --enable-libmp3lame --enable-libmysofa --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librsvg --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libssh --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvorbis --enable-libvpx --enable-libwavpack --enable-libwebp --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzmq --enable-libzvbi --enable-lv2 --enable-omx --enable-openal --enable-opencl --enable-opengl --enable-sdl2 --enable-libdc1394 --enable-libdrm --enable-libiec61883 --enable-nvenc --enable-chromaprint --enable-frei0r --enable-libx264 --enable-shared\n libavutil 56. 31.100 / 56. 31.100\n libavcodec 58. 54.100 / 58. 54.100\n libavformat 58. 29.100 / 58. 29.100\n libavdevice 58. 8.100 / 58. 8.100\n libavfilter 7. 57.100 / 7. 57.100\n libavresample 4. 0. 0 / 4. 0. 0\n libswscale 5. 5.100 / 5. 5.100\n libswresample 3. 5.100 / 3. 5.100\n libpostproc 55. 5.100 / 55. 5.100\nn: No such file or directory\n'

Steps to Reproduce

1. 2. 3. 4.

Additional infos

bagustris commented 1 year ago

@felixbur I believe this error due to typo on audio file name (see the last error message: No such file or directory). For instance, if I change the filename from musanmix.mp3 to musanmi.mp3 in the example, I got the same error message.

felixbur commented 1 year ago

confirmed, it was a wrong path and nothing to do with ffmpeg