4re / vapoursynth-portage

Unofficial portage tree with all VapourSynth related ebuilds

Trouble running svp4linux with mpv + vapoursynth #119

Closed dbkblk closed 1 year ago

dbkblk commented 1 year ago

Hi, I'm not very experienced with Gentoo (though not a Linux beginner), and I'm having trouble running svp4linux with vapoursynth from this repository.

When playing a video, mpv crashes:

mpv video.mkv 
 (+) Video --vid=1 (*) (h264 1920x808 23.976fps)
 (+) Audio --aid=1 --alang=fre (*) (ac3 6ch 48000Hz)
     Audio --aid=2 --alang=eng (aac 6ch 48000Hz)
 (+) Subs  --sid=1 --slang=fre (*) 'Forced' (subrip)
     Subs  --sid=2 --slang=fre 'French' (subrip)
AO: [pipewire] 48000Hz 5.1(side) 6ch floatp
VO: [gpu] 1920x808 yuv420p
mesa: CommandLine Error: Option 'h' registered more than once!150MB
LLVM ERROR: inconsistency in registered CommandLine options
Abandon (core dumped)
[U] media-video/mpv
     Available versions:  0.34.1-r1^t 0.35.0-r1(0/2)^t **9999(0/2)*l^t {+X +alsa aqua archive bluray cdda +cli coreaudio cplugins debug doc (+)drm dvb dvd +egl gamepad gbm +iconv jack javascript jpeg lcms libcaca (+)libmpv +libplacebo +lua mmal nvenc openal (+)opengl pipewire pulseaudio raspberry-pi rubberband sdl selinux sixel sndio test tools +uchardet vaapi vdpau vulkan wayland +xv zimg zlib LUA_SINGLE_TARGET="lua5-1 luajit" PYTHON_SINGLE_TARGET="python3_9 python3_10 python3_11" PYTHON_TARGETS="python3_9 python3_10 python3_11"}
     Installed versions:  0.35.0(0/2)^t[?](18:41:44 23/01/2023)(X alsa cdda cli drm dvd egl iconv jpeg lcms libmpv libplacebo lua opengl pipewire pulseaudio uchardet vaapi vapoursynth vulkan wayland xv zimg zlib -aqua -archive -bluray -coreaudio -debug -dvb -gamepad -jack -javascript -libcaca -mmal -nvenc -openal -raspberry-pi -rubberband -sdl -selinux -sndio -test -tools -vdpau LUA_SINGLE_TARGET="luajit -lua5-1" PYTHON_SINGLE_TARGET="python3_10 -python3_9 -python3_11")
     Homepage:            https://mpv.io/
     Description:         Media player for the command line
[I] sys-devel/llvm
     Available versions:  
     (13)   13.0.1^t
     (14)   14.0.6-r2^t
     (15)   15.0.6-r1(15/15)^t ~15.0.7(15/15)^t
     (16)   ~*16.0.0_pre20230101(16/16.0.0_pre20230101)^t ~*16.0.0_pre20230107(16/16.0.0_pre20230107)^t **16.0.0.9999(16/16.0.0.9999)*l^t
       {+binutils-plugin debug doc exegesis libedit +libffi ncurses test verify-sig xar xml z3 zstd ABI_MIPS="n32 n64 o32" ABI_S390="32 64" ABI_X86="32 64 x32" LLVM_TARGETS="AArch64 AMDGPU ARC ARM AVR BPF CSKY DirectX Hexagon Lanai LoongArch M68k MSP430 Mips NVPTX PowerPC RISCV SPIRV Sparc SystemZ VE WebAssembly X86 XCore Xtensa"}
     Installed versions:  15.0.6-r1(15/15)^t(17:01:08 19/01/2023)(binutils-plugin libffi ncurses xml -debug -doc -exegesis -libedit -test -verify-sig -xar -z3 -zstd ABI_MIPS="-n32 -n64 -o32" ABI_S390="-32 -64" ABI_X86="32 64 -x32" LLVM_TARGETS="AArch64 AMDGPU ARM AVR BPF Hexagon Lanai MSP430 Mips NVPTX PowerPC RISCV Sparc SystemZ VE WebAssembly X86 XCore -ARC -CSKY -DirectX -LoongArch -M68k -SPIRV")
     Homepage:            https://llvm.org/
     Description:         Low Level Virtual Machine
equery uses vapoursynth
[ Legend : U - final flag setting for installation]
[        : I - package is installed with flag     ]
[ Colors : set, unset                             ]
 * Found these USE flags for media-libs/vapoursynth-61:
 U I
 + + cpu_flags_x86_sse2              : Use the SSE2 instruction set
 - - debug                           : Enable compilation options required for debugging
 - - doc                             : Install project documentation
 - - guard                           : Adds 32 bytes on the left and the right sides of each frame, fills them with a certain value, and checks their integrity
                                       after each filter. It can be used to detect buggy filters that write a little outside the frame
 + + python_single_target_python3_10 : Build for Python 3.10 only
 - - python_single_target_python3_11 : Build for Python 3.11 only
 - - python_single_target_python3_9  : Build for Python 3.9 only
 + + shared                          : Build shared libraries
 + + static-libs                     : Build static versions of dynamic libraries as well
 + + vspipe                          : Enable support for piping output

Could you point me in the right direction, please?

4re commented 1 year ago

This overlay doesn't support svp4linux.

Anyway, it should work fine, so there may be something wrong with the prerequisites or your system.

Have you followed the install guide on their page, and are you sure your OpenCL installation is working correctly? Does it also fail if you launch /opt/SVPManager?

That being said, you probably will get better support on their website as this issue doesn't look related to this ebuild at all.

dbkblk commented 1 year ago

Thank you for your answer. I'm pretty sure that OpenCL is working properly, but I don't know exactly how to test this. Yes, it also fails once SVPManager is running. The module for band detection works (with color completion), but the one for image interpolation crashes mpv.

I found an old bug report, https://bugs.gentoo.org/721958, that has the same symptoms, but the reporter said it was fixed by a newer version of llvm. Maybe it's a linking problem with llvm.
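A rough way to check the linking guess: the "registered more than once" message is commonly reported when more than one copy of LLVM ends up in the same process, though nothing in this thread confirms that is the case here. The sketch below only parses the text printed by `ldd` for a binary or library and lists the distinct libLLVM names it links against. The function name `llvm_libraries` and the sample paths are made up for illustration.

```python
import re


def llvm_libraries(ldd_output):
    """Return the sorted, de-duplicated libLLVM names found in `ldd` text output."""
    # ldd prints lines of the form "<name> => <path> (<address>)".
    # Only the name before "=>" is inspected; the path is ignored.
    pattern = re.compile(r"^\s*(libLLVM\S*)\s+=>", re.MULTILINE)
    return sorted(set(pattern.findall(ldd_output)))


# Illustrative input only (hypothetical paths, not taken from the reporter's system).
sample = (
    "\tlibc.so.6 => /lib64/libc.so.6 (0x0)\n"
    "\tlibLLVM-15.so => /usr/lib/llvm/15/lib64/libLLVM-15.so (0x0)\n"
)
print(llvm_libraries(sample))
```

More than one name in the result would be a hint worth following up, not proof. `ldd` does not show statically linked copies of LLVM or plugins loaded later at runtime, so an empty or single-entry result does not rule the problem out.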

Let's keep this bug report open; I'll close it once I find the solution.

4re commented 1 year ago

I'm very hesitant to install binaries from people suspected of violating the GPL on my system, but I tested svp4linux and it works without problems here. Which drivers are you using, and with which USE flags? You can test OpenCL with clinfo, if I'm not mistaken.
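For a quick read of what clinfo reports, a minimal sketch like the one below can pull out the platform count and the device names. It assumes the default human-readable clinfo layout, with a "Number of platforms" line and indented "Device Name" lines. The helper names `summarize_clinfo` and `run_clinfo` are hypothetical.

```python
import re
import subprocess


def summarize_clinfo(text):
    """Return (platform_count, device_names) parsed from clinfo's text output."""
    match = re.search(r"^Number of platforms\s+(\d+)\s*$", text, re.MULTILINE)
    platforms = int(match.group(1)) if match else 0
    # Device names are listed on indented "Device Name  <name>" lines.
    # The same device can be listed more than once, so de-duplicate
    # while keeping the original order.
    names = re.findall(r"^\s+Device Name\s+(\S.*?)\s*$", text, re.MULTILINE)
    return platforms, list(dict.fromkeys(names))


def run_clinfo():
    """Run clinfo (must be installed and in PATH) and summarize its output."""
    result = subprocess.run(["clinfo"], capture_output=True, text=True)
    return summarize_clinfo(result.stdout)
```

Calling `run_clinfo()` on a machine with clinfo installed returns the tuple. Zero platforms or an empty device list suggests the OpenCL runtime is not being picked up, while a non-empty list only shows that a device is visible, not that every application can use it.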

dbkblk commented 1 year ago

Yes, I know it's not GPL, but it's the only reliable way of interpolating images I've found. Here are the details:

$ equery uses xorg-drivers
[ Legend : U - final flag setting for installation]
[        : I - package is installed with flag     ]
[ Colors : set, unset                             ]
 * Found these USE flags for x11-base/xorg-drivers-21.1-r2:
 U I
 - - input_devices_elographics : INPUT_DEVICES setting to build driver for elographics input devices
 - - input_devices_evdev       : INPUT_DEVICES setting to build driver for evdev input devices
 - - input_devices_joystick    : INPUT_DEVICES setting to build driver for joystick input devices
 + + input_devices_libinput    : INPUT_DEVICES setting to build driver for libinput input devices
 - - input_devices_synaptics   : INPUT_DEVICES setting to build driver for synaptics input devices
 - - input_devices_vmmouse     : INPUT_DEVICES setting to build driver for vmmouse input devices
 - - input_devices_void        : INPUT_DEVICES setting to build driver for void/dummy input devices
 - - input_devices_wacom       : INPUT_DEVICES setting to build driver for wacom input devices
 + + video_cards_amdgpu        : VIDEO_CARDS setting to build driver for AMDGPU video cards
 - - video_cards_ast           : VIDEO_CARDS setting to build driver for ASpeedTech video cards
 - - video_cards_dummy         : VIDEO_CARDS setting to build driver for dummy video cards
 - - video_cards_fbdev         : VIDEO_CARDS setting to build driver for fbdev video cards
 - - video_cards_i915          : VIDEO_CARDS setting to build driver for Intel i915 video cards
 - - video_cards_intel         : VIDEO_CARDS setting to build driver for Intel video cards
 - - video_cards_mga           : VIDEO_CARDS setting to build driver for mga video cards
 - - video_cards_nouveau       : VIDEO_CARDS setting to build reverse-engineered driver for nvidia cards
 - - video_cards_nvidia        : VIDEO_CARDS setting to build driver for nvidia video cards
 - - video_cards_qxl           : VIDEO_CARDS setting to build driver for qxl (QEMU virtual GPU)
 - - video_cards_r128          : VIDEO_CARDS setting to build driver for ATI r128 video cards
 + + video_cards_radeon        : VIDEO_CARDS setting to build driver for ATI radeon video cards
 + + video_cards_radeonsi      : VIDEO_CARDS setting to build only Southern Islands based chips code for radeon
 - - video_cards_siliconmotion : VIDEO_CARDS setting to build driver for siliconmotion video cards
 - - video_cards_vesa          : VIDEO_CARDS setting to build driver for vesa video cards
 - - video_cards_via           : VIDEO_CARDS setting to build driver for via video cards
 - - video_cards_virtualbox    : VIDEO_CARDS setting to build driver for virtualbox emulation
 - - video_cards_vmware        : VIDEO_CARDS setting to build driver for vmware video cards
$ equery uses mesa
[ Legend : U - final flag setting for installation]
[        : I - package is installed with flag     ]
[ Colors : set, unset                             ]
 * Found these USE flags for media-libs/mesa-22.3.3:
 U I
 + + X                    : Add support for X11
 + + abi_x86_32           : 32-bit (x86) libraries
 + + cpu_flags_x86_sse2   : Use the SSE2 instruction set
 - - d3d9                 : Enable Direct 3D9 API through Nine state tracker. Can be used together with patched wine.
 - - debug                : Enable extra debug codepaths, like asserts and extra output. If you want to get meaningful backtraces see https://wiki.gentoo.org/wiki/Project:Quality_Assurance/Backtraces
 - - gles1                : Enable GLESv1 support.
 + + gles2                : Enable GLESv2 support.
 + + llvm                 : Enable LLVM backend for Gallium3D.
 - - lm-sensors           : Enable Gallium HUD lm-sensors support.
 - - opencl               : Enable the Clover Gallium OpenCL state tracker.
 - - osmesa               : Build the Mesa library for off-screen rendering.
 + + proprietary-codecs   : Enable codecs for patent-encumbered audio and video formats.
 - - test                 : Enable dependencies and/or preparations necessary to run tests (usually controlled by FEATURES=test but can be toggled independently)
 - - unwind               : Add support for call stack unwinding and function name resolution
 + + vaapi                : Enable Video Acceleration API for hardware decoding
 - - valgrind             : Compile in valgrind memory hints
 - - vdpau                : Enable the VDPAU acceleration interface for the Gallium3D Video Layer.
 - - video_cards_d3d12    : VIDEO_CARDS seeting to build driver for Microsoft WSL video cards
 - - video_cards_intel    : VIDEO_CARDS setting to build driver for Intel video cards
 - - video_cards_nouveau  : VIDEO_CARDS setting to build reverse-engineered driver for nvidia cards
 - - video_cards_r300     : VIDEO_CARDS setting to build only r300, r400 and r500 based chips code for radeon
 - - video_cards_r600     : VIDEO_CARDS setting to build only r600, r700, Evergreen and Northern Islands based chips code for radeon
 + + video_cards_radeon   : VIDEO_CARDS setting to build driver for ATI radeon video cards
 + + video_cards_radeonsi : VIDEO_CARDS setting to build only Southern Islands based chips code for radeon
 - - video_cards_virgl    : VIDEO_CARDS setting to build driver for virgil (virtual 3D GPU)
 - - video_cards_vmware   : VIDEO_CARDS setting to build driver for vmware video cards
 + + vulkan               : Enable Vulkan drivers
 - - vulkan-overlay       : Build vulkan-overlay-layer which displays Frames Per Second and other statistics
 + + wayland              : Enable support for dev-libs/wayland
 - - xa                   : Enable the XA (X Acceleration) API for Gallium3D.
 - - zink                 : Enable the Zink OpenGL-over-Vulkan Gallium driver
 + + zstd                 : Enable support for ZSTD compression
$ equery uses llvm
[ Legend : U - final flag setting for installation]
[        : I - package is installed with flag     ]
[ Colors : set, unset                             ]
 * Found these USE flags for sys-devel/llvm-15.0.6-r1:
 U I
 + + abi_x86_32      : 32-bit (x86) libraries
 + + binutils-plugin : Build the binutils plugin
 - - debug           : Enable extra debug codepaths, like asserts and extra output. If you want to get meaningful backtraces see https://wiki.gentoo.org/wiki/Project:Quality_Assurance/Backtraces
 - - doc             : Build and install the HTML documentation and regenerate the man pages
 - - exegesis        : Enable performance counter support for llvm-exegesis tool that can be used to measure host machine instruction characteristics
 - - libedit         : Use the libedit library (replacement for readline)
 + + libffi          : Enable support for Foreign Function Interface library
 + + ncurses         : Support querying terminal properties using ncurses' terminfo
 - - test            : Enable dependencies and/or preparations necessary to run tests (usually controlled by FEATURES=test but can be toggled independently)
 - - verify-sig      : Verify upstream signatures on distfiles
 - - xar             : Support dumping LLVM bitcode sections in Mach-O files (uses app-arch/xar)
 + + xml             : Add support for XML files
 - - z3              : Enable support for sci-mathematics/z3 constraint solver
 - - zstd            : Enable support for ZSTD compression
$ clinfo
Number of platforms                               1
  Platform Name                                   AMD Accelerated Parallel Processing
  Platform Vendor                                 Advanced Micro Devices, Inc.
  Platform Version                                OpenCL 2.1 AMD-APP.dbg (3423.0)
  Platform Profile                                FULL_PROFILE
  Platform Extensions                             cl_khr_icd cl_amd_event_callback 
  Platform Extensions function suffix             AMD
  Platform Host timer resolution                  1ns

  Platform Name                                   AMD Accelerated Parallel Processing
Number of devices                                 1
  Device Name                                     gfx1031
  Device Vendor                                   Advanced Micro Devices, Inc.
  Device Vendor ID                                0x1002
  Device Version                                  OpenCL 2.0 
  Driver Version                                  3423.0 (HSA1.1,LC)
  Device OpenCL C Version                         OpenCL C 2.0 
  Device Type                                     GPU
  Device Board Name (AMD)                         AMD Radeon RX 6700 XT
  Device PCI-e ID (AMD)                           0x73df
  Device Topology (AMD)                           PCI-E, 0000:03:00.0
  Device Profile                                  FULL_PROFILE
  Device Available                                Yes
  Compiler Available                              Yes
  Linker Available                                Yes
  Max compute units                               20
  SIMD per compute unit (AMD)                     4
  SIMD width (AMD)                                32
  SIMD instruction width (AMD)                    1
  Max clock frequency                             2725MHz
  Graphics IP (AMD)                               10.3
  Device Partition                                (core)
    Max number of sub-devices                     20
    Supported partition types                     None
    Supported affinity domains                    (n/a)
  Max work item dimensions                        3
  Max work item sizes                             1024x1024x1024
  Max work group size                             256
  Preferred work group size (AMD)                 256
  Max work group size (AMD)                       1024
  Preferred work group size multiple (kernel)     32
  Wavefront width (AMD)                           32
  Preferred / native vector sizes                 
    char                                                 4 / 4       
    short                                                2 / 2       
    int                                                  1 / 1       
    long                                                 1 / 1       
    half                                                 1 / 1        (cl_khr_fp16)
    float                                                1 / 1       
    double                                               1 / 1        (cl_khr_fp64)
  Half-precision Floating-point support           (cl_khr_fp16)
    Denormals                                     No
    Infinity and NANs                             No
    Round to nearest                              No
    Round to zero                                 No
    Round to infinity                             No
    IEEE754-2008 fused multiply-add               No
    Support is emulated in software               No
  Single-precision Floating-point support         (core)
    Denormals                                     Yes
    Infinity and NANs                             Yes
    Round to nearest                              Yes
    Round to zero                                 Yes
    Round to infinity                             Yes
    IEEE754-2008 fused multiply-add               Yes
    Support is emulated in software               No
    Correctly-rounded divide and sqrt operations  Yes
  Double-precision Floating-point support         (cl_khr_fp64)
    Denormals                                     Yes
    Infinity and NANs                             Yes
    Round to nearest                              Yes
    Round to zero                                 Yes
    Round to infinity                             Yes
    IEEE754-2008 fused multiply-add               Yes
    Support is emulated in software               No
  Address bits                                    64, Little-Endian
  Global memory size                              12868124672 (11.98GiB)
  Global free memory (AMD)                        12566528 (11.98GiB) 12566528 (11.98GiB)
  Global memory channels (AMD)                    6
  Global memory banks per channel (AMD)           4
  Global memory bank width (AMD)                  256 bytes
  Error Correction support                        No
  Max memory allocation                           10937905968 (10.19GiB)
  Unified memory for Host and Device              No
  Shared Virtual Memory (SVM) capabilities        (core)
    Coarse-grained buffer sharing                 Yes
    Fine-grained buffer sharing                   Yes
    Fine-grained system sharing                   No
    Atomics                                       No
  Minimum alignment for any data type             128 bytes
  Alignment of base address                       1024 bits (128 bytes)
  Preferred alignment for atomics                 
    SVM                                           0 bytes
    Global                                        0 bytes
    Local                                         0 bytes
  Max size for global variable                    10937905968 (10.19GiB)
  Preferred total size of global vars             12868124672 (11.98GiB)
  Global Memory cache type                        Read/Write
  Global Memory cache size                        16384 (16KiB)
  Global Memory cache line size                   64 bytes
  Image support                                   Yes
    Max number of samplers per kernel             29663
    Max size for 1D images from buffer            134217728 pixels
    Max 1D or 2D image array size                 8192 images
    Base address alignment for 2D image buffers   256 bytes
    Pitch alignment for 2D image buffers          256 pixels
    Max 2D image size                             16384x16384 pixels
    Max 3D image size                             16384x16384x8192 pixels
    Max number of read image args                 128
    Max number of write image args                8
    Max number of read/write image args           64
  Max number of pipe args                         16
  Max active pipe reservations                    16
  Max pipe packet size                            2347971376 (2.187GiB)
  Local memory type                               Local
  Local memory size                               65536 (64KiB)
  Local memory size per CU (AMD)                  65536 (64KiB)
  Local memory banks (AMD)                        32
  Max number of constant args                     8
  Max constant buffer size                        10937905968 (10.19GiB)
  Preferred constant buffer size (AMD)            16384 (16KiB)
  Max size of kernel argument                     1024
  Queue properties (on host)                      
    Out-of-order execution                        No
    Profiling                                     Yes
  Queue properties (on device)                    
    Out-of-order execution                        Yes
    Profiling                                     Yes
    Preferred size                                262144 (256KiB)
    Max size                                      8388608 (8MiB)
  Max queues on device                            1
  Max events on device                            1024
  Prefer user sync for interop                    Yes
  Number of P2P devices (AMD)                     0
  Profiling timer resolution                      1ns
  Profiling timer offset since Epoch (AMD)        0ns (Thu Jan  1 01:00:00 1970)
  Execution capabilities                          
    Run OpenCL kernels                            Yes
    Run native kernels                            No
    Thread trace supported (AMD)                  No
    Number of async queues (AMD)                  8
    Max real-time compute queues (AMD)            8
    Max real-time compute units (AMD)             20
  printf() buffer size                            4194304 (4MiB)
  Built-in kernels                                (n/a)
  Device Extensions                               cl_khr_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_gl_sharing cl_amd_device_attribute_query cl_amd_media_ops cl_amd_media_ops2 cl_khr_image2d_from_buffer cl_khr_subgroups cl_khr_depth_images cl_amd_copy_buffer_p2p cl_amd_assembly_program 

NULL platform behavior
  clGetPlatformInfo(NULL, CL_PLATFORM_NAME, ...)  No platform
  clGetDeviceIDs(NULL, CL_DEVICE_TYPE_ALL, ...)   No platform
  clCreateContext(NULL, ...) [default]            No platform
  clCreateContext(NULL, ...) [other]              Success [AMD]
  clCreateContextFromType(NULL, CL_DEVICE_TYPE_DEFAULT)  Success (1)
    Platform Name                                 AMD Accelerated Parallel Processing
    Device Name                                   gfx1031
  clCreateContextFromType(NULL, CL_DEVICE_TYPE_CPU)  No devices found in platform
  clCreateContextFromType(NULL, CL_DEVICE_TYPE_GPU)  Success (1)
    Platform Name                                 AMD Accelerated Parallel Processing
    Device Name                                   gfx1031
  clCreateContextFromType(NULL, CL_DEVICE_TYPE_ACCELERATOR)  No devices found in platform
  clCreateContextFromType(NULL, CL_DEVICE_TYPE_CUSTOM)  No devices found in platform
  clCreateContextFromType(NULL, CL_DEVICE_TYPE_ALL)  Success (1)
    Platform Name                                 AMD Accelerated Parallel Processing
    Device Name                                   gfx1031

And the relevant world packages:

dev-libs/rocm-opencl-runtime
dev-util/clinfo
media-libs/libva-compat
media-libs/mesa
media-libs/vapoursynth
media-sound/alsa-utils
media-sound/easyeffects
media-sound/pavucontrol
media-video/mediainfo
media-video/mpv::vapoursynth
media-video/pipewire
media-video/wireplumber
sys-firmware/intel-microcode
sys-kernel/gentoo-kernel-bin
sys-kernel/linux-firmware
sys-process/lsof
x11-base/xorg-server

dbkblk commented 1 year ago

Well, I decided to move away from Gentoo. I was unable to solve the problem, even with closed source drivers. Thank you for your support.