Open diegocr opened 1 year ago
Latter sounds like possible bug in ffmpeg (illegal usage set on AVFrames?)
For cuda->vulkan hwupload, where does it segfault? Do you have a stack trace? Unfortunately I have no CUDA GPU to test with at the moment.
Thanks for your reply.
I don't have a stack trace but i can provide a -v trace
it does segfault after calling cu->cuImportExternalSemaphore()
ffmpeg version N-111315-g76a949bf65-gb40324d417+1 Copyright (c) 2000-2023 the FFmpeg developers built with gcc 13.1.0 (Rev7, Built by MSYS2 project) libavutil 58. 13.101 / 58. 13.101 libavcodec 60. 22.100 / 60. 22.100 libavformat 60. 10.100 / 60. 10.100 libavdevice 60. 2.100 / 60. 2.100 libavfilter 9. 8.102 / 9. 8.102 libswscale 7. 3.100 / 7. 3.100 libswresample 4. 11.100 / 4. 11.100 libpostproc 57. 2.100 / 57. 2.100 Splitting the commandline. Reading option '-hwaccel' ... matched as option 'hwaccel' (use HW accelerated decoding) with argument 'cuda'. Reading option '-hwaccel_output_format' ... matched as option 'hwaccel_output_format' (select output format used with HW accelerated decoding) with argument 'cuda'. Reading option '-extra_hw_frames' ... matched as AVOption 'extra_hw_frames' with argument '3'. Reading option '-init_hw_device' ... matched as option 'init_hw_device' (initialise hardware device) with argument 'vulkan=gpu:0'. Reading option '-filter_hw_device' ... matched as option 'filter_hw_device' (set hardware device used when filtering) with argument 'gpu'. Reading option '-i' ... matched as input url with argument 'V:\OKR 2023 Q3.mkv'. Reading option '-vf' ... matched as option 'vf' (set video filters) with argument 'scale_cuda=w=1920:h=-1:interp_algo=lanczos,hwupload=derive_device=vulkan,libplacebo=format=yuv420p10le:downscaler=ewa_lanczos'. Reading option '-c:v' ... matched as option 'c' (codec name) with argument 'hevc_nvenc'. Reading option '-pix_fmt' ... matched as option 'pix_fmt' (set pixel format) with argument 'p010le'. Reading option '-an' ... matched as option 'an' (disable audio) with argument '1'. Reading option '-v' ... matched as option 'v' (set logging level) with argument 'trace'. Reading option '-y' ... matched as option 'y' (overwrite output files) with argument '1'. Reading option 'V:\OKR 2023 Q3 1080p.mkv' ... matched as output url. Finished splitting the commandline. Parsing a group of options: global . Applying option init_hw_device (initialise hardware device) with argument vulkan=gpu:0. [AVHWDeviceContext @ 000002478a7f7440] Supported validation layers: [AVHWDeviceContext @ 000002478a7f7440] VK_LAYER_NV_optimus [AVHWDeviceContext @ 000002478a7f7440] Using instance extension VK_KHR_portability_enumeration [AVHWDeviceContext @ 000002478a7f7440] GPU listing: [AVHWDeviceContext @ 000002478a7f7440] 0: NVIDIA RTX A5500 Laptop GPU (discrete) (0x2438) [AVHWDeviceContext @ 000002478a7f7440] 1: Intel(R) UHD Graphics 770 (integrated) (0x4688) [AVHWDeviceContext @ 000002478a7f7440] Device 0 selected: NVIDIA RTX A5500 Laptop GPU (discrete) (0x2438) [AVHWDeviceContext @ 000002478a7f7440] Queue families: [AVHWDeviceContext @ 000002478a7f7440] 0: graphics compute transfer sparse (queues: 16) [AVHWDeviceContext @ 000002478a7f7440] 1: transfer sparse (queues: 2) [AVHWDeviceContext @ 000002478a7f7440] 2: compute transfer sparse (queues: 8) [AVHWDeviceContext @ 000002478a7f7440] 3: transfer decode sparse (queues: 2) [AVHWDeviceContext @ 000002478a7f7440] 4: transfer sparse (queues: 1) [AVHWDeviceContext @ 000002478a7f7440] Using device extension VK_KHR_push_descriptor [AVHWDeviceContext @ 000002478a7f7440] Using device extension VK_KHR_sampler_ycbcr_conversion [AVHWDeviceContext @ 000002478a7f7440] Using device extension VK_EXT_descriptor_buffer [AVHWDeviceContext @ 000002478a7f7440] Using device extension VK_EXT_shader_atomic_float [AVHWDeviceContext @ 000002478a7f7440] Using device extension VK_EXT_external_memory_host [AVHWDeviceContext @ 000002478a7f7440] Using device extension VK_KHR_external_memory_win32 [AVHWDeviceContext @ 000002478a7f7440] Using device extension VK_KHR_external_semaphore_win32 [AVHWDeviceContext @ 000002478a7f7440] Using device extension VK_KHR_video_queue [AVHWDeviceContext @ 000002478a7f7440] Using device extension VK_KHR_video_decode_queue [AVHWDeviceContext @ 000002478a7f7440] Using device extension VK_KHR_video_decode_h264 [AVHWDeviceContext @ 000002478a7f7440] Using device extension VK_KHR_video_decode_h265 [AVHWDeviceContext @ 000002478a7f7440] Using device: NVIDIA RTX A5500 Laptop GPU [AVHWDeviceContext @ 000002478a7f7440] Alignments: [AVHWDeviceContext @ 000002478a7f7440] optimalBufferCopyRowPitchAlignment: 1 [AVHWDeviceContext @ 000002478a7f7440] minMemoryMapAlignment: 64 [AVHWDeviceContext @ 000002478a7f7440] nonCoherentAtomSize: 64 [AVHWDeviceContext @ 000002478a7f7440] minImportedHostPointerAlignment: 4096 [AVHWDeviceContext @ 000002478a7f7440] Using queue family 0 (queues: 16) for graphics [AVHWDeviceContext @ 000002478a7f7440] Using queue family 1 (queues: 2) for transfers [AVHWDeviceContext @ 000002478a7f7440] Using queue family 2 (queues: 8) for compute [AVHWDeviceContext @ 000002478a7f7440] Using queue family 3 (queues: 2) for decode Applying option filter_hw_device (set hardware device used when filtering) with argument gpu. Applying option v (set logging level) with argument trace. Applying option y (overwrite output files) with argument 1. Successfully parsed a group of options. Parsing a group of options: input url V:\OKR 2023 Q3.mkv. Applying option hwaccel (use HW accelerated decoding) with argument cuda. Applying option hwaccel_output_format (select output format used with HW accelerated decoding) with argument cuda. Successfully parsed a group of options. Opening an input file: V:\OKR 2023 Q3.mkv. [AVFormatContext @ 000002478c18b1c0] Opening 'V:\OKR 2023 Q3.mkv' for reading [file @ 000002478c18be80] Setting default whitelist 'file,crypto,data' Probing matroska,webm score:100 size:2048 [matroska,webm @ 000002478c18b1c0] Format matroska,webm probed with size=2048 and score=100 [matroska,webm @ 000002478c18b1c0] Unknown entry 0x22B59D at pos. 5112 [matroska,webm @ 000002478c18b1c0] Unknown entry 0x22B59D at pos. 5236 [matroska,webm @ 000002478c18b1c0] Unknown entry 0x22B59D at pos. 5291 [matroska,webm @ 000002478c18b1c0] Unknown entry 0x22B59D at pos. 5354 [matroska,webm @ 000002478c18b1c0] Unknown entry 0x22B59D at pos. 5413 [matroska,webm @ 000002478c18b1c0] Unknown entry 0x22B59D at pos. 5457 [matroska,webm @ 000002478c18b1c0] Unknown entry 0x22B59D at pos. 5501 [matroska,webm @ 000002478c18b1c0] Unknown entry 0x22B59D at pos. 5556 [matroska,webm @ 000002478c18b1c0] Unknown entry 0x437D at pos. 7033 [matroska,webm @ 000002478c18b1c0] Unknown entry 0x437D at pos. 7085 [matroska,webm @ 000002478c18b1c0] Unknown entry 0x437D at pos. 7138 [matroska,webm @ 000002478c18b1c0] Unknown entry 0x437D at pos. 7191 [matroska,webm @ 000002478c18b1c0] Unknown entry 0x437D at pos. 7244 [matroska,webm @ 000002478c18b1c0] Unknown entry 0x437D at pos. 7297 [matroska,webm @ 000002478c18b1c0] Unknown entry 0x437D at pos. 7350 [matroska,webm @ 000002478c18b1c0] Unknown entry 0x437D at pos. 7403 [matroska,webm @ 000002478c18b1c0] Unknown entry 0x437D at pos. 7456 [matroska,webm @ 000002478c18b1c0] Unknown entry 0x437D at pos. 7509 [matroska,webm @ 000002478c18b1c0] Unknown entry 0x437D at pos. 7562 [matroska,webm @ 000002478c18b1c0] Unknown entry 0x437D at pos. 7615 [matroska,webm @ 000002478c18b1c0] Unknown entry 0x437D at pos. 7668 [matroska,webm @ 000002478c18b1c0] Unknown entry 0x437D at pos. 7721 [matroska,webm @ 000002478c18b1c0] Unknown entry 0x437D at pos. 7774 [matroska,webm @ 000002478c18b1c0] Unknown entry 0x437D at pos. 7827 st:0 removing common factor 1000000 from timebase st:1 removing common factor 1000000 from timebase st:2 removing common factor 1000000 from timebase st:3 removing common factor 1000000 from timebase st:4 removing common factor 1000000 from timebase st:5 removing common factor 1000000 from timebase st:6 removing common factor 1000000 from timebase st:7 removing common factor 1000000 from timebase [matroska,webm @ 000002478c18b1c0] Before avformat_find_stream_info() pos: 7939 bytes read:35345 seeks:2 nb_streams:8 [hevc @ 000002478c1a1940] nal_unit_type: 32(VPS), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1a1940] Decoding VPS [hevc @ 000002478c1a1940] Main 10 profile bitstream [hevc @ 000002478c1a1940] nal_unit_type: 33(SPS), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1a1940] Decoding SPS [hevc @ 000002478c1a1940] Main 10 profile bitstream [hevc @ 000002478c1a1940] Decoding VUI [hevc @ 000002478c1a1940] nal_unit_type: 34(PPS), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1a1940] Decoding PPS For transform of length 128, inverse, mdct_float, flags: [aligned, out_of_place], found 3 matches: 1: mdct_inv_float_avx2 - type: mdct_float, len: [16, ā], factors[2]: [2, any], flags: [aligned, out_of_place, inv_only], prio: 544 2: mdct_inv_float_c - type: mdct_float, len: [2, ā], factors[2]: [2, any], flags: [unaligned, out_of_place, inv_only], prio: 96 3: mdct_naive_inv_float_c - type: mdct_float, len: [2, ā], factors[2]: [2, any], flags: [unaligned, out_of_place, inv_only], prio: -130976 For transform of length 64, inverse, fft_float, flags: [aligned, inplace, preshuf, asm_call], found 3 matches: 1: fft_sr_asm_float_avx2 - type: fft_float, len: [64, 131072], factor: 2, flags: [aligned, inplace, out_of_place, preshuf, asm_call], prio: 480 2: fft_sr_asm_float_fma3 - type: fft_float, len: [64, 131072], factor: 2, flags: [aligned, inplace, out_of_place, preshuf, asm_call], prio: 448 3: fft_sr_asm_float_avx - type: fft_float, len: [64, 131072], factor: 2, flags: [aligned, inplace, out_of_place, preshuf, asm_call], prio: 416 Transform tree: mdct_inv_float_avx2 - type: mdct_float, len: 128, factors[2]: [2, any], flags: [aligned, out_of_place, inv_only] fft_sr_asm_float_avx2 - type: fft_float, len: 64, factor: 2, flags: [aligned, inplace, out_of_place, preshuf, asm_call] For transform of length 256, inverse, mdct_float, flags: [aligned, out_of_place], found 3 matches: 1: mdct_inv_float_avx2 - type: mdct_float, len: [16, ā], factors[2]: [2, any], flags: [aligned, out_of_place, inv_only], prio: 544 2: mdct_inv_float_c - type: mdct_float, len: [2, ā], factors[2]: [2, any], flags: [unaligned, out_of_place, inv_only], prio: 96 3: mdct_naive_inv_float_c - type: mdct_float, len: [2, ā], factors[2]: [2, any], flags: [unaligned, out_of_place, inv_only], prio: -130976 For transform of length 128, inverse, fft_float, flags: [aligned, inplace, preshuf, asm_call], found 3 matches: 1: fft_sr_asm_float_avx2 - type: fft_float, len: [64, 131072], factor: 2, flags: [aligned, inplace, out_of_place, preshuf, asm_call], prio: 480 2: fft_sr_asm_float_fma3 - type: fft_float, len: [64, 131072], factor: 2, flags: [aligned, inplace, out_of_place, preshuf, asm_call], prio: 448 3: fft_sr_asm_float_avx - type: fft_float, len: [64, 131072], factor: 2, flags: [aligned, inplace, out_of_place, preshuf, asm_call], prio: 416 Transform tree: mdct_inv_float_avx2 - type: mdct_float, len: 256, factors[2]: [2, any], flags: [aligned, out_of_place, inv_only] fft_sr_asm_float_avx2 - type: fft_float, len: 128, factor: 2, flags: [aligned, inplace, out_of_place, preshuf, asm_call] For transform of length 128, inverse, mdct_float, flags: [aligned, out_of_place], found 3 matches: 1: mdct_inv_float_avx2 - type: mdct_float, len: [16, ā], factors[2]: [2, any], flags: [aligned, out_of_place, inv_only], prio: 544 2: mdct_inv_float_c - type: mdct_float, len: [2, ā], factors[2]: [2, any], flags: [unaligned, out_of_place, inv_only], prio: 96 3: mdct_naive_inv_float_c - type: mdct_float, len: [2, ā], factors[2]: [2, any], flags: [unaligned, out_of_place, inv_only], prio: -130976 For transform of length 64, inverse, fft_float, flags: [aligned, inplace, preshuf, asm_call], found 3 matches: 1: fft_sr_asm_float_avx2 - type: fft_float, len: [64, 131072], factor: 2, flags: [aligned, inplace, out_of_place, preshuf, asm_call], prio: 480 2: fft_sr_asm_float_fma3 - type: fft_float, len: [64, 131072], factor: 2, flags: [aligned, inplace, out_of_place, preshuf, asm_call], prio: 448 3: fft_sr_asm_float_avx - type: fft_float, len: [64, 131072], factor: 2, flags: [aligned, inplace, out_of_place, preshuf, asm_call], prio: 416 Transform tree: mdct_inv_float_avx2 - type: mdct_float, len: 128, factors[2]: [2, any], flags: [aligned, out_of_place, inv_only] fft_sr_asm_float_avx2 - type: fft_float, len: 64, factor: 2, flags: [aligned, inplace, out_of_place, preshuf, asm_call] For transform of length 256, inverse, mdct_float, flags: [aligned, out_of_place], found 3 matches: 1: mdct_inv_float_avx2 - type: mdct_float, len: [16, ā], factors[2]: [2, any], flags: [aligned, out_of_place, inv_only], prio: 544 2: mdct_inv_float_c - type: mdct_float, len: [2, ā], factors[2]: [2, any], flags: [unaligned, out_of_place, inv_only], prio: 96 3: mdct_naive_inv_float_c - type: mdct_float, len: [2, ā], factors[2]: [2, any], flags: [unaligned, out_of_place, inv_only], prio: -130976 For transform of length 128, inverse, fft_float, flags: [aligned, inplace, preshuf, asm_call], found 3 matches: 1: fft_sr_asm_float_avx2 - type: fft_float, len: [64, 131072], factor: 2, flags: [aligned, inplace, out_of_place, preshuf, asm_call], prio: 480 2: fft_sr_asm_float_fma3 - type: fft_float, len: [64, 131072], factor: 2, flags: [aligned, inplace, out_of_place, preshuf, asm_call], prio: 448 3: fft_sr_asm_float_avx - type: fft_float, len: [64, 131072], factor: 2, flags: [aligned, inplace, out_of_place, preshuf, asm_call], prio: 416 Transform tree: mdct_inv_float_avx2 - type: mdct_float, len: 256, factors[2]: [2, any], flags: [aligned, out_of_place, inv_only] fft_sr_asm_float_avx2 - type: fft_float, len: 128, factor: 2, flags: [aligned, inplace, out_of_place, preshuf, asm_call] [matroska,webm @ 000002478c18b1c0] first_dts 0 not matching first dts NOPTS (pts 0, duration 41) in the queue [matroska,webm @ 000002478c18b1c0] All info found [matroska,webm @ 000002478c18b1c0] stream 0: start_time: 0 duration: NOPTS Selecting decoder 'hevc' because of requested hwaccel method cuda Input #0, matroska,webm, from 'V:\OKR 2023 Q3.mkv': Stream #0:0(eng), 13, 1/1000: Video: hevc (Main 10), 1 reference frame, yuv420p10le(tv, bt2020nc/bt2020/smpte2084, topleft), 3840x2160 [SAR 1:1 DAR 16:9], 0/1, 23.98 fps, 23.98 tbr, 1k tbn (default) Successfully opened the file. Parsing a group of options: output url V:\OKR 2023 Q3 1080p.mkv. Applying option vf (set video filters) with argument scale_cuda=w=1920:h=-1:interp_algo=lanczos,hwupload=derive_device=vulkan,libplacebo=apply_dolbyvision=1:format=yuv420p10le:downscaler=ewa_lanczos. Applying option c:v (codec name) with argument hevc_nvenc. Applying option pix_fmt (set pixel format) with argument p010le. Applying option an (disable audio) with argument 1. Successfully parsed a group of options. Opening an output file: V:\OKR 2023 Q3 1080p.mkv. [out#0/matroska @ 000002478a7fef00] No explicit maps, mapping streams automatically... [vost#0:0/hevc_nvenc @ 000002478c24e840] Created video stream from input stream 0:0 [AVFilterGraph @ 000002479c64ce00] Setting 'w' to value '1920' [AVFilterGraph @ 000002479c64ce00] Setting 'h' to value '-1' [AVFilterGraph @ 000002479c64ce00] Setting 'interp_algo' to value 'lanczos' [AVFilterGraph @ 000002479c64ce00] Setting 'derive_device' to value 'vulkan' [AVFilterGraph @ 000002479c64ce00] Setting 'format' to value 'yuv420p10le' [AVFilterGraph @ 000002479c64ce00] Setting 'downscaler' to value 'ewa_lanczos' [libplacebo @ 000002478c24eb00] Initialized libplacebo v6.287.0-rc1-48-gdec96590 (API v290) [AVHWDeviceContext @ 000002478c1ad4c0] Loaded lib: nvcuda.dll [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuInit [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuDriverGetVersion [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuDeviceGetCount [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuDeviceGet [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuDeviceGetAttribute [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuDeviceGetName [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuDeviceComputeCapability [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuCtxCreate_v2 [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuCtxSetLimit [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuCtxPushCurrent_v2 [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuCtxPopCurrent_v2 [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuCtxDestroy_v2 [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuMemAlloc_v2 [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuMemAllocPitch_v2 [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuMemAllocManaged [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuMemsetD8Async [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuMemFree_v2 [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuMemcpy [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuMemcpyAsync [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuMemcpy2D_v2 [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuMemcpy2DAsync_v2 [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuMemcpyHtoD_v2 [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuMemcpyHtoDAsync_v2 [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuMemcpyDtoH_v2 [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuMemcpyDtoHAsync_v2 [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuMemcpyDtoD_v2 [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuMemcpyDtoDAsync_v2 [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuGetErrorName [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuGetErrorString [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuCtxGetDevice [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuDevicePrimaryCtxRetain [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuDevicePrimaryCtxRelease [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuDevicePrimaryCtxSetFlags [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuDevicePrimaryCtxGetState [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuDevicePrimaryCtxReset [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuStreamCreate [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuStreamQuery [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuStreamSynchronize [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuStreamDestroy_v2 [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuStreamAddCallback [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuStreamWaitEvent [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuEventCreate [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuEventDestroy_v2 [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuEventSynchronize [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuEventQuery [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuEventRecord [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuLaunchKernel [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuLinkCreate [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuLinkAddData [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuLinkComplete [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuLinkDestroy [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuModuleLoadData [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuModuleUnload [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuModuleGetFunction [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuModuleGetGlobal [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuTexObjectCreate [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuTexObjectDestroy [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuGLGetDevices_v2 [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuGraphicsGLRegisterImage [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuGraphicsUnregisterResource [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuGraphicsMapResources [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuGraphicsUnmapResources [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuGraphicsSubResourceGetMappedArray [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuGraphicsResourceGetMappedPointer_v2 [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuDeviceGetUuid [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuDeviceGetUuid_v2 [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuDeviceGetLuid [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuDeviceGetByPCIBusId [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuDeviceGetPCIBusId [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuImportExternalMemory [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuDestroyExternalMemory [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuExternalMemoryGetMappedBuffer [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuExternalMemoryGetMappedMipmappedArray [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuMipmappedArrayGetLevel [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuMipmappedArrayDestroy [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuImportExternalSemaphore [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuDestroyExternalSemaphore [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuSignalExternalSemaphoresAsync [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuWaitExternalSemaphoresAsync [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuArrayCreate_v2 [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuArray3DCreate_v2 [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuArrayDestroy [AVHWDeviceContext @ 000002478c1ad4c0] Cannot load optional cuEGLStreamProducerConnect [AVHWDeviceContext @ 000002478c1ad4c0] Cannot load optional cuEGLStreamProducerDisconnect [AVHWDeviceContext @ 000002478c1ad4c0] Cannot load optional cuEGLStreamConsumerDisconnect [AVHWDeviceContext @ 000002478c1ad4c0] Cannot load optional cuEGLStreamProducerPresentFrame [AVHWDeviceContext @ 000002478c1ad4c0] Cannot load optional cuEGLStreamProducerReturnFrame [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuD3D11GetDevice [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuD3D11GetDevices [AVHWDeviceContext @ 000002478c1ad4c0] Loaded sym: cuGraphicsD3D11RegisterResource [AVHWDeviceContext @ 000002478c1ad4c0] Calling cu->cuInit(0) [AVHWDeviceContext @ 000002478c1ad4c0] Calling cu->cuDeviceGet(&hwctx->internal->cuda_device, device_idx) [AVHWDeviceContext @ 000002478c1ad4c0] Calling cu->cuCtxCreate(&hwctx->cuda_ctx, desired_flags, hwctx->internal->cuda_device) [AVHWDeviceContext @ 000002478c1ad4c0] Calling cu->cuCtxPopCurrent(&dummy) detected 24 logical cores [hevc @ 000002478c1c1980] nal_unit_type: 32(VPS), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c1980] Decoding VPS [hevc @ 000002478c1c1980] Main 10 profile bitstream [hevc @ 000002478c1c1980] nal_unit_type: 33(SPS), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c1980] Decoding SPS [hevc @ 000002478c1c1980] Main 10 profile bitstream [hevc @ 000002478c1c1980] Decoding VUI [hevc @ 000002478c1c1980] nal_unit_type: 34(PPS), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c1980] Decoding PPS [file @ 000002479c652740] Setting default whitelist 'file,crypto,data' Successfully opened the file. Stream mapping: Stream #0:0 -> #0:0 (hevc (native) -> hevc (hevc_nvenc)) Press [q] to stop, [?] for help [vost#0:0/hevc_nvenc @ 000002478c24e840] cur_dts is invalid [init:0 i_done:0 finish:0] (this is harmless if it occurs once at the start per stream) [hevc @ 000002478c1c1980] nal_unit_type: 35(AUD), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c1980] nal_unit_type: 32(VPS), nuh_layer_id: 0, temporal_id: 0 [vost#0:0/hevc_nvenc @ 000002478c24e840] cur_dts is invalid [init:0 i_done:0 finish:0] (this is harmless if it occurs once at the start per stream) [hevc @ 000002478c1c1980] nal_unit_type: 33(SPS), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c1980] nal_unit_type: 34(PPS), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c1980] nal_unit_type: 39(SEI_PREFIX), nuh_layer_id: 0, temporal_id: 0 Last message repeated 5 times [hevc @ 000002478c1c1980] nal_unit_type: 19(IDR_W_RADL), nuh_layer_id: 0, temporal_id: 0 Last message repeated 7 times [hevc @ 000002478c1c1980] nal_unit_type: 62(UNSPEC62), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c1980] Decoding VPS [hevc @ 000002478c1c1980] Main 10 profile bitstream [hevc @ 000002478c1c1980] Decoding SPS [hevc @ 000002478c1c1980] Main 10 profile bitstream [hevc @ 000002478c1c1980] Decoding VUI [hevc @ 000002478c1c1980] Decoding PPS [hevc @ 000002478c1c1980] Decoding SEI Last message repeated 1 times [hevc @ 000002478c1c1980] Skipped PREFIX SEI 0 [hevc @ 000002478c1c1980] Decoding SEI Last message repeated 1 times [hevc @ 000002478c1c1980] Skipped PREFIX SEI 6 [hevc @ 000002478c1c1980] Decoding SEI Last message repeated 1 times [hevc @ 000002478c1c1980] Format cuda chosen by get_format(). [hevc @ 000002478c1c1980] Format cuda requires hwaccel initialisation. [AVHWDeviceContext @ 000002478c1ad4c0] Calling cu->cuDeviceGetAttribute(&priv->tex_alignment, 14 , hwctx->internal->cuda_device) [AVHWFramesContext @ 000002479c64d580] CUDA texture alignment: 512 [AVHWDeviceContext @ 000002478c1ad4c0] Calling cu->cuDeviceGetAttribute(&priv->tex_alignment, 14 , hwctx->internal->cuda_device) [AVHWFramesContext @ 000002479c64f000] CUDA texture alignment: 512 [hevc @ 000002478c1c1980] Loaded lib: nvcuvid.dll [hevc @ 000002478c1c1980] Loaded sym: cuvidGetDecoderCaps [hevc @ 000002478c1c1980] Loaded sym: cuvidCreateDecoder [hevc @ 000002478c1c1980] Loaded sym: cuvidDestroyDecoder [hevc @ 000002478c1c1980] Loaded sym: cuvidDecodePicture [hevc @ 000002478c1c1980] Loaded sym: cuvidGetDecodeStatus [hevc @ 000002478c1c1980] Loaded sym: cuvidReconfigureDecoder [hevc @ 000002478c1c1980] Loaded sym: cuvidMapVideoFrame64 [hevc @ 000002478c1c1980] Loaded sym: cuvidUnmapVideoFrame64 [hevc @ 000002478c1c1980] Loaded sym: cuvidCtxLockCreate [hevc @ 000002478c1c1980] Loaded sym: cuvidCtxLockDestroy [hevc @ 000002478c1c1980] Loaded sym: cuvidCtxLock [hevc @ 000002478c1c1980] Loaded sym: cuvidCtxUnlock [hevc @ 000002478c1c1980] Loaded sym: cuvidCreateVideoSource [hevc @ 000002478c1c1980] Loaded sym: cuvidCreateVideoSourceW [hevc @ 000002478c1c1980] Loaded sym: cuvidDestroyVideoSource [hevc @ 000002478c1c1980] Loaded sym: cuvidSetVideoSourceState [hevc @ 000002478c1c1980] Loaded sym: cuvidGetVideoSourceState [hevc @ 000002478c1c1980] Loaded sym: cuvidGetSourceVideoFormat [hevc @ 000002478c1c1980] Loaded sym: cuvidGetSourceAudioFormat [hevc @ 000002478c1c1980] Loaded sym: cuvidCreateVideoParser [hevc @ 000002478c1c1980] Loaded sym: cuvidParseVideoData [hevc @ 000002478c1c1980] Loaded sym: cuvidDestroyVideoParser [hevc @ 000002478c1c1980] Calling decoder->cudl->cuCtxPushCurrent(decoder->cuda_ctx) [hevc @ 000002478c1c1980] Calling decoder->cvdl->cuvidGetDecoderCaps(&caps) [hevc @ 000002478c1c1980] NVDEC capabilities: [hevc @ 000002478c1c1980] format supported: yes, max_mb_count: 262144 [hevc @ 000002478c1c1980] min_width: 144, max_width: 8192 [hevc @ 000002478c1c1980] min_height: 144, max_height: 8192 [hevc @ 000002478c1c1980] Calling decoder->cvdl->cuvidCreateDecoder(&decoder->decoder, params) [hevc @ 000002478c1c1980] Calling decoder->cudl->cuCtxPopCurrent(&dummy) [hevc @ 000002478c1c1980] Mastering Display Metadata: [hevc @ 000002478c1c1980] r(0.6800,0.3200) g(0.2650,0.6900) b(0.1500 0.0600) wp(0.3127, 0.3290) [hevc @ 000002478c1c1980] min_luminance=0.000100, max_luminance=1000.000000 [hevc @ 000002478c1c1980] Content Light Level Metadata: [hevc @ 000002478c1c1980] MaxCLL=1000, MaxFALL=629 [hevc @ 000002478c1c1980] Calling decoder->cudl->cuCtxPushCurrent(decoder->cuda_ctx) [hevc @ 000002478c1c1980] Calling decoder->cvdl->cuvidDecodePicture(decoder->decoder, &ctx->pic_params) [hevc @ 000002478c1c1980] Calling decoder->cudl->cuCtxPopCurrent(&dummy) [hevc @ 000002478c1c1d80] nal_unit_type: 35(AUD), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c1d80] nal_unit_type: 34(PPS), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c1d80] nal_unit_type: 39(SEI_PREFIX), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c1d80] nal_unit_type: 1(TRAIL_R), nuh_layer_id: 0, temporal_id: 0 Last message repeated 7 times [hevc @ 000002478c1c1d80] nal_unit_type: 62(UNSPEC62), nuh_layer_id: 0, temporal_id: 0 [vost#0:0/hevc_nvenc @ 000002478c24e840] cur_dts is invalid [init:0 i_done:0 finish:0] (this is harmless if it occurs once at the start per stream) [hevc @ 000002478c1c1d80] Decoding PPS [hevc @ 000002478c1c1d80] Decoding SEI [hevc @ 000002478c1c1d80] Mastering Display Metadata: [hevc @ 000002478c1c1d80] r(0.6800,0.3200) g(0.2650,0.6900) b(0.1500 0.0600) wp(0.3127, 0.3290) [hevc @ 000002478c1c1d80] min_luminance=0.000100, max_luminance=1000.000000 [hevc @ 000002478c1c1d80] Content Light Level Metadata: [hevc @ 000002478c1c1d80] MaxCLL=1000, MaxFALL=629 [hevc @ 000002478c1c1d80] Output frame with POC 0. [hevc @ 000002478c1c1d80] Calling decoder->cudl->cuCtxPushCurrent(decoder->cuda_ctx) [hevc @ 000002478c1c1d80] Calling decoder->cvdl->cuvidDecodePicture(decoder->decoder, &ctx->pic_params) [hevc @ 000002478c1c1d80] Calling decoder->cudl->cuCtxPopCurrent(&dummy) [vost#0:0/hevc_nvenc @ 000002478c24e840] cur_dts is invalid [init:0 i_done:0 finish:0] (this is harmless if it occurs once at the start per stream) [hevc @ 000002478c1c0580] nal_unit_type: 35(AUD), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c0580] nal_unit_type: 34(PPS), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c0580] nal_unit_type: 39(SEI_PREFIX), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c0580] nal_unit_type: 0(TRAIL_N), nuh_layer_id: 0, temporal_id: 0 Last message repeated 7 times [hevc @ 000002478c1c0580] nal_unit_type: 62(UNSPEC62), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c0580] Decoding PPS [hevc @ 000002478c1c0580] Decoding SEI [hevc @ 000002478c1c0580] Mastering Display Metadata: [hevc @ 000002478c1c0580] r(0.6800,0.3200) g(0.2650,0.6900) b(0.1500 0.0600) wp(0.3127, 0.3290) [hevc @ 000002478c1c0580] min_luminance=0.000100, max_luminance=1000.000000 [hevc @ 000002478c1c0580] Content Light Level Metadata: [hevc @ 000002478c1c0580] MaxCLL=1000, MaxFALL=629 [hevc @ 000002478c1c0580] Output frame with POC 1. [hevc @ 000002478c1c0580] Calling decoder->cudl->cuCtxPushCurrent(decoder->cuda_ctx) [hevc @ 000002478c1c0580] Calling decoder->cvdl->cuvidDecodePicture(decoder->decoder, &ctx->pic_params) [hevc @ 000002478c1c0580] Calling decoder->cudl->cuCtxPopCurrent(&dummy) [hevc @ 000002478c1c0980] nal_unit_type: 35(AUD), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c0980] nal_unit_type: 34(PPS), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c0980] nal_unit_type: 39(SEI_PREFIX), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c0980] nal_unit_type: 0(TRAIL_N), nuh_layer_id: 0, temporal_id: 0 Last message repeated 7 times [hevc @ 000002478c1c0980] nal_unit_type: 62(UNSPEC62), nuh_layer_id: 0, temporal_id: 0 [vost#0:0/hevc_nvenc @ 000002478c24e840] cur_dts is invalid [init:0 i_done:0 finish:0] (this is harmless if it occurs once at the start per stream) [hevc @ 000002478c1c0980] Decoding PPS [hevc @ 000002478c1c0980] Decoding SEI [hevc @ 000002478c1c0980] Mastering Display Metadata: [hevc @ 000002478c1c0980] r(0.6800,0.3200) g(0.2650,0.6900) b(0.1500 0.0600) wp(0.3127, 0.3290) [hevc @ 000002478c1c0980] min_luminance=0.000100, max_luminance=1000.000000 [hevc @ 000002478c1c0980] Content Light Level Metadata: [hevc @ 000002478c1c0980] MaxCLL=1000, MaxFALL=629 [hevc @ 000002478c1c0980] Output frame with POC 2. [hevc @ 000002478c1c0980] Calling decoder->cudl->cuCtxPushCurrent(decoder->cuda_ctx) [hevc @ 000002478c1c0980] Calling decoder->cvdl->cuvidDecodePicture(decoder->decoder, &ctx->pic_params) [hevc @ 000002478c1c0980] Calling decoder->cudl->cuCtxPopCurrent(&dummy) [hevc @ 000002478c1c2180] nal_unit_type: 35(AUD), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c2180] nal_unit_type: 34(PPS), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c2180] nal_unit_type: 39(SEI_PREFIX), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c2180] nal_unit_type: 1(TRAIL_R), nuh_layer_id: 0, temporal_id: 0 [vost#0:0/hevc_nvenc @ 000002478c24e840] cur_dts is invalid [init:0 i_done:0 finish:0] (this is harmless if it occurs once at the start per stream) [hevc @ 000002478c1c2180] nal_unit_type: 1(TRAIL_R), nuh_layer_id: 0, temporal_id: 0 Last message repeated 6 times [hevc @ 000002478c1c2180] nal_unit_type: 62(UNSPEC62), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c2180] Decoding PPS [hevc @ 000002478c1c2180] Decoding SEI [hevc @ 000002478c1c2180] Mastering Display Metadata: [hevc @ 000002478c1c2180] r(0.6800,0.3200) g(0.2650,0.6900) b(0.1500 0.0600) wp(0.3127, 0.3290) [hevc @ 000002478c1c2180] min_luminance=0.000100, max_luminance=1000.000000 [hevc @ 000002478c1c2180] Content Light Level Metadata: [hevc @ 000002478c1c2180] MaxCLL=1000, MaxFALL=629 [hevc @ 000002478c1c2180] Output frame with POC 3. [hevc @ 000002478c1c2180] Calling decoder->cudl->cuCtxPushCurrent(decoder->cuda_ctx) [hevc @ 000002478c1c2180] Calling decoder->cvdl->cuvidDecodePicture(decoder->decoder, &ctx->pic_params) [hevc @ 000002478c1c2180] Calling decoder->cudl->cuCtxPopCurrent(&dummy) [hevc @ 000002478c1c2580] nal_unit_type: 35(AUD), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c2580] nal_unit_type: 34(PPS), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c2580] nal_unit_type: 39(SEI_PREFIX), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c2580] nal_unit_type: 0(TRAIL_N), nuh_layer_id: 0, temporal_id: 0 Last message repeated 7 times [hevc @ 000002478c1c2580] nal_unit_type: 62(UNSPEC62), nuh_layer_id: 0, temporal_id: 0 [vost#0:0/hevc_nvenc @ 000002478c24e840] cur_dts is invalid [init:0 i_done:0 finish:0] (this is harmless if it occurs once at the start per stream) [hevc @ 000002478c1c2580] Decoding PPS [hevc @ 000002478c1c2580] Decoding SEI [hevc @ 000002478c1c2580] Mastering Display Metadata: [hevc @ 000002478c1c2580] r(0.6800,0.3200) g(0.2650,0.6900) b(0.1500 0.0600) wp(0.3127, 0.3290) [hevc @ 000002478c1c2580] min_luminance=0.000100, max_luminance=1000.000000 [hevc @ 000002478c1c2580] Content Light Level Metadata: [hevc @ 000002478c1c2580] MaxCLL=1000, MaxFALL=629 [hevc @ 000002478c1c2580] Output frame with POC 4. [hevc @ 000002478c1c2580] Calling decoder->cudl->cuCtxPushCurrent(decoder->cuda_ctx) [hevc @ 000002478c1c2580] Calling decoder->cvdl->cuvidDecodePicture(decoder->decoder, &ctx->pic_params) [hevc @ 000002478c1c2580] Calling decoder->cudl->cuCtxPopCurrent(&dummy) [vost#0:0/hevc_nvenc @ 000002478c24e840] cur_dts is invalid [init:0 i_done:0 finish:0] (this is harmless if it occurs once at the start per stream) [hevc @ 000002478c1bfd80] nal_unit_type: 35(AUD), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1bfd80] nal_unit_type: 34(PPS), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1bfd80] nal_unit_type: 39(SEI_PREFIX), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1bfd80] nal_unit_type: 0(TRAIL_N), nuh_layer_id: 0, temporal_id: 0 Last message repeated 7 times [hevc @ 000002478c1bfd80] nal_unit_type: 62(UNSPEC62), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1bfd80] Decoding PPS [hevc @ 000002478c1bfd80] Decoding SEI [hevc @ 000002478c1bfd80] Mastering Display Metadata: [hevc @ 000002478c1bfd80] r(0.6800,0.3200) g(0.2650,0.6900) b(0.1500 0.0600) wp(0.3127, 0.3290) [hevc @ 000002478c1bfd80] min_luminance=0.000100, max_luminance=1000.000000 [hevc @ 000002478c1bfd80] Content Light Level Metadata: [hevc @ 000002478c1bfd80] MaxCLL=1000, MaxFALL=629 [hevc @ 000002478c1bfd80] Output frame with POC 5. [hevc @ 000002478c1bfd80] Calling decoder->cudl->cuCtxPushCurrent(decoder->cuda_ctx) [hevc @ 000002478c1bfd80] Calling decoder->cvdl->cuvidDecodePicture(decoder->decoder, &ctx->pic_params) [hevc @ 000002478c1bfd80] Calling decoder->cudl->cuCtxPopCurrent(&dummy) [hevc @ 000002478c1c2d80] nal_unit_type: 35(AUD), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c2d80] nal_unit_type: 34(PPS), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c2d80] nal_unit_type: 39(SEI_PREFIX), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c2d80] nal_unit_type: 1(TRAIL_R), nuh_layer_id: 0, temporal_id: 0 Last message repeated 7 times [hevc @ 000002478c1c2d80] nal_unit_type: 62(UNSPEC62), nuh_layer_id: 0, temporal_id: 0 [vost#0:0/hevc_nvenc @ 000002478c24e840] cur_dts is invalid [init:0 i_done:0 finish:0] (this is harmless if it occurs once at the start per stream) [hevc @ 000002478c1c2d80] Decoding PPS [hevc @ 000002478c1c2d80] Decoding SEI [hevc @ 000002478c1c2d80] Mastering Display Metadata: [hevc @ 000002478c1c2d80] r(0.6800,0.3200) g(0.2650,0.6900) b(0.1500 0.0600) wp(0.3127, 0.3290) [hevc @ 000002478c1c2d80] min_luminance=0.000100, max_luminance=1000.000000 [hevc @ 000002478c1c2d80] Content Light Level Metadata: [hevc @ 000002478c1c2d80] MaxCLL=1000, MaxFALL=629 [hevc @ 000002478c1c2d80] Output frame with POC 6. [hevc @ 000002478c1c2d80] Calling decoder->cudl->cuCtxPushCurrent(decoder->cuda_ctx) [hevc @ 000002478c1c2d80] Calling decoder->cvdl->cuvidDecodePicture(decoder->decoder, &ctx->pic_params) [hevc @ 000002478c1c2d80] Calling decoder->cudl->cuCtxPopCurrent(&dummy) [hevc @ 000002478c1c3180] nal_unit_type: 35(AUD), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c3180] nal_unit_type: 34(PPS), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c3180] nal_unit_type: 39(SEI_PREFIX), nuh_layer_id: 0, temporal_id: 0 [vost#0:0/hevc_nvenc @ 000002478c24e840] cur_dts is invalid [init:0 i_done:0 finish:0] (this is harmless if it occurs once at the start per stream) [hevc @ 000002478c1c3180] nal_unit_type: 0(TRAIL_N), nuh_layer_id: 0, temporal_id: 0 Last message repeated 7 times [hevc @ 000002478c1c3180] nal_unit_type: 62(UNSPEC62), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c3180] Decoding PPS [hevc @ 000002478c1c3180] Decoding SEI [hevc @ 000002478c1c3180] Mastering Display Metadata: [hevc @ 000002478c1c3180] r(0.6800,0.3200) g(0.2650,0.6900) b(0.1500 0.0600) wp(0.3127, 0.3290) [hevc @ 000002478c1c3180] min_luminance=0.000100, max_luminance=1000.000000 [hevc @ 000002478c1c3180] Content Light Level Metadata: [hevc @ 000002478c1c3180] MaxCLL=1000, MaxFALL=629 [hevc @ 000002478c1c3180] Output frame with POC 7. [hevc @ 000002478c1c3180] Calling decoder->cudl->cuCtxPushCurrent(decoder->cuda_ctx) [hevc @ 000002478c1c3180] Calling decoder->cvdl->cuvidDecodePicture(decoder->decoder, &ctx->pic_params) [hevc @ 000002478c1c3180] Calling decoder->cudl->cuCtxPopCurrent(&dummy) [vost#0:0/hevc_nvenc @ 000002478c24e840] cur_dts is invalid [init:0 i_done:0 finish:0] (this is harmless if it occurs once at the start per stream) [hevc @ 000002478c1c1580] nal_unit_type: 35(AUD), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c1580] nal_unit_type: 34(PPS), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c1580] nal_unit_type: 39(SEI_PREFIX), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c1580] nal_unit_type: 0(TRAIL_N), nuh_layer_id: 0, temporal_id: 0 Last message repeated 7 times [hevc @ 000002478c1c1580] nal_unit_type: 62(UNSPEC62), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c1580] Decoding PPS [hevc @ 000002478c1c1580] Decoding SEI [hevc @ 000002478c1c1580] Mastering Display Metadata: [hevc @ 000002478c1c1580] r(0.6800,0.3200) g(0.2650,0.6900) b(0.1500 0.0600) wp(0.3127, 0.3290) [hevc @ 000002478c1c1580] min_luminance=0.000100, max_luminance=1000.000000 [hevc @ 000002478c1c1580] Content Light Level Metadata: [hevc @ 000002478c1c1580] MaxCLL=1000, MaxFALL=629 [hevc @ 000002478c1c1580] Output frame with POC 8. [hevc @ 000002478c1c1580] Calling decoder->cudl->cuCtxPushCurrent(decoder->cuda_ctx) [hevc @ 000002478c1c1580] Calling decoder->cvdl->cuvidDecodePicture(decoder->decoder, &ctx->pic_params) [hevc @ 000002478c1c1580] Calling decoder->cudl->cuCtxPopCurrent(&dummy) [vost#0:0/hevc_nvenc @ 000002478c24e840] cur_dts is invalid [init:0 i_done:0 finish:0] (this is harmless if it occurs once at the start per stream) [hevc @ 000002478c1c3580] nal_unit_type: 35(AUD), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c3580] nal_unit_type: 34(PPS), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c3580] nal_unit_type: 39(SEI_PREFIX), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c3580] nal_unit_type: 1(TRAIL_R), nuh_layer_id: 0, temporal_id: 0 Last message repeated 7 times [hevc @ 000002478c1c3580] nal_unit_type: 62(UNSPEC62), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c3580] Decoding PPS [hevc @ 000002478c1c3580] Decoding SEI [hevc @ 000002478c1c3580] Mastering Display Metadata: [hevc @ 000002478c1c3580] r(0.6800,0.3200) g(0.2650,0.6900) b(0.1500 0.0600) wp(0.3127, 0.3290) [hevc @ 000002478c1c3580] min_luminance=0.000100, max_luminance=1000.000000 [hevc @ 000002478c1c3580] Content Light Level Metadata: [hevc @ 000002478c1c3580] MaxCLL=1000, MaxFALL=629 [hevc @ 000002478c1c3580] Output frame with POC 9. [hevc @ 000002478c1c3580] Calling decoder->cudl->cuCtxPushCurrent(decoder->cuda_ctx) [hevc @ 000002478c1c3580] Calling decoder->cvdl->cuvidDecodePicture(decoder->decoder, &ctx->pic_params) [hevc @ 000002478c1c3580] Calling decoder->cudl->cuCtxPopCurrent(&dummy) [vost#0:0/hevc_nvenc @ 000002478c24e840] cur_dts is invalid [init:0 i_done:0 finish:0] (this is harmless if it occurs once at the start per stream) [hevc @ 000002478c1c0d80] nal_unit_type: 35(AUD), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c0d80] nal_unit_type: 34(PPS), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c0d80] nal_unit_type: 39(SEI_PREFIX), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c0d80] nal_unit_type: 0(TRAIL_N), nuh_layer_id: 0, temporal_id: 0 Last message repeated 7 times [hevc @ 000002478c1c0d80] nal_unit_type: 62(UNSPEC62), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c0d80] Decoding PPS [hevc @ 000002478c1c0d80] Decoding SEI [hevc @ 000002478c1c0d80] Mastering Display Metadata: [hevc @ 000002478c1c0d80] r(0.6800,0.3200) g(0.2650,0.6900) b(0.1500 0.0600) wp(0.3127, 0.3290) [hevc @ 000002478c1c0d80] min_luminance=0.000100, max_luminance=1000.000000 [hevc @ 000002478c1c0d80] Content Light Level Metadata: [hevc @ 000002478c1c0d80] MaxCLL=1000, MaxFALL=629 [hevc @ 000002478c1c0d80] Output frame with POC 10. [hevc @ 000002478c1c0d80] Calling decoder->cudl->cuCtxPushCurrent(decoder->cuda_ctx) [hevc @ 000002478c1c0d80] Calling decoder->cvdl->cuvidDecodePicture(decoder->decoder, &ctx->pic_params) [hevc @ 000002478c1c0d80] Calling decoder->cudl->cuCtxPopCurrent(&dummy) [vost#0:0/hevc_nvenc @ 000002478c24e840] cur_dts is invalid [init:0 i_done:0 finish:0] (this is harmless if it occurs once at the start per stream) [hevc @ 000002478c1c1180] nal_unit_type: 35(AUD), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c1180] nal_unit_type: 34(PPS), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c1180] nal_unit_type: 39(SEI_PREFIX), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c1180] nal_unit_type: 0(TRAIL_N), nuh_layer_id: 0, temporal_id: 0 Last message repeated 7 times [hevc @ 000002478c1c1180] nal_unit_type: 62(UNSPEC62), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c1180] Decoding PPS [hevc @ 000002478c1c1180] Decoding SEI [hevc @ 000002478c1c1180] Mastering Display Metadata: [hevc @ 000002478c1c1180] r(0.6800,0.3200) g(0.2650,0.6900) b(0.1500 0.0600) wp(0.3127, 0.3290) [hevc @ 000002478c1c1180] min_luminance=0.000100, max_luminance=1000.000000 [hevc @ 000002478c1c1180] Content Light Level Metadata: [hevc @ 000002478c1c1180] MaxCLL=1000, MaxFALL=629 [hevc @ 000002478c1c1180] Output frame with POC 11. [hevc @ 000002478c1c1180] Calling decoder->cudl->cuCtxPushCurrent(decoder->cuda_ctx) [hevc @ 000002478c1c1180] Calling decoder->cvdl->cuvidDecodePicture(decoder->decoder, &ctx->pic_params) [hevc @ 000002478c1c1180] Calling decoder->cudl->cuCtxPopCurrent(&dummy) [vost#0:0/hevc_nvenc @ 000002478c24e840] cur_dts is invalid [init:0 i_done:0 finish:0] (this is harmless if it occurs once at the start per stream) [hevc @ 00000247b3b791c0] nal_unit_type: 35(AUD), nuh_layer_id: 0, temporal_id: 0 [hevc @ 00000247b3b791c0] nal_unit_type: 34(PPS), nuh_layer_id: 0, temporal_id: 0 [hevc @ 00000247b3b791c0] nal_unit_type: 39(SEI_PREFIX), nuh_layer_id: 0, temporal_id: 0 [hevc @ 00000247b3b791c0] nal_unit_type: 1(TRAIL_R), nuh_layer_id: 0, temporal_id: 0 Last message repeated 7 times [hevc @ 00000247b3b791c0] nal_unit_type: 62(UNSPEC62), nuh_layer_id: 0, temporal_id: 0 [hevc @ 00000247b3b791c0] Decoding PPS [hevc @ 00000247b3b791c0] Decoding SEI [hevc @ 00000247b3b791c0] Mastering Display Metadata: [hevc @ 00000247b3b791c0] r(0.6800,0.3200) g(0.2650,0.6900) b(0.1500 0.0600) wp(0.3127, 0.3290) [hevc @ 00000247b3b791c0] min_luminance=0.000100, max_luminance=1000.000000 [hevc @ 00000247b3b791c0] Content Light Level Metadata: [hevc @ 00000247b3b791c0] MaxCLL=1000, MaxFALL=629 [hevc @ 00000247b3b791c0] Output frame with POC 12. [hevc @ 00000247b3b791c0] Calling decoder->cudl->cuCtxPushCurrent(decoder->cuda_ctx) [hevc @ 00000247b3b791c0] Calling decoder->cvdl->cuvidDecodePicture(decoder->decoder, &ctx->pic_params) [hevc @ 00000247b3b791c0] Calling decoder->cudl->cuCtxPopCurrent(&dummy) [hevc @ 00000247b3b71dc0] nal_unit_type: 35(AUD), nuh_layer_id: 0, temporal_id: 0 [hevc @ 00000247b3b71dc0] nal_unit_type: 34(PPS), nuh_layer_id: 0, temporal_id: 0 [vost#0:0/hevc_nvenc @ 000002478c24e840] cur_dts is invalid [init:0 i_done:0 finish:0] (this is harmless if it occurs once at the start per stream) [hevc @ 00000247b3b71dc0] nal_unit_type: 39(SEI_PREFIX), nuh_layer_id: 0, temporal_id: 0 [hevc @ 00000247b3b71dc0] nal_unit_type: 0(TRAIL_N), nuh_layer_id: 0, temporal_id: 0 Last message repeated 7 times [hevc @ 00000247b3b71dc0] nal_unit_type: 62(UNSPEC62), nuh_layer_id: 0, temporal_id: 0 [hevc @ 00000247b3b71dc0] Decoding PPS [hevc @ 00000247b3b71dc0] Decoding SEI [hevc @ 00000247b3b71dc0] Mastering Display Metadata: [hevc @ 00000247b3b71dc0] r(0.6800,0.3200) g(0.2650,0.6900) b(0.1500 0.0600) wp(0.3127, 0.3290) [hevc @ 00000247b3b71dc0] min_luminance=0.000100, max_luminance=1000.000000 [hevc @ 00000247b3b71dc0] Content Light Level Metadata: [hevc @ 00000247b3b71dc0] MaxCLL=1000, MaxFALL=629 [hevc @ 00000247b3b71dc0] Output frame with POC 13. [hevc @ 00000247b3b71dc0] Calling decoder->cudl->cuCtxPushCurrent(decoder->cuda_ctx) [hevc @ 00000247b3b71dc0] Calling decoder->cvdl->cuvidDecodePicture(decoder->decoder, &ctx->pic_params) [hevc @ 00000247b3b71dc0] Calling decoder->cudl->cuCtxPopCurrent(&dummy) [vost#0:0/hevc_nvenc @ 000002478c24e840] cur_dts is invalid [init:0 i_done:0 finish:0] (this is harmless if it occurs once at the start per stream) [hevc @ 00000247b3b75dc0] nal_unit_type: 35(AUD), nuh_layer_id: 0, temporal_id: 0 [hevc @ 00000247b3b75dc0] nal_unit_type: 34(PPS), nuh_layer_id: 0, temporal_id: 0 [hevc @ 00000247b3b75dc0] nal_unit_type: 39(SEI_PREFIX), nuh_layer_id: 0, temporal_id: 0 [hevc @ 00000247b3b75dc0] nal_unit_type: 0(TRAIL_N), nuh_layer_id: 0, temporal_id: 0 Last message repeated 7 times [hevc @ 00000247b3b75dc0] nal_unit_type: 62(UNSPEC62), nuh_layer_id: 0, temporal_id: 0 [hevc @ 00000247b3b75dc0] Decoding PPS [hevc @ 00000247b3b75dc0] Decoding SEI [hevc @ 00000247b3b75dc0] Mastering Display Metadata: [hevc @ 00000247b3b75dc0] r(0.6800,0.3200) g(0.2650,0.6900) b(0.1500 0.0600) wp(0.3127, 0.3290) [hevc @ 00000247b3b75dc0] min_luminance=0.000100, max_luminance=1000.000000 [hevc @ 00000247b3b75dc0] Content Light Level Metadata: [hevc @ 00000247b3b75dc0] MaxCLL=1000, MaxFALL=629 [hevc @ 00000247b3b75dc0] Output frame with POC 14. [hevc @ 00000247b3b75dc0] Calling decoder->cudl->cuCtxPushCurrent(decoder->cuda_ctx) [hevc @ 00000247b3b75dc0] Calling decoder->cvdl->cuvidDecodePicture(decoder->decoder, &ctx->pic_params) [hevc @ 00000247b3b75dc0] Calling decoder->cudl->cuCtxPopCurrent(&dummy) [hevc @ 000002478c1c1980] nal_unit_type: 35(AUD), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c1980] nal_unit_type: 34(PPS), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c1980] nal_unit_type: 39(SEI_PREFIX), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c1980] nal_unit_type: 1(TRAIL_R), nuh_layer_id: 0, temporal_id: 0 Last message repeated 6 times [hevc @ 000002478a7fe740] Calling decoder->cudl->cuCtxPushCurrent(decoder->cuda_ctx) [hevc @ 000002478c1c1980] nal_unit_type: 1(TRAIL_R), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478c1c1980] nal_unit_type: 62(UNSPEC62), nuh_layer_id: 0, temporal_id: 0 [hevc @ 000002478a7fe740] Calling decoder->cvdl->cuvidMapVideoFrame(decoder->decoder, cf->idx, &devptr, &pitch, &vpp) [hevc @ 000002478c1c1980] Decoding PPS [hevc @ 000002478a7fe740] Calling decoder->cudl->cuCtxPopCurrent(&dummy) [AVHWDeviceContext @ 000002478c1ad4c0] Calling cu->cuCtxPushCurrent(hwctx->cuda_ctx) [hevc @ 000002478c1c1980] Decoding SEI [AVHWDeviceContext @ 000002478c1ad4c0] Calling cu->cuMemAlloc(&data, size) [AVHWDeviceContext @ 000002478c1ad4c0] Calling cu->cuCtxPopCurrent(&dummy) [AVHWDeviceContext @ 000002478c1ad4c0] Calling cu->cuCtxPushCurrent(hwctx->cuda_ctx) [AVHWDeviceContext @ 000002478c1ad4c0] Calling cu->cuMemcpy2DAsync(&cpy, hwctx->stream) [hevc @ 000002478c1c1980] Mastering Display Metadata: [hevc @ 000002478c1c1980] r(0.6800,0.3200) g(0.2650,0.6900) b(0.1500 0.0600) wp(0.3127, 0.3290) [hevc @ 000002478c1c1980] min_luminance=0.000100, max_luminance=1000.000000 [hevc @ 000002478c1c1980] Content Light Level Metadata: [hevc @ 000002478c1c1980] MaxCLL=1000, MaxFALL=629 [hevc @ 000002478c1c1980] Output frame with POC 15. [AVHWDeviceContext @ 000002478c1ad4c0] Calling cu->cuMemcpy2DAsync(&cpy, hwctx->stream) [hevc @ 000002478c1c1980] Calling decoder->cudl->cuCtxPushCurrent(decoder->cuda_ctx) [AVHWDeviceContext @ 000002478c1ad4c0] Calling cu->cuCtxPopCurrent(&dummy) [hevc @ 000002478c1c1980] Calling decoder->cvdl->cuvidDecodePicture(decoder->decoder, &ctx->pic_params) [hevc @ 000002478c1c1980] Calling decoder->cudl->cuCtxPopCurrent(&dummy) [AVHWDeviceContext @ 000002478c1ad4c0] Calling decoder->cudl->cuCtxPushCurrent(decoder->cuda_ctx) [AVHWDeviceContext @ 000002478c1ad4c0] Calling decoder->cvdl->cuvidUnmapVideoFrame(decoder->decoder, devptr) [AVHWDeviceContext @ 000002478c1ad4c0] Calling decoder->cudl->cuCtxPopCurrent(&dummy) [AVFilterGraph @ 000002478c25d380] Setting 'w' to value '1920' [AVFilterGraph @ 000002478c25d380] Setting 'h' to value '-1' [AVFilterGraph @ 000002478c25d380] Setting 'interp_algo' to value 'lanczos' [AVFilterGraph @ 000002478c25d380] Setting 'derive_device' to value 'vulkan' [AVFilterGraph @ 000002478c25d380] Setting 'format' to value 'yuv420p10le' [AVFilterGraph @ 000002478c25d380] Setting 'downscaler' to value 'ewa_lanczos' [libplacebo @ 00000247ecf50480] Initialized libplacebo v6.287.0-rc1-48-gdec96590 (API v290) [graph 0 input from stream 0:0 @ 000002478c1a32c0] Setting 'video_size' to value '3840x2160' [graph 0 input from stream 0:0 @ 000002478c1a32c0] Setting 'pix_fmt' to value '117' [graph 0 input from stream 0:0 @ 000002478c1a32c0] Setting 'time_base' to value '1/1000' [graph 0 input from stream 0:0 @ 000002478c1a32c0] Setting 'pixel_aspect' to value '1/1' [graph 0 input from stream 0:0 @ 000002478c1a32c0] Setting 'frame_rate' to value '24000/1001' [graph 0 input from stream 0:0 @ 000002478c1a32c0] w:3840 h:2160 pixfmt:cuda tb:1/1000 fr:24000/1001 sar:1/1 [format @ 000002478c1a3ac0] Setting 'pix_fmts' to value 'p010le' [libplacebo @ 00000247ecf50480] Imported vulkan device properties: [libplacebo @ 00000247ecf50480] Device Name: NVIDIA RTX A5500 Laptop GPU [libplacebo @ 00000247ecf50480] Device ID: 10de:2438 [libplacebo @ 00000247ecf50480] Driver version: 86064000 [libplacebo @ 00000247ecf50480] API version: 1.3.242 [libplacebo @ 00000247ecf50480] Restricting API version to 1.3.0... new version 1.3.0 [libplacebo @ 00000247ecf50480] vk->CreateCommandPool(vk->dev, &cinfo, PL_VK_ALLOC, &pool->pool) Last message repeated 2 times [libplacebo @ 00000247ecf50480] Memory heaps supported by device: [libplacebo @ 00000247ecf50480] 0: flags 0x1 size 15G [libplacebo @ 00000247ecf50480] 1: flags 0x0 size 15G [libplacebo @ 00000247ecf50480] Memory types supported by device: [libplacebo @ 00000247ecf50480] 0: flags 0x0 heap 1 [libplacebo @ 00000247ecf50480] 1: flags 0x1 heap 0 [libplacebo @ 00000247ecf50480] 2: flags 0x6 heap 1 [libplacebo @ 00000247ecf50480] 3: flags 0xe heap 1 [libplacebo @ 00000247ecf50480] 4: flags 0x7 heap 0 [libplacebo @ 00000247ecf50480] Memory summary: 0 used 0 res 0 alloc, efficiency 100.00%, utilization 100.00%, max page: 1011M [libplacebo @ 00000247ecf50480] shaderc SPIR-V version 1.6 rev 1 [libplacebo @ 00000247ecf50480] Initialized SPIR-V compiler 'shaderc' [libplacebo @ 00000247ecf50480] Minimum texel alignment: 48 [libplacebo @ 00000247ecf50480] vk->CreateSampler(vk->dev, &sinfo, PL_VK_ALLOC, &p->samplers[s][a]) Last message repeated 5 times [libplacebo @ 00000247ecf50480] GPU information: [libplacebo @ 00000247ecf50480] GLSL version: 450 (vulkan) [libplacebo @ 00000247ecf50480] max_shmem_size: 49152 [libplacebo @ 00000247ecf50480] max_group_threads: 1024 [libplacebo @ 00000247ecf50480] max_group_size[0]: 1024 [libplacebo @ 00000247ecf50480] max_group_size[1]: 1024 [libplacebo @ 00000247ecf50480] max_group_size[2]: 64 [libplacebo @ 00000247ecf50480] subgroup_size: 32 [libplacebo @ 00000247ecf50480] min_gather_offset: -32 [libplacebo @ 00000247ecf50480] max_gather_offset: 31 [libplacebo @ 00000247ecf50480] Limits: [libplacebo @ 00000247ecf50480] thread_safe: 1 [libplacebo @ 00000247ecf50480] callbacks: 1 [libplacebo @ 00000247ecf50480] max_buf_size: 17014579200 [libplacebo @ 00000247ecf50480] max_ubo_size: 65536 [libplacebo @ 00000247ecf50480] max_ssbo_size: 4294967295 [libplacebo @ 00000247ecf50480] max_vbo_size: 16975396864 [libplacebo @ 00000247ecf50480] max_mapped_size: 17014579200 [libplacebo @ 00000247ecf50480] max_buffer_texels: 134217728 [libplacebo @ 00000247ecf50480] align_host_ptr: 4096 [libplacebo @ 00000247ecf50480] host_cached: 1 [libplacebo @ 00000247ecf50480] max_tex_1d_dim: 32768 [libplacebo @ 00000247ecf50480] max_tex_2d_dim: 32768 [libplacebo @ 00000247ecf50480] max_tex_3d_dim: 16384 [libplacebo @ 00000247ecf50480] blittable_1d_3d: 1 [libplacebo @ 00000247ecf50480] buf_transfer: 1 [libplacebo @ 00000247ecf50480] align_tex_xfer_pitch: 1 [libplacebo @ 00000247ecf50480] align_tex_xfer_offset: 4 [libplacebo @ 00000247ecf50480] max_variable_comps: 0 [libplacebo @ 00000247ecf50480] max_constants: 18446744073709551615 [libplacebo @ 00000247ecf50480] max_pushc_size: 256 [libplacebo @ 00000247ecf50480] align_vertex_stride: 1 [libplacebo @ 00000247ecf50480] max_dispatch[0]: 2147483647 [libplacebo @ 00000247ecf50480] max_dispatch[1]: 65535 [libplacebo @ 00000247ecf50480] max_dispatch[2]: 65535 [libplacebo @ 00000247ecf50480] fragment_queues: 16 [libplacebo @ 00000247ecf50480] compute_queues: 8 [libplacebo @ 00000247ecf50480] External API interop: [libplacebo @ 00000247ecf50480] PCI: 0000:01:00:0 [libplacebo @ 00000247ecf50480] buf export caps: 0x16 [libplacebo @ 00000247ecf50480] buf import caps: 0x16 [libplacebo @ 00000247ecf50480] tex export caps: 0x16 [libplacebo @ 00000247ecf50480] tex import caps: 0x16 [libplacebo @ 00000247ecf50480] sync export caps: 0x6 [libplacebo @ 00000247ecf50480] sync import caps: 0x0 [libplacebo @ 00000247ecf50480] GPU texture formats: [libplacebo @ 00000247ecf50480] NAME TYPE SIZE COMP CAPS EMU DEPTH HOST_BITS GLSL_TYPE GLSL_FMT FOURCC [libplacebo @ 00000247ecf50480] r8 UNORM 1 R SsLRbBVutHWG n {8 0 0 0 } {8 0 0 0 } float r8 R8 [libplacebo @ 00000247ecf50480] r8s SNORM 1 R SsLRbBVutHWG n {8 0 0 0 } {8 0 0 0 } float r8_snorm [libplacebo @ 00000247ecf50480] rg8 UNORM 2 RG SsLRbBVutHWG n {8 8 0 0 } {8 8 0 0 } vec2 rg8 GR88 [libplacebo @ 00000247ecf50480] rg8s SNORM 2 RG SsLRbBVutHWG n {8 8 0 0 } {8 8 0 0 } vec2 rg8_snorm [libplacebo @ 00000247ecf50480] rgba8 UNORM 4 RGBA SsLRbBVutHWG n {8 8 8 8 } {8 8 8 8 } vec4 rgba8 AB24 [libplacebo @ 00000247ecf50480] rgba8s SNORM 4 RGBA SsLRbBVutHWG n {8 8 8 8 } {8 8 8 8 } vec4 rgba8_snorm [libplacebo @ 00000247ecf50480] bgra8 UNORM 4 BGRA SsLRbBVutHWG n {8 8 8 8 } {8 8 8 8 } vec4 rgba8 AR24 [libplacebo @ 00000247ecf50480] rgb10a2 UNORM 4 RGBA SsLRbBVutHWG n {10 10 10 2 } {10 10 10 2 } vec4 rgb10_a2 AB30 [libplacebo @ 00000247ecf50480] r16 UNORM 2 R SsLRbBVutHWG n {16 0 0 0 } {16 0 0 0 } float r16 R16 [libplacebo @ 00000247ecf50480] r16hf FLOAT 2 R SsLRbBVutHWG n {16 0 0 0 } {16 0 0 0 } float r16f [libplacebo @ 00000247ecf50480] r16s SNORM 2 R SsLRbBVutHWG n {16 0 0 0 } {16 0 0 0 } float r16_snorm [libplacebo @ 00000247ecf50480] rg16 UNORM 4 RG SsLRbBVutHWG n {16 16 0 0 } {16 16 0 0 } vec2 rg16 GR32 [libplacebo @ 00000247ecf50480] rg16hf FLOAT 4 RG SsLRbBVutHWG n {16 16 0 0 } {16 16 0 0 } vec2 rg16f [libplacebo @ 00000247ecf50480] rg16s SNORM 4 RG SsLRbBVutHWG n {16 16 0 0 } {16 16 0 0 } vec2 rg16_snorm [libplacebo @ 00000247ecf50480] rgba16 UNORM 8 RGBA SsLRbBVutHWG n {16 16 16 16} {16 16 16 16} vec4 rgba16 [libplacebo @ 00000247ecf50480] rgba16hf FLOAT 8 RGBA SsLRbBVutHWG n {16 16 16 16} {16 16 16 16} vec4 rgba16f AB4H [libplacebo @ 00000247ecf50480] rgba16s SNORM 8 RGBA SsLRbBVutHWG n {16 16 16 16} {16 16 16 16} vec4 rgba16_snorm [libplacebo @ 00000247ecf50480] r32f FLOAT 4 R SsLRbBVutHWG n {32 0 0 0 } {32 0 0 0 } float r32f [libplacebo @ 00000247ecf50480] rg32f FLOAT 8 RG SsLRbBVutHWG n {32 32 0 0 } {32 32 0 0 } vec2 rg32f [libplacebo @ 00000247ecf50480] rgba32f FLOAT 16 RGBA SsLRbBVutHWG n {32 32 32 32} {32 32 32 32} vec4 rgba32f [libplacebo @ 00000247ecf50480] r8i SINT 1 R Ss-R-BVutHWG n {8 0 0 0 } {8 0 0 0 } int r8i [libplacebo @ 00000247ecf50480] r8u UINT 1 R Ss-R-BVutHWG n {8 0 0 0 } {8 0 0 0 } uint r8ui [libplacebo @ 00000247ecf50480] rg8i SINT 2 RG Ss-R-BVutHWG n {8 8 0 0 } {8 8 0 0 } ivec2 rg8i [libplacebo @ 00000247ecf50480] rg8u UINT 2 RG Ss-R-BVutHWG n {8 8 0 0 } {8 8 0 0 } uvec2 rg8ui [libplacebo @ 00000247ecf50480] rgba8i SINT 4 RGBA Ss-R-BVutHWG n {8 8 8 8 } {8 8 8 8 } ivec4 rgba8i [libplacebo @ 00000247ecf50480] rgba8u UINT 4 RGBA Ss-R-BVutHWG n {8 8 8 8 } {8 8 8 8 } uvec4 rgba8ui [libplacebo @ 00000247ecf50480] rgb10a2u UINT 4 RGBA Ss-R-BVutHWG n {10 10 10 2 } {10 10 10 2 } uvec4 rgb10_a2ui [libplacebo @ 00000247ecf50480] r16i SINT 2 R Ss-R-BVutHWG n {16 0 0 0 } {16 0 0 0 } int r16i [libplacebo @ 00000247ecf50480] r16u UINT 2 R Ss-R-BVutHWG n {16 0 0 0 } {16 0 0 0 } uint r16ui [libplacebo @ 00000247ecf50480] rg16i SINT 4 RG Ss-R-BVutHWG n {16 16 0 0 } {16 16 0 0 } ivec2 rg16i [libplacebo @ 00000247ecf50480] rg16u UINT 4 RG Ss-R-BVutHWG n {16 16 0 0 } {16 16 0 0 } uvec2 rg16ui [libplacebo @ 00000247ecf50480] rgba16i SINT 8 RGBA Ss-R-BVutHWG n {16 16 16 16} {16 16 16 16} ivec4 rgba16i [libplacebo @ 00000247ecf50480] rgba16u UINT 8 RGBA Ss-R-BVutHWG n {16 16 16 16} {16 16 16 16} uvec4 rgba16ui [libplacebo @ 00000247ecf50480] r32i SINT 4 R Ss-R-BVutHWG n {32 0 0 0 } {32 0 0 0 } int r32i [libplacebo @ 00000247ecf50480] r32u UINT 4 R Ss-R-BVutHWG n {32 0 0 0 } {32 0 0 0 } uint r32ui [libplacebo @ 00000247ecf50480] rg32i SINT 8 RG Ss-R-BVutHWG n {32 32 0 0 } {32 32 0 0 } ivec2 rg32i [libplacebo @ 00000247ecf50480] rg32u UINT 8 RG Ss-R-BVutHWG n {32 32 0 0 } {32 32 0 0 } uvec2 rg32ui [libplacebo @ 00000247ecf50480] rgba32i SINT 16 RGBA Ss-R-BVutHWG n {32 32 32 32} {32 32 32 32} ivec4 rgba32i [libplacebo @ 00000247ecf50480] rgba32u UINT 16 RGBA Ss-R-BVutHWG n {32 32 32 32} {32 32 32 32} uvec4 rgba32ui [libplacebo @ 00000247ecf50480] bgr10a2 UNORM 4 BGRA S-LRbBVu-H-G n {10 10 10 2 } {10 10 10 2 } vec4 AR30 [libplacebo @ 00000247ecf50480] bgr5a1 UNORM 2 BGRA S-LRbB-u-H-G n {5 5 5 1 } {5 5 5 1 } vec4 AR15 [libplacebo @ 00000247ecf50480] bgr565 UNORM 2 BGR S-LRbB-u-H-G n {5 6 5 0 } {5 6 5 0 } vec3 RG16 [libplacebo @ 00000247ecf50480] a1rgb5 UNORM 2 ARGB S-L----u-H-G n {1 5 5 5 } {1 5 5 5 } vec4 BA15 [libplacebo @ 00000247ecf50480] a1bgr5 UNORM 2 ABGR S-L----u-H-G n {1 5 5 5 } {1 5 5 5 } vec4 RA15 [libplacebo @ 00000247ecf50480] gr4 UNORM 1 GR S-L----u-H-G n {4 4 0 0 } {4 4 0 0 } vec2 [libplacebo @ 00000247ecf50480] argb4 UNORM 2 ARGB S-L----u-H-G n {4 4 4 4 } {4 4 4 4 } vec4 BA12 [libplacebo @ 00000247ecf50480] abgr4 UNORM 2 ABGR S-L----u-H-G n {4 4 4 4 } {4 4 4 4 } vec4 RA12 [libplacebo @ 00000247ecf50480] rgb565 UNORM 2 RGB S-L----u-H-G n {5 6 5 0 } {5 6 5 0 } vec3 BG16 [libplacebo @ 00000247ecf50480] bgra8i SINT 4 BGRA S-----Vu-H-G n {8 8 8 8 } {8 8 8 8 } ivec4 [libplacebo @ 00000247ecf50480] bgra8u UINT 4 BGRA S-----Vu-H-G n {8 8 8 8 } {8 8 8 8 } uvec4 [libplacebo @ 00000247ecf50480] bgr10a2u UINT 4 BGRA S-----Vu-H-G n {10 10 10 2 } {10 10 10 2 } uvec4 [libplacebo @ 00000247ecf50480] rx10 UNORM 2 R S-L------H-G n {10 0 0 0 } {16 0 0 0 } [libplacebo @ 00000247ecf50480] rxgx10 UNORM 4 RG S-L------H-G n {10 10 0 0 } {16 16 0 0 } [libplacebo @ 00000247ecf50480] rxgxbxax10 UNORM 8 RGBA S-L------H-G n {10 10 10 10} {16 16 16 16} AB10 [libplacebo @ 00000247ecf50480] rx12 UNORM 2 R S-L------H-G n {12 0 0 0 } {16 0 0 0 } [libplacebo @ 00000247ecf50480] rxgx12 UNORM 4 RG S-L------H-G n {12 12 0 0 } {16 16 0 0 } [libplacebo @ 00000247ecf50480] rxgxbxax12 UNORM 8 RGBA S-L------H-G n {12 12 12 12} {16 16 16 16} [libplacebo @ 00000247ecf50480] rgb32f FLOAT 12 RGB ------Vu-H-- n {32 32 32 0 } {32 32 32 0 } vec3 [libplacebo @ 00000247ecf50480] rgb32i SINT 12 RGB ------Vu-H-- n {32 32 32 0 } {32 32 32 0 } ivec3 [libplacebo @ 00000247ecf50480] rgb32u UINT 12 RGB ------Vu-H-- n {32 32 32 0 } {32 32 32 0 } uvec3 [libplacebo @ 00000247ecf50480] rgb8i SINT 3 RGB ------V--H-- n {8 8 8 0 } {8 8 8 0 } ivec3 [libplacebo @ 00000247ecf50480] rgb8s SNORM 3 RGB ------V--H-- n {8 8 8 0 } {8 8 8 0 } vec3 [libplacebo @ 00000247ecf50480] rgb8u UINT 3 RGB ------V--H-- n {8 8 8 0 } {8 8 8 0 } uvec3 [libplacebo @ 00000247ecf50480] bgr8 UNORM 3 BGR ------V--H-- n {8 8 8 0 } {8 8 8 0 } vec3 RG24 [libplacebo @ 00000247ecf50480] bgr8i SINT 3 BGR ------V--H-- n {8 8 8 0 } {8 8 8 0 } ivec3 [libplacebo @ 00000247ecf50480] bgr8u UINT 3 BGR ------V--H-- n {8 8 8 0 } {8 8 8 0 } uvec3 [libplacebo @ 00000247ecf50480] rgb10a2i SINT 4 RGBA ------V--H-- n {10 10 10 2 } {10 10 10 2 } ivec4 [libplacebo @ 00000247ecf50480] rgb10a2s SNORM 4 RGBA ------V--H-- n {10 10 10 2 } {10 10 10 2 } vec4 [libplacebo @ 00000247ecf50480] bgr10a2i SINT 4 BGRA ------V--H-- n {10 10 10 2 } {10 10 10 2 } ivec4 [libplacebo @ 00000247ecf50480] bgr10a2s SNORM 4 BGRA ------V--H-- n {10 10 10 2 } {10 10 10 2 } vec4 [libplacebo @ 00000247ecf50480] rgb16hf FLOAT 6 RGB ------V--H-- n {16 16 16 0 } {16 16 16 0 } vec3 [libplacebo @ 00000247ecf50480] rgb16i SINT 6 RGB ------V--H-- n {16 16 16 0 } {16 16 16 0 } ivec3 [libplacebo @ 00000247ecf50480] rgb16s SNORM 6 RGB ------V--H-- n {16 16 16 0 } {16 16 16 0 } vec3 [libplacebo @ 00000247ecf50480] rgb16u UINT 6 RGB ------V--H-- n {16 16 16 0 } {16 16 16 0 } uvec3 [libplacebo @ 00000247ecf50480] rgb8 UNORM 3 RGB SsLRbBV--HWG y {8 8 8 0 } {8 8 8 0 } vec3 rgba8 BG24 [libplacebo @ 00000247ecf50480] rgb16 UNORM 6 RGB SsLRbBV--HWG y {16 16 16 0 } {16 16 16 0 } vec3 rgba16 [libplacebo @ 00000247ecf50480] r16f FLOAT 4 R SsLRbB---HWG y {16 0 0 0 } {32 0 0 0 } r16f [libplacebo @ 00000247ecf50480] rg16f FLOAT 8 RG SsLRbB---HWG y {16 16 0 0 } {32 32 0 0 } rg16f [libplacebo @ 00000247ecf50480] rgba16f FLOAT 16 RGBA SsLRbB---HWG y {16 16 16 16} {32 32 32 32} rgba16f [libplacebo @ 00000247ecf50480] rgb16f FLOAT 12 RGB ---------H-- y {16 16 16 0 } {32 32 32 0 } [libplacebo @ 00000247ecf50480] g8_b8_r8_420 UNORM 0 ------------ n {8 8 8 0 } {0 0 0 0 } YU12 [libplacebo @ 00000247ecf50480] g8_b8_r8_422 UNORM 0 ------------ n {8 8 8 0 } {0 0 0 0 } YU16 [libplacebo @ 00000247ecf50480] g8_b8_r8_444 UNORM 0 ------------ n {8 8 8 0 } {0 0 0 0 } YU24 [libplacebo @ 00000247ecf50480] g8_br8_420 UNORM 0 ------------ n {8 8 8 0 } {0 0 0 0 } NV12 [libplacebo @ 00000247ecf50480] g8_br8_422 UNORM 0 ------------ n {8 8 8 0 } {0 0 0 0 } NV16 [libplacebo @ 00000247ecf50480] g8_br8_444 UNORM 0 ------------ n {8 8 8 0 } {0 0 0 0 } NV24 [libplacebo @ 00000247ecf50480] gx10_bx10_rx10_420 UNORM 0 ------------ n {10 10 10 0 } {0 0 0 0 } [libplacebo @ 00000247ecf50480] gx10_bx10_rx10_422 UNORM 0 ------------ n {10 10 10 0 } {0 0 0 0 } [libplacebo @ 00000247ecf50480] gx10_bx10_rx10_444 UNORM 0 ------------ n {10 10 10 0 } {0 0 0 0 } Q410 [libplacebo @ 00000247ecf50480] gx10_bxrx10_420 UNORM 0 ------------ n {10 10 10 0 } {0 0 0 0 } P010 [libplacebo @ 00000247ecf50480] gx10_bxrx10_422 UNORM 0 ------------ n {10 10 10 0 } {0 0 0 0 } P210 [libplacebo @ 00000247ecf50480] gx10_bxrx10_444 UNORM 0 ------------ n {10 10 10 0 } {0 0 0 0 } [libplacebo @ 00000247ecf50480] gx12_bx12_rx12_420 UNORM 0 ------------ n {12 12 12 0 } {0 0 0 0 } [libplacebo @ 00000247ecf50480] gx12_bx12_rx12_422 UNORM 0 ------------ n {12 12 12 0 } {0 0 0 0 } [libplacebo @ 00000247ecf50480] gx12_bx12_rx12_444 UNORM 0 ------------ n {12 12 12 0 } {0 0 0 0 } [libplacebo @ 00000247ecf50480] gx12_bxrx12_420 UNORM 0 ------------ n {12 12 12 0 } {0 0 0 0 } P012 [libplacebo @ 00000247ecf50480] gx12_bxrx12_422 UNORM 0 ------------ n {12 12 12 0 } {0 0 0 0 } [libplacebo @ 00000247ecf50480] gx12_bxrx12_444 UNORM 0 ------------ n {12 12 12 0 } {0 0 0 0 } [libplacebo @ 00000247ecf50480] g16_b16_r16_420 UNORM 0 ------------ n {16 16 16 0 } {0 0 0 0 } [libplacebo @ 00000247ecf50480] g16_b16_r16_422 UNORM 0 ------------ n {16 16 16 0 } {0 0 0 0 } [libplacebo @ 00000247ecf50480] g16_b16_r16_444 UNORM 0 ------------ n {16 16 16 0 } {0 0 0 0 } [libplacebo @ 00000247ecf50480] g16_br16_420 UNORM 0 ------------ n {16 16 16 0 } {0 0 0 0 } P016 [libplacebo @ 00000247ecf50480] g16_br16_422 UNORM 0 ------------ n {16 16 16 0 } {0 0 0 0 } [libplacebo @ 00000247ecf50480] g16_br16_444 UNORM 0 ------------ n {16 16 16 0 } {0 0 0 0 } [auto_scale_0 @ 000002478c1a3bc0] w:iw h:ih flags:'' interl:0 [format @ 000002478c1a3ac0] auto-inserting filter 'auto_scale_0' between the filter 'Parsed_libplacebo_2' and the filter 'format' [AVFilterGraph @ 000002478c25d380] query_formats: 6 queried, 4 merged, 1 already done, 0 delayed [AVHWDeviceContext @ 000002478c1ad4c0] Calling cu->cuDeviceGetAttribute(&priv->tex_alignment, 14 , hwctx->internal->cuda_device) [AVHWFramesContext @ 000002478c25cf40] CUDA texture alignment: 512 [AVHWDeviceContext @ 000002478c1ad4c0] Calling cu->cuCtxPushCurrent(hwctx->cuda_ctx) [AVHWDeviceContext @ 000002478c1ad4c0] Calling cu->cuMemAlloc(&data, size) [AVHWDeviceContext @ 000002478c1ad4c0] Calling cu->cuCtxPopCurrent(&dummy) [Parsed_scale_cuda_0 @ 000002478c1a31c0] w:3840 h:2160 fmt:p010le -> w:1920 h:1080 fmt:p010le [Parsed_scale_cuda_0 @ 000002478c1a31c0] Calling cu->cuCtxPushCurrent(cuda_ctx) [Parsed_scale_cuda_0 @ 000002478c1a31c0] Calling cu->cuModuleLoadData(cu_module, buf) [Parsed_scale_cuda_0 @ 000002478c1a31c0] Calling cu->cuModuleGetFunction(&s->cu_func, s->cu_module, buf) [Parsed_scale_cuda_0 @ 000002478c1a31c0] Calling cu->cuModuleGetFunction(&s->cu_func_uv, s->cu_module, buf) [Parsed_scale_cuda_0 @ 000002478c1a31c0] Calling cu->cuCtxPopCurrent(&dummy) [hwupload @ 00000247ee968340] Surface format is cuda. [auto_scale_0 @ 000002478c1a3bc0] w:1920 h:1080 fmt:yuv420p10le sar:1/1 -> w:1920 h:1080 fmt:p010le sar:1/1 flags:0x00000004 [Parsed_scale_cuda_0 @ 000002478c1a31c0] Calling cu->cuCtxPushCurrent(s->hwctx->cuda_ctx) [Parsed_scale_cuda_0 @ 000002478c1a31c0] Calling cu->cuCtxPushCurrent(cuda_ctx) [Parsed_scale_cuda_0 @ 000002478c1a31c0] Calling cu->cuTexObjectCreate(&tex[i], &res_desc, &tex_desc, ((void *)0)) Last message repeated 1 times [Parsed_scale_cuda_0 @ 000002478c1a31c0] Calling cu->cuLaunchKernel(func, ( ((dst_width) + (32) - 1) / (32) ), ( ((dst_height) + (16) - 1) / (16) ), 1, 32, 16, 1, 0, s->cu_stream, args_uchar, ((void *)0)) Last message repeated 1 times [Parsed_scale_cuda_0 @ 000002478c1a31c0] Calling cu->cuTexObjectDestroy(tex[i]) Last message repeated 1 times [Parsed_scale_cuda_0 @ 000002478c1a31c0] Calling cu->cuCtxPopCurrent(&dummy) [AVHWDeviceContext @ 000002478c1ad4c0] Calling cu->cuCtxPushCurrent(hwctx->cuda_ctx) [AVHWDeviceContext @ 000002478c1ad4c0] Calling cu->cuMemAlloc(&data, size) [AVHWDeviceContext @ 000002478c1ad4c0] Calling cu->cuCtxPopCurrent(&dummy) [Parsed_scale_cuda_0 @ 000002478c1a31c0] Calling cu->cuCtxPopCurrent(&dummy) [AVHWDeviceContext @ 000002478c1ad4c0] Calling cu->cuCtxPushCurrent(cuda_dev->cuda_ctx) [AVHWDeviceContext @ 000002478c1ad4c0] Calling cu->cuImportExternalMemory(&dst_int->ext_mem[i], &ext_desc) [AVHWDeviceContext @ 000002478c1ad4c0] Calling cu->cuExternalMemoryGetMappedMipmappedArray(&dst_int->cu_mma[i], dst_int->ext_mem[i], &tex_desc) [AVHWDeviceContext @ 000002478c1ad4c0] Calling cu->cuMipmappedArrayGetLevel(&dst_int->cu_array[i], dst_int->cu_mma[i], 0) [AVHWDeviceContext @ 000002478c1ad4c0] Calling cu->cuImportExternalSemaphore(&dst_int->cu_sem[i], &ext_sem_desc)
Cannot comment on whether there's an AVFrames-bug, but happy to provide further troubleshooting details.
cc @philipl
A cuda <-> vulkan interop command line looks something like this:
ffmpeg -hwaccel cuda -hwaccel_output_format cuda -i input.mp4 -vf hwupload=derive_device=vulkan,hflip_vulkan,hwupload=derive_device=cuda -c:v hevc_nvenc -preset p1 -tune 2 -y output.mp4
You must hwupload with derive_device
to allow the two APIs to work with each other.
But with that said, I can't get libplacebo to work as the vulkan filter. ffmpeg reports incompatible formats, and attempts to insert an auto scale filter, which obviously fails.
Thanks! Just found that priceless info šš»
scale_cuda
is working fine here:
ffmpeg -hwaccel cuda -hwaccel_output_format cuda -i "%INPUT%" -vf scale_cuda=w=1920:h=-1,hwupload=derive_device=vulkan,hflip_vulkan,hwupload=derive_device=cuda -c:v hevc_nvenc ...
But it's right that by providing -pix_fmt p010le
to that command (or by using libplacebo as filter) throws the formats/scale issue, which drove me crazy as well.
[format @ 000001e70801ba80] auto-inserting filter 'auto_scale_0' between the filter 'Parsed_hwupload_3' and the filter 'format'
Impossible to convert between the formats supported by the filter 'Parsed_hwupload_3' and the filter 'auto_scale_0'
[vf#0:0 @ 000001e708130180] Error reinitializing filters!
Failed to inject frame into filter network: Function not implemented
If your only goal is to convert 8bit content to 10bit, you can do that with scale_cuda by itself.
-vf scale_cuda=w=1920:h=-1:format=p010
Not exactly, i simplified the commands to post them here, my current goal is just downsampling 2160p to 1080p.
Long story short, a few years back i did own a GTX970M and been using FFmpeg alone for 2160p+10bit -> 1080p+8bit conversion with zscale + tin=smpte2084:etc.. + tonemap=hable
, later on found about the libplacebo project and its tonemapping=hable
alternative which did perform much better than FFmpeg (from ~10fps to ~30fps or such IIRC)
When i brought a few months back this laptop with a RTX A5500 i continued using the same commands, i also got a new TV capable of playing 10bits so there was no longer a need for the 8bit conversion, but even so continued using libplacebo as i understood was the way to preserve the "sei side-data" and such.. but yeah, i just found thanks to you that i could use FFmpeg alone again and it even outperforms what i was getting previously.
This is the full command i am using, and i am getting ~185fps with it, pretty awesome!
ffmpeg.exe -hwaccel cuda -hwaccel_output_format cuda -extra_hw_frames 3 -c:v hevc_cuvid -crop 188x188x0x0 -i "%INPUT%" -vf "scale_cuda=w=1920:h=-1:interp_algo=lanczos" -c:v hevc_nvenc -maxrate:v 3210k -bufsize:v 6M -preset:v p7 -rc:v vbr -tune:v hq -cq:v 8 -refs:v 16 -bf:v 3 -rc-lookahead:v 32 -g 240 -spatial_aq:v 1 -aq-strength:v 6 -tier high -af ... -v verbose "%OUTPUT%"
Well, whenever that format/scale filter issue is fixed we'll see if libplacebo does by any chance still provides even better performance using today's FFmpeg, but i'm quite happy already!
Thanks guys for your time and help, really appreciated it.
PS: There seems to be another issue while using the -crop
option, it does not happens always, only with certain files, causing playback to go back a few milliseconds with each (key)frame (sorry dunno the terminology to explain it properly) like if invalid dts/pts were injected in the output stream - but this is fully offtopic here now, just thought on mentioning it.
ffmpeg -init_hw_device vulkan=vk,disable_multiplane=1 -filter_hw_device vk -hwaccel nvdec -hwaccel_output_format cuda -i INPUT -vf hwupload=derive_device=vulkan,libplacebo,hwupload=derive_device=cuda -c:v hevc_nvenc OUTPUT
Does that actually work for you? It fails for me, which I'd expect - the initialised vulkan device isn't related to the cuda device used by nvdec so it's impossible to derive_device between them.
Does that actually work for you? It fails for me, which I'd expect - the initialised vulkan device isn't related to the cuda device used by nvdec so it's impossible to derive_device between them.
Yes, using the 536.40 driver
sigh I forgot to make sure it was creating the vulkan device on my nvidia GPU so it was using the Intel one...
After fixing that, it did indeed work. Thanks for the tip!
Something rather interesting is happening when i run that command here:
ffmpeg version N-111339-g36fae75d28-gd694c25b44+1 Copyright (c) 2000-2023 the FFmpeg developers
built with gcc 13.1.0 (Rev7, Built by MSYS2 project)
...
...
[AVHWDeviceContext @ 000002678877c340] cu->cuExternalMemoryGetMappedMipmappedArray(&dst_int->cu_mma[i], dst_int->ext_mem[i], &tex_desc) failed -> CUDA_ERROR_OUT_OF_MEMORY: out of memory
[hwupload @ 00000267ea6057c0] Failed to upload frame: -542398533.
[vf#0:0 @ 000002678881ff80] Error while filtering: Generic error in an external library
Failed to inject frame into filter network: Generic error in an external library
[vist#0:0/hevc @ 0000026788824c00] Decoder thread received EOF packet
[vist#0:0/hevc @ 0000026788824c00] Decoder returned EOF, finishing
[vist#0:0/hevc @ 0000026788824c00] Terminating decoder thread
Generic error in an external library
Bro, i do have 16GB š
Thanks for the tip in any case, yep.
PS: There seems to be another issue while using the -crop option, ....
I think i just found what broke this, seems it's from commit https://github.com/FFmpeg/FFmpeg/commit/402d98c9d467dff6931d906ebb732b9a00334e0b
(Sorry for hijacking with this here, but tried to post a comment on that commit and Github keeps giving me a 422
error)
i.e. when not using -surfaces
or by using -surfaces 5
the output video does have a trembling / shaking effect, while when specifying -surfaces 6
(or any higher value) it does not happens.
When it does happens, i can see the following kind of errors/warnings printed after a while:
[vost#0:0/hevc_nvenc @ 000001b9c9273a80] *** dropping frame 8559 at ts 8618
[vost#0:0/hevc_nvenc @ 000001b9c9273a80] *** dropping frame 8559 at ts 8620
[vost#0:0/hevc_nvenc @ 000001b9c9273a80] *** dropping frame 8559 at ts 8621
[vost#0:0/hevc_nvenc @ 000001b9c9273a80] *** dropping frame 8559 at ts 8622
[matroska @ 000001b9c92decc0] Invalid DTS: 356083 PTS: 356042 in output stream 0:0, replacing by guess
[matroska @ 000001b9c92decc0] Invalid DTS: 357917 PTS: 357875 in output stream 0:0, replacing by guess
[matroska @ 000001b9c92decc0] Invalid DTS: 358833 PTS: 358792 in output stream 0:0, replacing by guess
/cc @rarzumanyan (details about FFmpeg version / options / hardware used on https://github.com/haasn/libplacebo/issues/178#issuecomment-1616585078)
Well, with -surfaces 6
the issue does not happens, but i am now realizing a color-space problem in some scenes.
(left: down-sampled to 1080p, right: original 2160p)
edit: oddly enough, this has nothing to do with -c:v hevc_cuvid
or -crop
... it does happens always with this very specific input file... please let me know a better place to report this if needed š
edit2: this seems a very old bug... tried a Zeranoe build (git-2020-08-31-4a11a6f
) and it's also happening with it...
edit3: re-encoding in two steps using pipe:
and -tune lossless
as in the following command does fixes it (dunno if this will be helpful to anyone here, will probably end up removing this comment, sorry for bothering)
ffmpeg.exe -hwaccel cuda -i "%INPUT%" -c:v hevc_nvenc -pix_fmt yuv420p10le -tune lossless -c:a copy -map 0:v:0 -map 0:a? -v quiet -f matroska pipe: | ffmpeg.exe -hwaccel cuda -hwaccel_output_format cuda -extra_hw_frames 16 -c:v hevc_cuvid -crop 188x188x0x0 -surfaces 8 -f matroska -i pipe: -vf "scale_cuda=w=1920:h=-1:interp_algo=lanczos" -c:v hevc_nvenc ....
I haven't had any luck on my 6700XT, I just keep running out of VRAM...
I'm trying ffmpeg -y -i "input6.mp4" -init_hw_device vulkan -c:v hevc_amf -rc 0 -qp_p 19 -qp_i 19 -c:a copy -c:s mov_text -metadata:s:s:0 language=English -disposition:s:s:0 forced -vf format=yuv420p,hwupload,libplacebo=w=2560:h=1440:upscaler=ewa_lanczos:custom_shader_path=shaders/Anime4K_ModeA.glsl,hwdownload,format=yuv420p "output6.mp4"
But even if I get rid of all the upscaling/format/shaders/hwupload/download it still happens. I was able to do a few files successfully but most just go all the way to 12GB VRAM until it fails. EDIT: In case you're wondering it happens when doing CPU encoding too.
Hello there,
Looks like there have been some big changes recently, apparently since the time libplacebo started to require vulkan 1.3.0 and FFmpeg adapted accordingly to not restrict it to version 1.2.0 and such.
I've been fighting for the past few days trying to get my command(s) to work like they used to.
i.e. up until:
This used to work fine:
as per today's builds, i.e.
scale_cuda
is no longer working.hwupload
any longer.(ffmpeg does segfault when using that)
The more close command i got this working again with is the following:
However, i do get 70-90fps while previously it was 160-180fps
I've also tried to initialize vulkan directly with...
...which does speed it up to 130fps, but all i get back are green frames, and countless of those warnings printed:
I'm pretty sure i am doing something wrong here and already feeling i am losing my mind, so i will really appreciate some guidance/advice on how to get this back working at full speed šš»