Closed zackerthescar closed 1 year ago
woo! We got our first arm commit! thank you @zackerthescar
Yes, the performance benefits may be related to some scheduling issues. Generally speaking, though, it reaches 30~150 fps at 1080p, which is pretty usable 👍
This last function seems to take longer than the C version. I don't know why, but I'm not exactly willing to work more on this for now; I'd rather spend time on the x86 deblocker.
yes go ahead and continue working on x86. We can check arm later. thank you
@zackerthescar, I plan to send v3 for upstream review. It's C code only. Could you help test the C-only build on the M1 Pro again, using https://github.com/ffvvc/FFmpeg/wiki#performance-data ? Thank you
@nuomi2021 Sorry for the late reply
Compiled the current 577acde commit with `./configure --enable-gpl --enable-debug --disable-asm`
Performance on M1 Pro MacBook Pro (6x Performance; 2x Efficiency cores) on macOS 13.4.1:

Clip | FPS
---|---
Chimera_8bit_1080P_1000_frames.vvc | 153.3
RitualDance_1920x1080_60_10_420_32_LD.266 | 105.0
BQTerrace_1920x1080_60_10_420_22_RA.vvc | 58.7
Tango2_3840x2160_60_10_420_27_LD.266 | 21.7
RitualDance_1920x1080_60_10_420_37_RA.266 | 133.0
NovosobornayaSquare_1920x1080.bin | 150.3
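
For anyone wanting to reproduce FPS figures like these, here is a rough sketch of how a decode-only run could be set up and turned into a frames-per-second number. It is not necessarily the procedure from the wiki page. The helper names `decode_command` and `fps` are hypothetical, as is the `./ffmpeg` binary path. The sketch assumes decoding to the null muxer (`-f null -`) with ffmpeg's `-benchmark` flag is an acceptable way to time the decoder.

```python
def decode_command(clip, ffmpeg="./ffmpeg"):
    """Build an ffmpeg command line that decodes `clip` and discards the output.

    Assumption: `./ffmpeg` is the freshly built binary in the source tree.
    """
    # "-f null -" decodes every frame but writes nothing, so the run mostly
    # measures decoding; "-benchmark" makes ffmpeg print timing at the end.
    return [ffmpeg, "-hide_banner", "-benchmark", "-i", clip, "-f", "null", "-"]


def fps(frames, seconds):
    """Frames per second for `frames` decoded in `seconds` of wall-clock time."""
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    return frames / seconds


if __name__ == "__main__":
    # Hypothetical numbers for illustration only, not a measurement.
    print(" ".join(decode_command("Chimera_8bit_1080P_1000_frames.vvc")))
    print(f"{fps(1000, 8.0):.1f} fps")
```

The command list can be passed to `subprocess.run` and timed with `time.perf_counter()`. The frame count has to come from the clip itself (for example, 1000 for the Chimera clip, going by its file name).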
I recently got an M1 Pro MacBook Pro (6x Performance Cores, 2x Efficiency Cores). As a distraction and exercise, I ported the HEVC SAO NEON code to VVC.
This last function seems to take longer than the C version. I don't know why, but I'm not exactly willing to work more on this for now; I'd rather spend time on the x86 deblocker.
Benchmark on M1 Pro (6x Performance Cores, 2x Efficiency Cores)
These numbers seem a little too good to be true, but I ran the benchmark again and I got roughly the same results.