maincold2 / FFNeRV

FFNeRV: Flow-Guided Frame-Wise Neural Representations for Videos
MIT License
80 stars 1 forks source link

Note on comparisons #1

Closed ngc5195bee closed 1 year ago

ngc5195bee commented 1 year ago

Hey. Great paper and exciting decode FPS results. However, I had two issues with your comparisons:

The work is suitably impressive despite it being quite a bit worse than SOTA neural codecs (they cannot run at nearly 60 FPS!)

maincold2 commented 1 year ago

1) Since we thought it is needless to say that the commercial codecs are capable of fast decoding through hardware acceleration, we just evaluated them with CPU as in the NeRV paper. We agree with you that it can't be claimed FFNeRV shows faster decoding than the standard codecs. We would like to clarify this in the paper. In terms of decoding speed, we want to emphasize that FFNeRV outperforms other neural codecs!

2) Although our curve slightly beats FVC, we didn't claim that our method is SOTA. We intended to show that our approach performs similarly to FVC, which is a recent and well-known neural codec. Please note that our method achieves SOTA compression performance among neural representations.

We appreciate your interest and thoughtful suggestions!

ngc5195bee commented 1 year ago

Thanks for the reply! I think this is valuable work and is likely an extension of the efficient frontier between performance and decode time. I hope you continue to improve

On reflection, I think your codec is really random-access. Comparisons on both HEVC and NN are performed vs low-delay. HEVC, H.264 experience a 30% BD rate gain in random-access mode. That said, very few people publish NN random-access.