ttgeng233 / UnAV

Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline (CVPR 2023)
https://unav100.github.io
MIT License
54 stars 4 forks source link

Add temporal maxer code #4

Closed ed-fish closed 11 months ago

ed-fish commented 11 months ago

Add in temporal maxer and split code for future dev with audio data