ControlNet / AV-Deepfake1M

[ACM MM] AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset
https://arxiv.org/abs/2311.15308
Other
69 stars 3 forks source link

asking the performance on AV-deepfake-1M #1

Closed isjwdu closed 8 months ago

isjwdu commented 9 months ago

Have the performance metrics shown in the paper (Table 6. Temporal deepfake localization benchmark) been trained on the av-deepfake-1m dataset? Or are they direct inference using the pre-trained models provided by the previous methods?

I tried using UMMAFormer for inference at AV-deepfake-1m, and even though it's only on a portion of the data, the performance is way behind what's shown in Table 6.

Thanks!