cfeng16 / audio-visual-forensics

https://cfeng16.github.io/audio-visual-forensics/
MIT License
60 stars 5 forks

Training Code and Dataset Split #9

Open unintendedly opened 1 month ago

unintendedly commented 1 month ago

Thank you for your outstanding contribution and excellent work. I am very interested in it, but when testing on FakeAVCeleb I noticed significant discrepancies between my results and the reported ones. Could you provide the complete training code? Additionally, regarding the dataset partitioning, how are the forged videos sampled? What are the proportions of the different categories, such as forged video with forged audio versus forged video with real audio?

cfeng16 commented 1 month ago

Sorry, at the moment I don't have much time to clean up the training code, but can you describe the significant discrepancies more specifically? I am very curious and can help.

unintendedly commented 1 month ago

The AP for the RVFA category is 90.6, which is significantly higher than the 70.6 reported in the paper. However, the AUC for the RVFA category is 74.6, slightly lower than the 80.5 in the paper. The most puzzling part is that the AUC for the FVRA category is only 74.1, which is much lower than the 93.7 reported in the paper. I believe this result is quite abnormal and difficult to explain. Moreover, the 74.1 AUC for the FVRA category is the average over multiple experiments, so it is unlikely to be a coincidence.
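
For anyone comparing numbers like these, here is a minimal sketch of how AP and AUC respond to the real/fake ratio of the evaluation split. It is not the repository's evaluation code: the function names, labels, and scores below are made up for illustration, and fake is assumed to be the positive class. It only shows one general property, that AP shifts with the class ratio while AUC does not, which is one possible (unconfirmed) reason an AP can come out higher while the AUC stays put.

```python
# Illustrative only: toy labels and scores, not data from FakeAVCeleb.
# Assumption: label 1 = fake (positive class), higher score = more likely fake.

def roc_auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def average_precision(labels, scores):
    """AP as the mean of precision values at the rank of each positive.

    Ties are broken by input order here; library implementations may
    treat tied scores differently.
    """
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    hits, total = 0, 0.0
    for rank, i in enumerate(order, start=1):
        if labels[i] == 1:
            hits += 1
            total += hits / rank
    return total / hits

# Balanced split: 2 fake, 2 real.
balanced_labels = [1, 0, 1, 0]
balanced_scores = [0.9, 0.8, 0.6, 0.3]

# Same score distribution per class, but each fake sample appears 3 times.
skewed_labels = [1, 1, 1, 0, 1, 1, 1, 0]
skewed_scores = [0.9, 0.9, 0.9, 0.8, 0.6, 0.6, 0.6, 0.3]

auc_balanced = roc_auc(balanced_labels, balanced_scores)
auc_skewed = roc_auc(skewed_labels, skewed_scores)
ap_balanced = average_precision(balanced_labels, balanced_scores)
ap_skewed = average_precision(skewed_labels, skewed_scores)

print(f"AUC balanced={auc_balanced:.3f} skewed={auc_skewed:.3f}")
print(f"AP  balanced={ap_balanced:.3f} skewed={ap_skewed:.3f}")
```

In this toy case the AUC is identical for both splits while the AP rises as the share of fake samples grows, so the per-category real/fake proportions matter when comparing AP values across setups.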