davide-coccomini / Combining-EfficientNet-and-Vision-Transformers-for-Video-Deepfake-Detection

Code for Video Deepfake Detection model from "Combining EfficientNet and Vision Transformers for Video Deepfake Detection" presented at ICIAP 2021.
https://dl.acm.org/doi/abs/10.1007/978-3-031-06433-3_19
MIT License
207 stars 53 forks source link

Combining EfficientNet and Vision Transformers #44

Closed TienLort closed 9 months ago

TienLort commented 1 year ago

Hello, I'm a university student, I'm currently researching the topic of deepfake detection. I also want to follow the direction of Combining-EfficientNet-and-Vision-Transformers like you but it seems it's quite difficult for you to detect. develop a simpler project than this one, can you give me a reference? thank you

davide-coccomini commented 1 year ago

Hi. I am not sure I got you question. If you are looking for a simpler deepfake detection approach, I can send you this one: https://github.com/selimsef/dfdc_deepfake_challenge This is the approach on which mine is based, anyway, almost all deepfake detection methods have a face detection/extraction step and then some other more sophisticated and customized parts like this and our other work (https://github.com/davide-coccomini/MINTIME-Multi-Identity-size-iNvariant-TIMEsformer-for-Video-Deepfake-Detection). So you will always find a similar preprocessing part in almost all works.

If you are facing some specific problems with our approach feel free to ask anything and I'll try to help!