Combining EfficientNet and Vision Transformers

Code for Video Deepfake Detection model from "Combining EfficientNet and Vision Transformers for Video Deepfake Detection" presented at ICIAP 2021.

MIT License

207 stars 53 forks source link

Hi. I am not sure I got you question. If you are looking for a simpler deepfake detection approach, I can send you this one: https://github.com/selimsef/dfdc_deepfake_challenge This is the approach on which mine is based, anyway, almost all deepfake detection methods have a face detection/extraction step and then some other more sophisticated and customized parts like this and our other work (https://github.com/davide-coccomini/MINTIME-Multi-Identity-size-iNvariant-TIMEsformer-for-Video-Deepfake-Detection). So you will always find a similar preprocessing part in almost all works.

If you are facing some specific problems with our approach feel free to ask anything and I'll try to help!

davide-coccomini / Combining-EfficientNet-and-Vision-Transformers-for-Video-Deepfake-Detection