Dataset of Perceived Intoxicated Faces for Drunk Person Identification
This repository contains the code for the deep learning models in the paper Dataset of Perceived Intoxicated Faces for Drunk Person Identification by Vineet Mehta, Devendra Pratap Yadav, Sai Srinadhu Katta and Abhinav Dhall.
This repository contains the code for the feature extraction and the experiments. (We cannot provide support for programming issues, thanks for your understanding). For dataset sharing and creation, check our website-https://sites.google.com/view/difproject/home
Major libraries used
- Common Libraires
- tqdm
- numpy
- opencv
- matplotlib
- Specific Libraries
- Audio
- keras
- openSMILE- Command line- https://audeering.github.io/opensmile/get-started.html
- CNN_RNN
- keras
- 3D CNN and variants
- Pytorch
Files/Folders description
- Audio_models - This folder contains the code for the audio feature extraction and training code for all audio models.
- CNN_RNN - This folder contains the code for the visual feature extraction and the training code for the CNN_RNN models.
- 3D_CNN_and_variants - This folder contains the code for the training of 3D CNN models and it's various variants.
- split.csv - Data split used in the experiments.
- test.ipynb - This jupyter notebook contains the testing code for the best models and the ensemble startegy as discussed in the paper. The final hyperparameter setting and the model configuration can be seen in this notebook.
- 3D_pred.csv - This file contains the predictions by the best 3D CNN varaints.