tzirakis / Multimodal-Emotion-Recognition

This repository contains the code for the paper `End-to-End Multimodal Emotion Recognition using Deep Neural Networks`.
http://ieeexplore.ieee.org/document/8070966/
BSD 3-Clause "New" or "Revised" License
238 stars 76 forks source link

How to preprocess a subject's video? #4

Open yaonaiming opened 5 years ago

yaonaiming commented 5 years ago

According to the paper, the cropped faces of subject's video are taken as input to the video network. My question is about the preprocessing pipeline. Which method or tool do you use to crop faces? How to align these faces? How to deal with the case where the facial landmark detection failed? In data_provider.py, a sample is composed by the [frame, audio_sample, lable, subject_id] tensors, but in data_generator.py, there is no way to compute the frame tensor.

She-yh commented 1 year ago

Hi! Now I have the same problem. Have you found the solution?