dynamic vs. static AU prediction

TadasBaltrusaitis / OpenFace

OpenFace: a state-of-the-art tool intended for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation.


The calibration step assumes that the most common expression in the sequence is neutral, or close to it (which generally holds, since people are non-expressive more often than expressive). From this assumption it computes a "neutral frame", which is then subtracted from every other frame, and the prediction is based on the result. If a person holds the same expression throughout the sequence, the algorithm will mistake that expression for neutral, which harms performance.
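To make the idea concrete, here is a minimal sketch of neutral-frame subtraction. It is not OpenFace's actual code: the function name `subtract_neutral`, the toy feature values, and the choice of a per-feature median as the estimate of the "most common" expression are all assumptions made for illustration.

```python
import numpy as np

def subtract_neutral(features):
    """Estimate a neutral frame and subtract it from every frame.

    features: array of shape (n_frames, n_features), one row per frame.
    The per-feature median is used here as a stand-in for the "most
    common expression"; this choice is an assumption, not taken from
    the OpenFace source.
    """
    features = np.asarray(features, dtype=float)
    neutral = np.median(features, axis=0)
    return features - neutral, neutral

# Toy sequence: three neutral frames, then two frames that each
# deviate in one (hypothetical) feature.
sequence = np.array([
    [1.0, 2.0],
    [1.0, 2.0],
    [1.0, 2.0],
    [4.0, 2.0],
    [1.0, 5.0],
])
normalized, neutral = subtract_neutral(sequence)

# Failure case: the same expression is held for the whole sequence,
# so it is taken as neutral and every frame normalizes to zero.
held = np.tile([4.0, 2.0], (5, 1))
held_normalized, held_neutral = subtract_neutral(held)
```

In the first sequence the expressive frames stand out after subtraction, while in the second the held expression disappears entirely, which is the failure mode described above.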

The disadvantage of static prediction is that everyone expresses AUs slightly differently, so having a neutral expression to subtract is very beneficial for predicting certain AUs (but not all). For more details, see Table 10 in the OpenFace paper, which compares the performance of static and dynamic models.

dynamic vs. static AU prediction #940