DinoMan / speech-driven-animation


How to handle LRW dataset where speakers move significantly #22

Closed pcgreat closed 4 years ago

pcgreat commented 5 years ago

When handling datasets like GRID, where speakers barely move, it's easy to align the facial landmarks to fixed points during preprocessing (e.g. based on the landmarks of the first frame of the video). However, in datasets like LRW the speakers move significantly while talking, so aligning based on the first frame is meaningless. Aligning frame by frame is not a good choice either, as it makes the video jitter too much. So I wonder how you did the preprocessing for the LRW dataset?
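For context, the frame-by-frame alignment described above typically means estimating a similarity transform (scale, rotation, translation) from each frame's landmarks to a fixed template; any per-frame landmark noise then leaks into the warp, which is the source of the jitter. This is not the repository's code, just a generic least-squares (Umeyama) sketch:

```python
import numpy as np

def similarity_transform(src, dst):
    """Estimate a similarity transform mapping src landmarks (N, 2)
    onto dst landmarks (N, 2) by least squares (Umeyama's method).
    Returns (A, t) such that dst ~= src @ A.T + t."""
    src_mean = src.mean(axis=0)
    dst_mean = dst.mean(axis=0)
    src_c = src - src_mean
    dst_c = dst - dst_mean
    # Cross-covariance between centred point sets.
    cov = dst_c.T @ src_c / len(src)
    U, S, Vt = np.linalg.svd(cov)
    # Guard against reflections (keep det(R) = +1).
    d = np.sign(np.linalg.det(U @ Vt))
    D = np.diag([1.0, d])
    R = U @ D @ Vt
    scale = np.trace(np.diag(S) @ D) / src_c.var(axis=0).sum()
    t = dst_mean - scale * R @ src_mean
    return scale * R, t
```

Estimating this transform against the first frame's landmarks gives the GRID-style alignment; estimating it per frame against a canonical template gives the jittery LRW-style alignment the question describes.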

DinoMan commented 4 years ago

I did the preprocessing in the same way as for the other datasets (I use the face-processor library available on my GitHub). You are correct that LRW has jitter due to the way it was aligned, and unfortunately there is no easy way of removing it. I have tried to stabilise the videos, but this only helps a little. Because LRW has this jitter, the model will end up modelling it and you will see it (to some extent) in the generated videos. In the end, I'm afraid you have to live with a little jitter for the LRW model.
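One common way to attempt the stabilisation mentioned above is to smooth the landmark trajectories temporally before computing the per-frame alignment, so high-frequency detection noise doesn't drive the warp. This is not the face-processor implementation; `smooth_landmarks` and `window` are hypothetical names in a minimal moving-average sketch:

```python
import numpy as np

def smooth_landmarks(landmarks, window=5):
    """Temporally smooth a (T, N, 2) landmark sequence with a centred
    moving average to damp frame-to-frame jitter before alignment.
    An odd `window` keeps the average centred on each frame."""
    T = len(landmarks)
    half = window // 2
    out = np.empty_like(landmarks, dtype=float)
    for t in range(T):
        # Clamp the window at the sequence boundaries.
        lo, hi = max(0, t - half), min(T, t + half + 1)
        out[t] = landmarks[lo:hi].mean(axis=0)
    return out
```

As the reply notes, this only helps a little: smoothing removes detector noise but also lags genuine head motion, and any residual jitter in the training videos is still learned by the model.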