Open alexvlis opened 6 years ago
Each GRID dataset video contains a full sentence. So if we want to do lip reading at the word level, we need to split each video into per-word clips.
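One possible way to do the split is to use the per-video word alignment files, sketched below. This assumes each alignment line has the form `start end word`, that dividing the times by 1000 gives a frame index in the 25 fps video, and that `sil` and `sp` mark silence. Please check these assumptions against your copy of the data. The names `parse_alignment` and `split_into_words` are made up for illustration, and `frames` stands for whatever sequence of decoded frames you already have.

```python
from typing import List, Sequence, Tuple

# Assumption: alignment time / 1000 = frame index in the 25 fps video.
TIME_UNITS_PER_FRAME = 1000
# Assumption: these tokens mark silence / short pauses, not words.
SILENCE_TOKENS = {"sil", "sp"}


def parse_alignment(text: str) -> List[Tuple[int, int, str]]:
    """Return (start_frame, end_frame, word) for each spoken word.

    end_frame is exclusive and rounded up so short words keep at least one frame.
    """
    segments = []
    for line in text.splitlines():
        parts = line.split()
        if len(parts) != 3:
            continue  # skip blank or malformed lines
        start, end, word = int(parts[0]), int(parts[1]), parts[2]
        if word in SILENCE_TOKENS:
            continue
        start_frame = start // TIME_UNITS_PER_FRAME
        end_frame = -(-end // TIME_UNITS_PER_FRAME)  # ceiling division
        segments.append((start_frame, end_frame, word))
    return segments


def split_into_words(frames: Sequence, segments: List[Tuple[int, int, str]]):
    """Slice a sequence of frames into (word, clip) pairs."""
    return [(word, frames[start:end]) for start, end, word in segments]


# Example with a made-up alignment snippet and integers standing in for frames.
sample_alignment = "0 23750 sil\n23750 29500 bin\n29500 34000 blue\n"
segments = parse_alignment(sample_alignment)
clips = split_into_words(list(range(75)), segments)
```

Adjacent clips can share a boundary frame here because start times are rounded down and end times are rounded up. If that overlap is unwanted, round both the same way.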