Split GRID videos into words and store result

danisbet / machine-lip-reading

Using an LSTM and 4d convolutional network for lip reading

12 stars 7 forks source link

Open alexvlis opened 6 years ago

alexvlis commented 6 years ago

GRID dataset videos are in sentences. So if we want to do lip reading for words, we need to split the videos into words.