rizkiarm / LipNet

Keras implementation of 'LipNet: End-to-End Sentence-level Lipreading'
MIT License
638 stars 226 forks source link

How to do real-time inference? #59

Open harryxu-yscz opened 6 years ago

harryxu-yscz commented 6 years ago

Thanks for this awesome repo! I want to test Lipnet on real-time video, i.e. Given a stream of image frames from the video (at 25 fps using the provided model) and output results at real time. However, am not sure how to do it.

I thought about splitting the incoming frames into mutually exclusive chunks, and generating outputs from each chunk. However, this approach may suffer when chunks end in the middle of utterance. Any suggestion?

HamdiKaptan commented 5 years ago

me too

pinest94 commented 5 years ago

me too

purpletpig commented 5 years ago

me too

chahatagarwal commented 4 years ago

Thanks for this awesome repo! I want to test Lipnet on real-time video, i.e. Given a stream of image frames from the video (at 25 fps using the provided model) and output results at real time. However, am not sure how to do it.

I thought about splitting the incoming frames into mutually exclusive chunks, and generating outputs from each chunk. However, this approach may suffer when chunks end in the middle of utterance. Any suggestion?

Did you achieve ?