Question about generating the .npy files for end-to-end audiovisual speech recognition

sailordiary / LipNet-PyTorch

"LipNet: End-to-End Sentence-level Lipreading" in PyTorch

BSD 3-Clause "New" or "Revised" License

65 stars, 20 forks

Closed: wanli4717 closed this issue 5 years ago

wanli4717 commented 5 years ago

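I am a graduate student at Tianjin University, currently trying to reproduce the experimental results of the paper "End-to-end Audiovisual Speech Recognition". I saw the comment you left on the author's GitHub repository, where you had found an error, and I would like to ask you how the .npy files for that paper are generated.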

sailordiary commented 5 years ago

Hi,

You can refer to https://github.com/mpc001/end-to-end-lipreading/issues/6#issuecomment-449582378 . The expected format is a NumPy array of shape [29, 96, 96]. If you want to save the videos as .npy files, just call np.save on the array of stacked frames (i.e. batch_feats).
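A minimal sketch of that last step, assuming each clip has already been reduced to 29 grayscale mouth crops of 96x96 pixels. The helper name `save_clip_as_npy`, the `uint8` dtype, the output file name, and the all-zero placeholder frames are illustrative assumptions, not taken from the linked repository; only the [29, 96, 96] shape, `np.save`, and the name `batch_feats` come from the reply above.

```python
import os
import tempfile

import numpy as np


def save_clip_as_npy(frames, out_path, expected_shape=(29, 96, 96)):
    """Stack per-frame crops into one array and write it with np.save.

    Hypothetical helper for illustration; `frames` is a sequence of
    2-D arrays, one per video frame.
    """
    # Stack along a new leading axis: 29 frames of (96, 96) -> (29, 96, 96).
    batch_feats = np.stack(frames, axis=0)
    if batch_feats.shape != expected_shape:
        raise ValueError(
            f"expected shape {expected_shape}, got {batch_feats.shape}"
        )
    np.save(out_path, batch_feats)
    return batch_feats


# Placeholder frames (assumption): real code would use the cropped mouth
# regions extracted from each video instead of zeros.
frames = [np.zeros((96, 96), dtype=np.uint8) for _ in range(29)]

out_path = os.path.join(tempfile.mkdtemp(), "example_clip.npy")
saved = save_clip_as_npy(frames, out_path)

# Reload to confirm the file round-trips with the expected shape.
loaded = np.load(out_path)
print(loaded.shape)
```

The shape check is there only to catch clips with the wrong number of frames or crop size before they are written to disk; how the frames are cropped and normalized is covered in the linked comment, not here.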