astorfi / lip-reading-deeplearning

:unlock: Lip Reading - Cross Audio-Visual Recognition using 3D Architectures
Apache License 2.0
1.84k stars 321 forks source link

error when changing the input size #14

Closed Perseus1993 closed 6 years ago

Perseus1993 commented 6 years ago

I changed the input image size into: 'mouth': np.random.random_sample(size=(num_training_samples, 9, 64, 64, 1))

then the error Negative dimension size caused by subtracting 5 from 3 for 'tower_0/mouth_cnn/fc5/fc5_1/convolution' (op: 'Conv3D') with input shapes: [?,9,3,3,128], [1,2,5,128,256]

How should I change the net structure to fix that? thank you

astorfi commented 6 years ago

You should have an architecture regarding your input shape. I do not know what architecture can fit your input but the architecture related to your visual network must be changed.

Perseus1993 commented 6 years ago

already fixed it, after looking into network structures

astorfi commented 6 years ago

OK cool