chandraprakash5 opened this issue 8 years ago
I think the problem is that internally Keras tries to apply the mask to the loss, which no longer has a time dimension (since CTC returns a shape of (batch, 1)). You could add a layer at the end of your network that removes the mask (a layer that returns None in compute_mask).
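The mask-removing layer suggested above might look like the following minimal sketch. It is written against the tf.keras API (the thread predates tf.keras, so adapt the import for standalone Keras), and the `RemoveMask` name is mine:

```python
import tensorflow as tf

class RemoveMask(tf.keras.layers.Layer):
    """Identity layer that consumes the incoming mask.

    Placed after the last time-distributed layer, it stops Keras from
    trying to apply a (batch, time) mask to a (batch, 1) CTC loss.
    """
    def __init__(self, **kwargs):
        super().__init__(**kwargs)
        self.supports_masking = True  # accept an incoming mask

    def call(self, inputs, mask=None):
        return inputs  # pass activations through unchanged

    def compute_mask(self, inputs, mask=None):
        return None  # drop the mask for everything downstream
```

Appending `model.add(RemoveMask())` after the final `TimeDistributed(Dense(...))` keeps the activations intact while ensuring no mask reaches the loss.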
Alternatively, implement the CTC loss as a layer: use the mask to compute the activation sequence lengths, then discard the mask itself as described above.
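For that second route, the per-utterance input lengths that Keras's `ctc_batch_cost` expects can be recovered from the boolean mask by summing over the time axis. A framework-agnostic sketch in NumPy (the example mask and variable names are mine):

```python
import numpy as np

# Hypothetical mask as produced by a Masking layer: True where the
# frame is real, False where it is zero-padding.
mask = np.array([[True, True, True, False, False],
                 [True, True, True, True,  True]])

# Length of each utterance = number of unmasked time steps.
# keepdims gives the (batch, 1) shape that ctc_batch_cost expects.
input_length = mask.sum(axis=-1, keepdims=True).astype("int32")
print(input_length.ravel())  # -> [3 5]
```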
When a Masking layer is used for speech utterances of variable length, an input dimension mismatch error is thrown. The following is the edited model from test_keras.py that reproduces the error:
```python
from keras.models import Sequential
from keras.layers import Masking, LSTM, BatchNormalization, Dense, TimeDistributed

model = Sequential()
model.add(Masking(mask_value=0., input_shape=(frame_len, nb_feat)))
model.add(LSTM(inner_dim, return_sequences=True))
model.add(BatchNormalization())
model.add(TimeDistributed(Dense(nb_output)))
```
```
ValueError: GpuElemwise. Input dimension mis-match. Input 1 (indices start at 0) has shape[1] == 80, but the output's size on that axis is 16.
```
Please suggest how a Masking layer can be used together with CTC loss in Keras.