oyxhust / CNN-LSTM-CTC-text-recognition

CNN and LSTM model for text recognition
259 stars 88 forks source link

mxnet the squeeze axis in your crnn model #14

Open 34153320 opened 6 years ago

34153320 commented 6 years ago

Hi, I looked into your code, in your crnn.py #132, wordvec shows with a squeeze axis =1. However, your data after flatten should be (batch_size, num_filters x reduced_width x reduced_height). Although the reduced_height =1, num_filters is 512 and you use a sequence_length=25. Only sequence_length equals to the second component in the shape parameters, it can use squeeze_axis =1. I am a little confused.... Thanks for your work. Appreciate!