Open LanWong1 opened 4 years ago
By default, the model recognizes padded square text images so that it can support arbitrary-shaped images. If you only want to recognize horizontal text, you can modify the input image shape and retrain the model.
Sorry, I did not describe my question clearly. By horizontal text, I mean horizontal text detected in a square image.
I got it! First crop the text region, then resize and pad it to 256, and finally send the resized image to the model. However, I think 256*256 is too large, and this may cause high time costs.
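For reference, the crop, resize and pad step could look roughly like the sketch below. It is only an illustration in plain Python, not the repository's preprocessing code: the function names `letterbox_geometry` and `resize_and_pad` are hypothetical, and the aspect-preserving resize, centered padding and zero fill value are all assumptions. A real pipeline would normally use an image library for the resize.

```python
def letterbox_geometry(width, height, target=256):
    """Return (new_w, new_h, pad_left, pad_top) for fitting a crop into a
    target x target square while keeping its aspect ratio (assumption)."""
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    # Scale so that the longer side becomes exactly `target`.
    scale = target / max(width, height)
    new_w = max(1, min(target, round(width * scale)))
    new_h = max(1, min(target, round(height * scale)))
    # Centered padding is an assumption; top-left padding is equally plausible.
    pad_left = (target - new_w) // 2
    pad_top = (target - new_h) // 2
    return new_w, new_h, pad_left, pad_top


def resize_and_pad(pixels, target=256, fill=0):
    """Nearest-neighbour resize of a 2-D list of pixel values (the cropped
    text region), then pad it to a target x target square."""
    height, width = len(pixels), len(pixels[0])
    new_w, new_h, pad_left, pad_top = letterbox_geometry(width, height, target)
    canvas = [[fill] * target for _ in range(target)]
    for y in range(new_h):
        src_y = min(height - 1, y * height // new_h)
        for x in range(new_w):
            src_x = min(width - 1, x * width // new_w)
            canvas[pad_top + y][pad_left + x] = pixels[src_y][src_x]
    return canvas
```

With these assumptions, a 512x128 crop of a horizontal text line is scaled to 256x64 and gets 96 rows of padding above and below it.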
I use a 256×256 image with CNN stride 8, so the feature map size is 32×32, which is able to recognize 32 characters in a horizontal or vertical line.
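The arithmetic behind that feature map size can be checked with a small sketch. The helper name `feature_map_side` is hypothetical, and it assumes the input side is an exact multiple of the total stride, with no extra border effects from the convolutions.

```python
def feature_map_side(input_side, stride):
    """Side length of the feature map for a square input, assuming the
    input side divides evenly by the total CNN stride."""
    if stride <= 0 or input_side % stride != 0:
        raise ValueError("input_side must be a positive multiple of stride")
    return input_side // stride


side = feature_map_side(256, 8)   # 256 / 8 = 32
positions = side * side           # number of spatial positions in the feature map
```

Under these assumptions a 256×256 input gives a 32×32 grid, i.e. 1024 feature positions per image, which is also what drives the time cost mentioned above.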
Hi @zhang0jhon: I think the 32*32 feature map size does not necessarily imply that the model can recognize 32 characters in a horizontal or vertical line. Based on this line, it only depends on seq_len, which is predefined.
Does the model work directly with a text image like this?