dagongji10 closed this issue 3 years ago
Yes, they should be different.
CTW1500 uses max_len: 100 because it is annotated at the text-line level, i.e., one text instance may contain many characters. The BezierAlign sampling grid is also different: for CTW1500 it is (8,128), while for TotalText it is (8,32).
@Yuliang-Liu thanks so much.
I have tried ABCNet with two pre-trained models, `ctw1500_attn_R_50.pth` and `tt_e2e_attn_R_50.pth`, but I found that the two `.pth` files differ in size: `ctw1500_attn_R_50.pth` is 282M, but `tt_e2e_attn_R_50.pth` is 142M. So, when ABCNet is trained on CTW1500 and on TotalText, is the network different? Why don't the models have the same size?
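
One way to check where two checkpoints differ is to compare the shapes of their stored tensors. The following is only a rough sketch: the helper names (`load_shapes`, `diff_shapes`, `count_params`) are made up for illustration, and it assumes the weights are stored under a `"model"` key, falling back to the top-level dictionary otherwise. A checkpoint may also hold entries other than weights, which this sketch does not account for when explaining file size.

```python
from math import prod


def load_shapes(path):
    """Return {entry name: shape tuple} for the tensors in a checkpoint file."""
    import torch  # imported lazily so the helpers below work without PyTorch

    # Recent PyTorch versions may need weights_only=False here if the file
    # stores objects other than tensors.
    ckpt = torch.load(path, map_location="cpu")
    # Assumption: weights sit under a "model" key; otherwise use the dict itself.
    state = ckpt.get("model", ckpt) if isinstance(ckpt, dict) else ckpt
    return {k: tuple(v.shape) for k, v in state.items() if hasattr(v, "shape")}


def diff_shapes(a, b):
    """List (name, shape_in_a, shape_in_b) for entries that differ or are missing."""
    names = sorted(set(a) | set(b))
    return [(n, a.get(n), b.get(n)) for n in names if a.get(n) != b.get(n)]


def count_params(shapes):
    """Total number of scalar values across all listed shapes."""
    return sum(prod(s) for s in shapes.values())


# Usage (file names taken from the question above):
# ctw = load_shapes("ctw1500_attn_R_50.pth")
# tt = load_shapes("tt_e2e_attn_R_50.pth")
# print(count_params(ctw), count_params(tt))
# for name, s_ctw, s_tt in diff_shapes(ctw, tt):
#     print(name, s_ctw, s_tt)
```

Any entry whose shape depends on the BezierAlign grid or on max_len would show up in the `diff_shapes` output, which makes it easier to see how much of the size gap comes from the network itself.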