I am trying to add additional characters (--sensitive option) by fine-tuning existing non-sensitive model
I am using None-VGG-BiLSTM-CTC (a.k.a CRNN) model, which had accuracy of 62.795% on IC15_2077 test set
I have removed final layer of the pre-trained state_dict() in order to solve size mismatch problem.
Also, I have frozen all layers except final fully connected layer in order to only train classifier layer.
Otherwise, all options are equal to the default settings.
I used ST_spe (one with special characters) dataset to fine_tune, validation loss (on IC15_2077), train loss and accuracy values do not improve. (I have tested to train until 100000 steps).
What might be the main problem here? Dataset Size? (1.5M though...) Freezing the layers?
Hello,
I am trying to add additional characters (--sensitive option) by fine-tuning existing non-sensitive model
I am using None-VGG-BiLSTM-CTC (a.k.a CRNN) model, which had accuracy of
62.795%
on IC15_2077 test setI have removed final layer of the pre-trained state_dict() in order to solve size mismatch problem. Also, I have frozen all layers except final fully connected layer in order to only train classifier layer. Otherwise, all options are equal to the default settings.
I used ST_spe (one with special characters) dataset to fine_tune, validation loss (on IC15_2077), train loss and accuracy values do not improve. (I have tested to train until 100000 steps).
What might be the main problem here? Dataset Size? (1.5M though...) Freezing the layers?
Following is my opt.txt
Following is one validation log from opt_train.txt