Open mungg opened 5 years ago
I met the same problem. I used the embedding "glove.6B.300d.txt" and tested the two versions of your code. One model is "chars_lstm_lstm_crf model ", the other is "sequence_tagging". From my results, the chars_lstm_lstm_crf model performs worse in the same condition. Is this common?
@mungg and @lanzhuzhu , Try using the 840B.300d vectors as well as the BIOES tagging scheme. You should be able to get good results (in average). Note that some runs perform better than others and the results I report on the README are averaged over 5 runs.
Thanks for your clear code and instruction! I run a code on CONLL2003 without modifying other hyper-parameter and structure, the F1 score of testb is about to 90.6 which is not reaching the your report(91.2) (at char CNN + LSTM+CRF) also, LSTM+CRF shows the 89.xx~ 90.0x. I got a little bit lower scores comparing with your reports. I was wondering if it needed to change the default hyper-parameter.
Could your provide some insight about this result?