glample / tagger

Named Entity Recognition Tool
Apache License 2.0
1.16k stars 426 forks source link

How does the reported 90.93 reported? #70

Closed jiesutd closed 6 years ago

jiesutd commented 6 years ago

Thanks for sharing this nice work!

I am wondering how is the 90.93 reported? Is it the mean value of the system performance under different random seeds or the best result among several random seeds?

Thank you very much!

glample commented 6 years ago

Hi,

Yes, this is an average over several experiments using different random seeds. There was a bit of variance. In particular, the best results on the validation set were often very close, but the score on the test set associated to these best scores on the validation set were sometimes quite different (I guess the validation set was a bit small). Results on the test were oscillating between 90.4 and 91.3 or something like that.

jiesutd commented 6 years ago

Thank you very much!