jiesutd / LatticeLSTM

Chinese NER using Lattice LSTM. Code for ACL 2018 paper.
1.8k stars 453 forks source link

About regularization #42

Closed Kiwisher closed 5 years ago

Kiwisher commented 5 years ago

I read in the paper that you set a weight-decay in the optimizer, but I didn't see that term in your initialization of optimizer in main.py here. I wonder if I have skipped something or you really didn't set the regularization in your code? Thanks.

jiesutd commented 5 years ago

https://github.com/jiesutd/LatticeLSTM/blob/3f960348ea07b065afff653949046f09470332f4/main.py#L253

jiesutd commented 5 years ago

For the regularization, yes we found the regularization does not affect the results a lot, so we removed this part. As the dropout is already a good way to avoid overfitting.