yzhangcs / parser

:rocket: State-of-the-art parsers for natural language.
https://parser.yzhang.site/
MIT License
829 stars 141 forks source link

wonder why not use the default LSTM #17

Closed allanj closed 4 years ago

allanj commented 4 years ago

It seems you build an LSTM from the scratch

yzhangcs commented 4 years ago

As described in the paper:

we drop nodes in the LSTM layers (input and recurrent connections), applying the same dropout mask at every recurrent timestep (cf. the Bayesian dropout of Gal & Ghahramani (2015));