Closed dungtn closed 6 years ago
Hi @glample,
I'm trying to overfit the LSTM without the CRF layer on a small dataset but the dev/test scores was about ~2%. I used the --crf 0 option, do I need to take any further steps?
What's your learning rate? Try use adadelta instead of sgd.
It's because I didn't use the pre-trained skip100.
Hi @glample,
I'm trying to overfit the LSTM without the CRF layer on a small dataset but the dev/test scores was about ~2%. I used the --crf 0 option, do I need to take any further steps?