Closed chaitjo closed 7 years ago
@zhangjcqq is the author of this demo.
The parameter configuration is provided for training the CoNLL dataset, while that training data set is not public. Therefore, we use its test part in this demo, just show how to re-implement the related paper's experiments. Actually, running this demo in multiple pass is less meaningful for that user can not obtain a practicable system without big training data set. The float exception is caused by numerical overflow which comes from operating system. Maybe, we need to provide a more friendly approach to deal with it in the future. Gradient clipping strategy and proper parameter such as initialized weights and learning rate are useful to avoid that exception.
I'll close this issue, if there is an update, please reopen it.
I'm trying out the demo for Semantic Role Labeling from here and ran into some problems while running training.
I set the number of passes as 500 in
train.sh
. Here are the parameters I used for training after downloading the data -I'm using a linux virtual machine with 2 GB ram.
I encountered the following error twice, each time on the 120th pass -
The entire
train.log
file can be found here.I believe I can still use the 120th model checkpoint but am yet to try it out.
Why am I unable to train beyond this point and how do I overcome this?