Closed pajkossy closed 8 years ago
I changed line 138 of word_tagger_dataset.py (from if len(word) < 3: continue to if len(word) < 1: continue). When trying to train on the resulting dataset I got the error below (training does work with the < 2 constraint).
Traceback (most recent call last):
File "hunvec/seqtag/trainer.py", line 123, in
HINT: Re-running with most Theano optimization disabled could give you a back-trace of when this node was created. This can be done with by setting the Theano flag 'optimizer=fast_compile'. If that does not work, Theano optimizations can be disabled with 'optimizer=None'.
HINT: Use the Theano flag 'exception_verbosity=high' for a debugprint and storage map footprint of this apply node.
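For anyone reproducing this, the two flags named in the hints can be passed through the THEANO_FLAGS environment variable, which Theano reads at import time. A minimal sketch follows; the helper name with_debug_flags is made up for illustration and is not part of hunvec or Theano.

```python
import os


def with_debug_flags(existing=""):
    """Append the two flags from the HINT lines to a THEANO_FLAGS string.

    `with_debug_flags` is a hypothetical helper, not a Theano or hunvec API.
    """
    flags = [f for f in existing.split(",") if f]
    flags += ["optimizer=fast_compile", "exception_verbosity=high"]
    return ",".join(flags)


# Must run before the first `import theano`, since the flags are read on import.
os.environ["THEANO_FLAGS"] = with_debug_flags(os.environ.get("THEANO_FLAGS", ""))
```

If fast_compile does not give a usable back-trace, the hint suggests swapping in optimizer=None instead.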
I think https://github.com/Theano/Theano/issues/3276 will solve the issue, so only an update of Theano should be needed, but I am still testing...
solved in #102
Currently, very short sentences are dropped while preparing datasets. Even if training is not possible with them, the test data should still be allowed to contain them, so that test results are reliable.
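A rough sketch of what this could look like: apply the minimum-length filter to the training split only and leave the test split untouched. The function names, the min_train_len parameter, and the list-of-token-lists representation are assumptions for illustration, not the actual code in word_tagger_dataset.py.

```python
def filter_short(sentences, min_len):
    """Keep only sentences with at least `min_len` tokens."""
    return [s for s in sentences if len(s) >= min_len]


def prepare_splits(train, test, min_train_len=3):
    """Drop short sentences from the training data only.

    Hypothetical helper: the test split is returned unfiltered so that
    evaluation also covers very short sentences.
    """
    return filter_short(train, min_train_len), list(test)


# Toy data: each sentence is a list of tokens.
train = [["a"], ["a", "b"], ["a", "b", "c"]]
test = [["x"], ["x", "y", "z"]]
train_kept, test_kept = prepare_splits(train, test)
```

Here the default of 3 mirrors the original < 3 check; whether the model can actually predict on shorter test sentences would still need to be verified.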