Closed by gkiril 6 years ago
Thanks for reporting this. It could be a difference in Theano versions. It looks like they changed the API in version 1.0: http://deeplearning.net/software/theano/library/sandbox/rng_mrg.html
As a quick fix, could you please try removing the "use_cuda" argument? No guarantee for this though.
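If editing the call by hand is not convenient, a small compatibility helper can decide at runtime whether to pass the argument. This is only a sketch: `build_rng` and the two stand-in classes are hypothetical names, not part of deep_srl or Theano. It also assumes Python 3 (for `inspect.signature`) and that the only difference between versions is whether the constructor accepts `use_cuda`.

```python
import inspect


def build_rng(rng_class, seed, use_cuda=None):
    """Construct rng_class, passing use_cuda only if the constructor accepts it.

    Hypothetical helper for illustration; not part of deep_srl or Theano.
    """
    params = inspect.signature(rng_class).parameters
    kwargs = {"seed": seed}
    if "use_cuda" in params:
        kwargs["use_cuda"] = use_cuda
    return rng_class(**kwargs)


# Stand-ins for the two constructor shapes (assumed, for demonstration only).
class OldStyleStreams:
    def __init__(self, seed=12345, use_cuda=None):
        self.seed = seed
        self.use_cuda = use_cuda


class NewStyleStreams:
    def __init__(self, seed=12345):
        self.seed = seed


old_rng = build_rng(OldStyleStreams, 7, use_cuda=True)  # use_cuda is forwarded
new_rng = build_rng(NewStyleStreams, 7, use_cuda=True)  # use_cuda is dropped
```

With Theano installed, the same helper could be called with `MRG_RandomStreams` from `theano.sandbox.rng_mrg` in place of the stand-in classes. Whether that is enough to make the model run correctly is not confirmed here.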
I'm a bit busy lately, but will look into it when I have more time.
Thanks for your quick answer.
I tried removing the use_cuda argument. It seems to be working, though a warning is shown on the console:
deep_srl/python/neural_srl/theano/tagger.py:94: UserWarning: theano.function was asked to create a function computing outputs given certain inputs, but the provided input variable at index 2 is not part of the computational graph needed to compute the outputs: <TensorType(int8, scalar)>.
To make this warning into an error, you can pass the parameter on_unused_input='raise' to theano.function. To disable it completely, use on_unused_input='ignore'.
givens=({self.is_train: numpy.cast['int8'](0)}))
Suggestion: could be helpful for other people if this note is added to the README.
Cheers!
Hi Luheng,
Approximately how long did it take to train conll05_model and conll05_propid_model? What hardware did you use for training? Or what would be the ideal hardware for training deep_srl?
Thanks and regards, Rakesh Malviya
I used a Titan X GPU. For the propid model it's about an hour (or less?). For conll05_model it's about a week, but it gets pretty good results after about 24h. Compiling the 8-layer model for the first time (if you use the FAST_RUN option) takes about 8 hours due to the variational dropout layer.
I was following the instructions in the README file. However, when I try to run the interactive console with
python python/interactive.py --model conll05_model/ --pidmodel conll05_propid_model
, I get the following error:
This is probably some Theano issue (although I already installed it as suggested in your tutorial).
Any idea of how this can be fixed?