Closed penguinshin closed 8 years ago
if I recall correctly I think the previous train_from error was not an empty tensor error. Can you post your command line as well as the error it gives?
Thanks for responding, Yoon! I just looked at the previous post I made, and it was an empty tensor error.
Here is the new one. Keep in mind that my run_model2.py encodes all the command-line options.
Would I be able to send you this file over email?
Austins-MacBook-Pro:src2anno austinshin$ python run_model2.py
loading data...
done!
Source vocab size: 8814, Target vocab size: 15669
Source max sent len: 50, Target max sent len: 52
loading django-model_final1.t7...
Number of parameters: 14535669
/Users/austinshin/torch/install/bin/luajit: bad argument #1 to '?' (empty tensor at /Users/austinshin/torch/pkg/torch/generic/Tensor.c:888)
stack traceback:
[C]: at 0x0f9afad0
[C]: in function '__index'
/Users/austinshin/torch/install/share/lua/5.1/nn/MM.lua:51: in function 'updateGradInput'
...stinshin/torch/install/share/lua/5.1/nngraph/gmodule.lua:386: in function 'neteval'
...stinshin/torch/install/share/lua/5.1/nngraph/gmodule.lua:420: in function 'updateGradInput'
...stinshin/torch/install/share/lua/5.1/nngraph/gmodule.lua:386: in function 'neteval'
...stinshin/torch/install/share/lua/5.1/nngraph/gmodule.lua:420: in function 'updateGradInput'
/Users/austinshin/torch/install/share/lua/5.1/nn/Module.lua:31: in function 'backward'
train.lua:370: in function 'train_batch'
train.lua:479: in function 'train'
train.lua:646: in function 'main'
train.lua:649: in main chunk
[C]: in function 'dofile'
...shin/torch/install/lib/luarocks/rocks/trepl/scm-1/bin/th:145: in main chunk
[C]: at 0x010f647bc0
ah ok. yes, please send the file over email!
thanks!
On May 1, 2016, at 10:35 PM, Yoon Kim notifications@github.com wrote:
ah ok. yes, please send the file over email!
— You are receiving this because you authored the thread. Reply to this email directly or view it on GitHub https://github.com/harvardnlp/seq2seq-attn/issues/20#issuecomment-216094706
let me know if you get the file. im not sure if it sent
hmm i didn't get it...
here it is as a text file (run_model2.txt), just change the extension to .py
hmm can you add the following two lines and print out the log here?
line 616: print(opt.num_layers, model_opt.num_layers)
line 369: print(#drnn_state_dec)
Austins-MacBook-Pro:src2anno austinshin$ python run_model2.py
loading data...
done!
Source vocab size: 8814, Target vocab size: 15669
Source max sent len: 50, Target max sent len: 52
loading django-model_final1.t7...
2 2
Number of parameters: 14535669
5
/Users/austinshin/torch/install/bin/luajit: bad argument #1 to '?' (empty tensor at /Users/austinshin/torch/pkg/torch/generic/Tensor.c:888)
stack traceback:
huh, weird. that looks right. let's debug this in person tomorrow? i should be in my office most of the afternoon
sounds good. wheres your office?
MD208
I printed out decoder_input (line 370). Does this look right?
Austins-MacBook-Pro:src2anno austinshin$ python run_model2.py
loading data...
done!
Source vocab size: 8814, Target vocab size: 15669
Source max sent len: 50, Target max sent len: 52
loading django-model_final1.t7...
2 2
Number of parameters: 14535669
5
{
1 : LongTensor - size: 64
2 : DoubleTensor - size: 64x12x100
3 : DoubleTensor - size: 64x100
4 : DoubleTensor - size: 64x100
5 : DoubleTensor - size: 64x100
6 : DoubleTensor - size: 64x100
7 : DoubleTensor - size: 64x100
}
/Users/austinshin/torch/install/bin/luajit: bad argument #1 to '?' (empty tensor at /Users/austinshin/torch/pkg/torch/generic/Tensor.c:888)
stack traceback:
It’s odd, because when I start a new training run (without train_from) I get this, with no error:
{
1 : LongTensor - size: 64
2 : DoubleTensor - size: 64x6x100
3 : DoubleTensor - size: 64x100
4 : DoubleTensor - size: 64x100
5 : DoubleTensor - size: 64x100
6 : DoubleTensor - size: 64x100
7 : DoubleTensor - size: 64x100
}
So for some reason, when I use train_from, the second entry of decoder_input is 64x12x100, i.e. source_l is 12, whereas in normal training (without train_from) it is 64x6x100, i.e. source_l is 6. I think this might be the issue.
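For anyone hitting the same error, here is a rough sketch of the comparison described above, written in Python to match run_model2.py rather than in Lua. It is not part of seq2seq-attn: the helper name `check_decoder_input` is made up for illustration, the shapes are copied from the two printouts in this thread, and it assumes the second entry is laid out as batch x source_l x hidden size, which is how the printouts are being read here.

```python
# Hypothetical helper, NOT part of seq2seq-attn. It only compares shapes
# that were printed in this thread; it does not touch any Torch tensors.

def check_decoder_input(shapes, source_l, batch_size, hidden_size):
    """Return a list of human-readable shape problems (empty if none).

    shapes: list of tuples in the order printed by print(decoder_input).
    Assumption: shapes[1] is batch x source_l x hidden_size, and every
    entry from the third one on is batch x hidden_size.
    """
    def fmt(shape):
        return "x".join(str(d) for d in shape)

    problems = []
    expected_context = (batch_size, source_l, hidden_size)
    if shapes[1] != expected_context:
        problems.append(
            "entry 2 is %s, expected %s" % (fmt(shapes[1]), fmt(expected_context))
        )
    expected_state = (batch_size, hidden_size)
    for position, shape in enumerate(shapes[2:], start=3):
        if shape != expected_state:
            problems.append(
                "entry %d is %s, expected %s" % (position, fmt(shape), fmt(expected_state))
            )
    return problems


# Shapes copied from the printouts above.
fresh = [(64,), (64, 6, 100)] + [(64, 100)] * 5      # new training run
resumed = [(64,), (64, 12, 100)] + [(64, 100)] * 5   # run with train_from

for name, shapes in (("fresh", fresh), ("resumed", resumed)):
    issues = check_decoder_input(shapes, source_l=6, batch_size=64, hidden_size=100)
    print(name, issues if issues else "ok")
```

With a source length of 6, the fresh run passes and the resumed run is flagged on entry 2 only. This shows where the two runs differ; it does not by itself explain why train_from produces the longer context.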
Hi Yoon, I just stopped by your office. I'll be in MD for the next hour or so, let me know if you're here!
hey sorry im in my office now!
How did you fix this problem? I got the same one.
Hi, sorry for not replying! I am still having the same issue, where loading a model gives the empty tensor error. Are you still seeing this issue?