lspvic / CopyNet

CopyNet Implementation with Tensorflow and nmt

ResourceExhaustedError (see above for traceback): OOM when allocating tensor with shape[100,32,367] #2

Open pyanh opened 6 years ago

pyanh commented 6 years ago

Hello,

When I set the parameters as follows:

--num_layers=1 --num_units=32 --share_vocab=True --copynet=True --gen_vocab_size=500

My GPU has 12206 MiB of memory, and training fails with this error:

ResourceExhaustedError (see above for traceback): OOM when allocating tensor with shape[100,32,367]
  [[Node: dynamic_seq2seq/decoder/decoder/while/BasicDecoderStep/einsum/transpose = Transpose[T=DT_FLOAT, Tperm=DT_INT32, _device="/job:localhost/replica:0/task:0/device:GPU:0"](dynamic_seq2seq/decoder/decoder/while/BasicDecoderStep/einsum/transpose/Enter, dynamic_seq2seq/decoder/decoder/while/BasicDecoderStep/einsum_4/transpose/perm)]]
  [[Node: dynamic_seq2seq/decoder/decoder/while/BasicDecoderStep/TrainingHelperNextInputs/All/_131 = _HostRecv[client_terminated=false, recv_device="/job:localhost/replica:0/task:0/device:CPU:0", send_device="/job:localhost/replica:0/task:0/device:GPU:0", send_device_incarnation=1, tensor_name="edge_219_dynamic_seq2seq/decoder/decoder/while/BasicDecoderStep/TrainingHelperNextInputs/All", tensor_type=DT_BOOL, _device="/job:localhost/replica:0/task:0/device:CPU:0"]()]]

How much memory is needed to run the CopyNet code?

mcjoshi commented 6 years ago

Try reducing the batch_size to 8 or 4. It should work.
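
For context on why a smaller batch helps, here is a rough sketch of how large the tensor named in the error is. It assumes 4 bytes per element, which matches the DT_FLOAT type in the traceback; `tensor_bytes` is a hypothetical helper written for this illustration and is not part of CopyNet or nmt.

```python
from math import prod


def tensor_bytes(shape, bytes_per_element=4):
    """Size in bytes of a dense tensor.

    bytes_per_element=4 assumes float32 (DT_FLOAT in the traceback).
    """
    return prod(shape) * bytes_per_element


# Shape copied from the error message in this issue.
oom_shape = (100, 32, 367)

size = tensor_bytes(oom_shape)
size_mib = size / 2**20

# GPU memory reported in this issue, in MiB.
gpu_mib = 12206

print(f"tensor size: {size} bytes ({size_mib:.2f} MiB)")
print(f"share of GPU memory: {100 * size_mib / gpu_mib:.3f}%")
```

The tensor is only a few MiB, so this single allocation is not what fills the GPU. The error more likely means the memory was already nearly exhausted by everything allocated before it, and this request was the one that failed. The node sits inside the decoder's `while` loop, and activation memory generally grows with batch size, so lowering batch_size is a reasonable first thing to try.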