Guanghan / ROLO

ROLO is short for Recurrent YOLO, aimed at simultaneous object detection and tracking
Apache License 2.0
883 stars 278 forks source link

Ran out of memory #23

Open MansourTrabelsi opened 7 years ago

MansourTrabelsi commented 7 years ago

Hello everybody,

With my Ubuntu 14.04, Nvidia 740M, and Cuda 7.5

i run " python ./experiments/testing/ROLO_network_test_all.py " which caused the problem mentioned here. Any solutions proposed please. Thank you a lot.

___***xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx W tensorflow/core/common_runtime/bfc_allocator.cc:271] Ran out of memory trying to allocate 513.50MiB. See logs for memory state. W tensorflow/core/framework/op_kernel.cc:900] Resource exhausted: OOM when allocating tensor with shape[8204,16408] Traceback (most recent call last): File "./experiments/testing/ROLO_network_test_all.py", line 273, in main(' ') File "./experiments/testing/ROLO_network_test_all.py", line 269, in main ROLO_TF(argvs) File "./experiments/testing/ROLO_network_test_all.py", line 93, in init self.ROLO(argvs) File "./experiments/testing/ROLO_network_test_all.py", line 264, in ROLO self.testing(x_path, y_path) File "./experiments/testing/ROLO_network_test_all.py", line 154, in testing sess.run(init) File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/client/session.py", line 340, in run run_metadata_ptr) File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/client/session.py", line 564, in _run feed_dict_string, options, run_metadata) File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/client/session.py", line 637, in _do_run target_list, options, run_metadata) File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/client/session.py", line 659, in _do_call e.code) tensorflow.python.framework.errors.ResourceExhaustedError: OOM when allocating tensor with shape[8204,16408] [[Node: RNN/LSTMCell/W_0/Initializer/random_uniform/RandomUniform = RandomUniformT=DT_INT32, _class=["loc:@RNN/LSTMCell/W_0"], dtype=DT_FLOAT, seed=0, seed2=0, _device="/job:localhost/replica:0/task:0/gpu:0"]] Caused by op u'RNN/LSTMCell/W_0/Initializer/random_uniform/RandomUniform', defined at: File "./experiments/testing/ROLO_network_test_all.py", line 273, in main(' ') File "./experiments/testing/ROLO_network_test_all.py", line 269, in main ROLO_TF(argvs) File "./experiments/testing/ROLO_network_test_all.py", line 93, in init self.ROLO(argvs) File "./experiments/testing/ROLO_network_test_all.py", line 233, in ROLO self.build_networks() File "./experiments/testing/ROLO_network_test_all.py", line 125, in build_networks self.lstm_module = self.LSTM_single('lstm_test', self.x, self.istate, self.weights, self.biases) File "./experiments/testing/ROLO_network_test_all.py", line 108, in LSTM_single outputs, state = tf.nn.rnn(cell, [_X[step]], state) File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/ops/rnn.py", line 143, in rnn (output, state) = call_cell() File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/ops/rnn.py", line 136, in callcell = lambda: cell(input, state) File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/ops/rnn_cell.py", line 352, in call dtype, self._num_unit_shards) File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/ops/rnn_cell.py", line 216, in _get_concat_variable sharded_variable = _get_sharded_variable(name, shape, dtype, num_shards) File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/ops/rnn_cell.py", line 246, in _get_sharded_variable dtype=dtype)) File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/ops/variable_scope.py", line 339, in get_variable collections=collections) File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/ops/variable_scope.py", line 262, in get_variable collections=collections, caching_device=caching_device) File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/ops/variable_scope.py", line 158, in get_variable dtype=variable_dtype) File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/ops/variables.py", line 209, in init__ dtype=dtype) File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/ops/variables.py", line 275, in _init_from_args self._initial_value = ops.convert_to_tensor(initial_value(), File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/ops/variable_scope.py", line 149, in init_val = lambda: initializer(shape.as_list(), dtype=dtype) File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/ops/init_ops.py", line 200, in _initializer dtype, seed=seed) File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/ops/random_ops.py", line 182, in random_uniform seed2=seed2) File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/ops/gen_random_ops.py", line 96, in _random_uniform seed=seed, seed2=seed2, name=name) File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/ops/op_def_library.py", line 655, in apply_op op_def=op_def) File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/framework/ops.py", line 2154, in create_op original_op=self._default_original_op, op_def=op_def) File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/framework/ops.py", line 1154, in init self._traceback = _extract_stack()