Closed m0ckingbird23 closed 5 years ago
@lylk23 Did you use the same version of tensorflow in requirement.txt. Seems you met the error when you try to get next batch data:)
no i am using tf 1.10.0 i should use the tf 1.12.0 ??
@lylk23 You may upgrade your Tensorflow and check if the problem still exist:)
well i just did and this is what i get now:
I0701 12:26:13.678080 6510 train_shadownet.py:572] Use single gpu to train the model
2019-07-01 12:26:16.445877: I tensorflow/core/platform/cpu_feature_guard.cc:141] Your CPU supports instructions that this TensorFlow binary was not compiled to use: SSE4.1 SSE4.2 AVX AVX2 FMA
2019-07-01 12:26:16.595548: I tensorflow/stream_executor/cuda/cuda_gpu_executor.cc:964] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero
2019-07-01 12:26:16.596396: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1432] Found device 0 with properties:
name: GeForce GTX 780M major: 3 minor: 0 memoryClockRate(GHz): 0.797
pciBusID: 0000:01:00.0
totalMemory: 3.94GiB freeMemory: 3.61GiB
2019-07-01 12:26:16.596419: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1511] Adding visible gpu devices: 0
Traceback (most recent call last):
File "tools/train_shadownet.py", line 578, in <module>
need_decode=args.decode_outputs
File "tools/train_shadownet.py", line 260, in train_shadownet
sess = tf.Session(config=sess_config)
File "/home/khalyl/anaconda3/envs/myenv/lib/python3.6/site-packages/tensorflow/python/client/session.py", line 1551, in __init__
super(Session, self).__init__(target, graph, config=config)
File "/home/khalyl/anaconda3/envs/myenv/lib/python3.6/site-packages/tensorflow/python/client/session.py", line 676, in __init__
self._session = tf_session.TF_NewSessionRef(self._graph._c_graph, opts)
tensorflow.python.framework.errors_impl.InternalError: cudaGetDevice() failed. Status: CUDA driver version is insufficient for CUDA runtime version
@lylk23 Reinstall your CUDA driver. You may google how to install it:)
the problem is that i cant upgrade my CUDA driver cuz it wont be compatible with my graphic card.
@lylk23 Maybe you need to upgrade your hard device or find another dataset input solution with tensorflow 1.10:)
Hello , i tried to re-tain the model using the English dataset (synth90k), after converting the data to the tensorflow records and after launching the training , i got this message :
i was wondering what's wrong i didn't change anything , is it a CPU issue or tensorflow one? and do you have any idea how can i fix it ?