sogou / SogouMRCToolkit

This toolkit was designed for the fast and efficient development of modern machine comprehension models, including both published models and original prototypes.
Apache License 2.0
746 stars 162 forks source link

cudnn error #17

Open bigcat2333 opened 5 years ago

bigcat2333 commented 5 years ago

在运行bidaf_squadv2.py时,报出如下错误: tensorflow.python.framework.errors_impl.UnknownError: Failed to get convolution algorithm. This is probably because cuDNN failed to initialize, so try looking to see if a warning log message was printed above. [[node conv1d_1/conv1d/Conv2D (defined at G:\SMRCToolkit-master\sogou_mrc\nn\layers.py:115) = Conv2D[T=DT_FLOAT, data_format="NCHW", dilations=[1, 1, 1, 1], padding="VALID", strides=[1, 1, 1, 1], use_cudnn_on_gpu=true, _device="/job:localhost/replica:0/task:0/device:GPU:0"](gradients/conv1d_1/conv1d/Conv2D_grad/Conv2DBackpropFilter-0-TransposeNHWCToNCHW-LayoutOptimizer, conv1d_1/conv1d/ExpandDims_1)]] [[{{node add_18/_227}} = _Recv[client_terminated=false, recv_device="/job:localhost/replica:0/task:0/device:CPU:0", send_device="/job:localhost/replica:0/task:0/device:GPU:0", send_device_incarnation=1, tensor_name="edge_2587_add_18", tensor_type=DT_FLOAT, _device="/job:localhost/replica:0/task:0/device:CPU:0"]()]]

Caused by op 'conv1d_1/conv1d/Conv2D', defined at: File "F:\Users\ylwang\Anaconda3\envs\SMRCToolkit-master\lib\runpy.py", line 183, in _run_module_as_main mod_name, mod_spec, code = _get_module_details(mod_name, _Error) 使用tensorflow版本为tensorflow-gpu==1.12,cuda9.0,cudnn7.0;怀疑是cudnn版本过低所以重新安装cudnn7.5.0,但问题依然存在,请问一下这个问题该如何解决呢?谢谢!

libertatis commented 5 years ago

@bigcat2333 要不你看一下这个issue,也许会对你有帮助~ https://github.com/tensorflow/tensorflow/issues/24828