BrikerMan / Kashgari

Kashgari is a production-level NLP transfer learning framework built on top of tf.keras for text labeling and text classification. It includes Word2Vec, BERT, and GPT2 language embeddings.
http://kashgari.readthedocs.io/
Apache License 2.0
2.39k stars 441 forks

[Question] Hello, running a model under CUDA 10 fails with "Fail to find the dnn implementation". Thanks. #168

Closed wyqnumber closed 5 years ago

wyqnumber commented 5 years ago

Epoch 1/20
2019-07-19 10:34:06.768267: I tensorflow/stream_executor/dso_loader.cc:152] successfully opened CUDA library libcublas.so.10.0 locally
2019-07-19 10:34:07.669769: E tensorflow/stream_executor/cuda/cuda_dnn.cc:334] Could not create cudnn handle: CUDNN_STATUS_INTERNAL_ERROR
2019-07-19 10:34:07.669826: W tensorflow/core/framework/op_kernel.cc:1401] OP_REQUIRES failed at cudnn_rnn_ops.cc:1217 : Unknown: Fail to find the dnn implementation.
2019-07-19 10:34:07.680118: E tensorflow/stream_executor/cuda/cuda_dnn.cc:334] Could not create cudnn handle: CUDNN_STATUS_INTERNAL_ERROR
2019-07-19 10:34:07.680151: W tensorflow/core/framework/op_kernel.cc:1401] OP_REQUIRES failed at cudnn_rnn_ops.cc:1217 : Unknown: Fail to find the dnn implementation.
Traceback (most recent call last):
  File "/home/wyq/PycharmProjects/MyProject/201907/Practice/kashgari/ner.py", line 32, in <module>
    batch_size=2)
  File "/home/wyq/Public/SoftWare/anaconda3/lib/python3.6/site-packages/kashgari_tf-0.5.1-py3.6.egg/kashgari/tasks/base_model.py", line 295, in fit
  File "/home/wyq/Public/SoftWare/anaconda3/lib/python3.6/site-packages/tensorflow/python/keras/engine/training.py", line 1426, in fit_generator
    initial_epoch=initial_epoch)
  File "/home/wyq/Public/SoftWare/anaconda3/lib/python3.6/site-packages/tensorflow/python/keras/engine/training_generator.py", line 191, in model_iteration
    batch_outs = batch_function(*batch_data)
  File "/home/wyq/Public/SoftWare/anaconda3/lib/python3.6/site-packages/tensorflow/python/keras/engine/training.py", line 1191, in train_on_batch
    outputs = self._fit_function(ins)  # pylint: disable=not-callable
  File "/home/wyq/Public/SoftWare/anaconda3/lib/python3.6/site-packages/tensorflow/python/keras/backend.py", line 3076, in __call__
    run_metadata=self.run_metadata)
  File "/home/wyq/Public/SoftWare/anaconda3/lib/python3.6/site-packages/tensorflow/python/client/session.py", line 1439, in __call__
    run_metadata_ptr)
  File "/home/wyq/Public/SoftWare/anaconda3/lib/python3.6/site-packages/tensorflow/python/framework/errors_impl.py", line 528, in __exit__
    c_api.TF_GetCode(self.status.status))
tensorflow.python.framework.errors_impl.UnknownError: Fail to find the dnn implementation.
  [[{{node layer_blstm/CudnnRNN}}]]
  [[{{node layer_crf/rnn/while/Switch_1}}]]

BrikerMan commented 5 years ago

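Please follow the issue template and provide your system information, software versions, and the key code. Then either open a new issue or update the first comment of this one.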
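
One possible way to gather the requested version details is a small script like the sketch below. It is not part of Kashgari or its issue template. The distribution names `tensorflow`, `tensorflow-gpu`, and `kashgari-tf` are assumptions (the last is inferred from the `kashgari_tf-0.5.1` egg in the traceback), and `importlib.metadata` requires Python 3.8 or newer, so on the Python 3.6 setup shown above a different lookup would be needed. CUDA and cuDNN versions are not detected here and would have to be reported by hand.

```python
import platform
import sys
from importlib import metadata  # standard library, Python 3.8+

# Assumed distribution names; adjust to whatever is actually installed.
PACKAGES = ("tensorflow", "tensorflow-gpu", "kashgari-tf")


def package_version(name):
    """Return the installed version of a distribution, or a placeholder."""
    try:
        return metadata.version(name)
    except metadata.PackageNotFoundError:
        return "not installed"


def collect_environment(packages=PACKAGES):
    """Collect OS, Python, and package versions into a dict."""
    info = {
        "os": platform.platform(),
        "python": sys.version.split()[0],
    }
    for name in packages:
        info[name] = package_version(name)
    return info


if __name__ == "__main__":
    # Print one "key: value" line per entry, ready to paste into an issue.
    for key, value in collect_environment().items():
        print(f"{key}: {value}")
```

The printed lines can be pasted into the issue next to the CUDA and cuDNN versions and the code that triggers the error.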