terrifyzhao / bert-utils

一行代码使用BERT生成句向量,BERT做文本分类、文本相似度计算
Apache License 2.0
1.65k stars 425 forks source link

用gpu环境跑出问题 #28

Open mllwm opened 5 years ago

mllwm commented 5 years ago

您好,我只需要得到句向量,但是在GPU环境下出问题了,您能帮解决一下吗?没有报错,但是程序无法继续运行,在cpu环境下没有问题,谢谢。

2019-06-03 11:20:11.971647: W tensorflow/compiler/xla/service/gpu/llvm_gpu_backend/nvptx_backend_lib.cc:134] Unknown compute capability (7, 5) .Defaulting to telling LLVM that we're compiling for sm_30 2019-06-03 11:20:13.429726: W tensorflow/compiler/xla/service/gpu/llvm_gpu_backend/nvptx_backend_lib.cc:105] Unknown compute capability (7, 5) .Defaulting to libdevice for compute_20 2019-06-03 11:20:13.448273: W tensorflow/core/framework/op_kernel.cc:1401] OP_REQUIRES failed at xla_ops.cc:429 : Not found: ./libdevice.compute_20.10.bc not found

terrifyzhao commented 5 years ago

@mllwm 请提供更详细的log信息

mllwm commented 5 years ago

错误如下,非常感谢 tensorflow/compiler/xla/service/gpu/nvptx_compiler.cc:794] Failed to compile ptx to cubin. Will attempt to let GPU driver compile the ptx. Not found: /usr/local/cuda-9.0/bin/ptxas not found
2019-06-04 15:10:33.384438: W tensorflow/core/framework/op_kernel.cc:1273] OP_REQUIRES failed at xla_ops.cc:408 : Not found: ./libdevice.compute_30.10.bc not found

------------------ 原始邮件 ------------------ 发件人: "joe.zhao"notifications@github.com; 发送时间: 2019年6月25日(星期二) 下午2:20 收件人: "terrifyzhao/bert-utils"bert-utils@noreply.github.com; 抄送: "马乐乐"1659124250@qq.com;"Mention"mention@noreply.github.com; 主题: Re: [terrifyzhao/bert-utils] 用gpu环境跑出问题 (#28)

@mllwm 请提供更详细的log信息

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub, or mute the thread.

houqinju2016 commented 5 years ago

出现了同样的问题

lixiangtnt commented 5 years ago

出现了同样的问题

Roshanson commented 4 years ago

sudo cp libdevice.10.bc libdevice.compute_20.10.bc export XLA_FLAGS="--xla_gpu_cuda_data_dir=/usr/local/cuda/"

fayandjeanie commented 4 years ago

出現了同樣的問題。 用export XLA_FLAGS="--xla_gpu_cuda_data_dir=/usr/local/cuda/"也無法修正。 而且我的CUDA不是9.0的

2019-12-12 15:13:16.294571: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 2793275000 Hz 2019-12-12 15:13:16.298611: I tensorflow/compiler/xla/service/service.cc:149] XLA service 0x7f38780abb80 executing computations on platform Host. Devices: 2019-12-12 15:13:16.298757: I tensorflow/compiler/xla/service/service.cc:157] StreamExecutor device (0): <undefined>, <undefined> 2019-12-12 15:13:16.606605: W tensorflow/compiler/xla/service/gpu/nvptx_compiler.cc:794] Failed to compile ptx to cubin. Will attempt to let GPU driver compile the ptx. Not found: /usr/local/cuda-9.0/bin/ptxas not found 2019-12-12 15:13:18.181859: W tensorflow/core/framework/op_kernel.cc:1273] OP_REQUIRES failed at xla_ops.cc:408 : Not found: ./libdevice.compute_30.10.bc not found