Open dingidng opened 1 year ago
Thanks for the feedback. We will look into the problem.
I'm hitting the same issue: https://github.com/PaddlePaddle/PaddleNLP/issues/6418 My environment is as follows:
paddlenlp 2.7.2
paddlepaddle-gpu 2.6.0
fast-tokenizer-python 1.0.2
fastapi 0.110.0
fastdeploy-gpu-python 0.0.0
fastdeploy-tools 0.0.5

The error output is:

I0524 15:02:55.838413 1151823 allocator_facade.cc:435] Set default stream to 0x143f19e0 for StreamSafeCUDAAllocator(0xdd50af0) in Place(gpu:0)
I0524 15:02:55.838426 1151823 allocator_facade.cc:373] Get Allocator by passing in a default stream
I0524 15:02:55.838486 1151823 gpu_info.cc:224] [cudaMalloc] size=0.00244141 MB, result=0
I0524 15:02:55.838553 1151823 gpu_info.cc:224] [cudaMalloc] size=0.000244141 MB, result=0
I0524 15:02:55.838563 1151823 gpu_info.cc:224] [cudaMalloc] size=0.000244141 MB, result=0
I0524 15:02:55.838572 1151823 gpu_info.cc:224] [cudaMalloc] size=0.000244141 MB, result=0
I0524 15:02:55.838580 1151823 gpu_info.cc:224] [cudaMalloc] size=0.000244141 MB, result=0
I0524 15:02:55.838587 1151823 gpu_info.cc:224] [cudaMalloc] size=0.000244141 MB, result=0
I0524 15:02:55.838647 1151823 gpu_info.cc:224] [cudaMalloc] size=0.000244141 MB, result=0
I0524 15:02:55.838654 1151823 gpu_info.cc:224] [cudaMalloc] size=0.000244141 MB, result=0
I0524 15:02:55.838665 1151823 gpu_info.cc:224] [cudaMalloc] size=0.0732422 MB, result=0
I0524 15:02:55.839088 1151823 gpu_info.cc:224] [cudaMalloc] size=0.0288086 MB, result=0
I0524 15:02:55.839371 1151823 gpu_info.cc:224] [cudaMalloc] size=0.0732422 MB, result=0
I0524 15:02:55.839381 1151823 gpu_info.cc:224] [cudaMalloc] size=0.219727 MB, result=0
I0524 15:02:55.857205 1151823 gpu_info.cc:224] [cudaMalloc] size=0.248535 MB, result=0
I0524 15:02:55.859779 1151823 gpu_info.cc:224] [cudaMalloc] size=0.292969 MB, result=0
I0524 15:02:55.860016 1151823 gpu_info.cc:224] [cudaMalloc] size=0.292969 MB, result=0
I0524 15:02:55.860302 1151823 gpu_info.cc:224] [cudaMalloc] size=0.248535 MB, result=0
I0524 15:02:55.861150 1151823 stats.h:79] HostMemoryStatReserved0: Update current_value with 12, after update, current value = 12
I0524 15:02:55.861167 1151823 stats.h:79] HostMemoryStatAllocated0: Update current_value with 12, after update, current value = 12
I0524 15:02:55.861202 1151823 stats.h:79] HostMemoryStatReserved0: Update current_value with 4, after update, current value = 16
I0524 15:02:55.861207 1151823 stats.h:79] HostMemoryStatAllocated0: Update current_value with 4, after update, current value = 16
I0524 15:02:55.861232 1151823 stats.h:79] HostMemoryStatReserved0: Update current_value with 4, after update, current value = 20
I0524 15:02:55.861235 1151823 stats.h:79] HostMemoryStatAllocated0: Update current_value with 4, after update, current value = 20
Segmentation fault (core dumped)
Software environment
FastDeploy version: 1.0.7 (latest)
GPU build
Everything works normally except when using FastDeploy. cuDNN was downloaded from the official website.
Operating system: CentOS
Python version: 3.8
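To make environment reports like the one above easier to compare across machines, here is a minimal sketch that prints the platform, the Python version, and the installed versions of the packages mentioned in this thread. The helper name `collect_versions` is hypothetical, and the package list is simply copied from the report above; adjust it to whatever is installed locally.

```python
import platform
import sys
from importlib import metadata  # standard library since Python 3.8


def collect_versions(package_names):
    """Return {package name: installed version, or "not installed"}."""
    versions = {}
    for name in package_names:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = "not installed"
    return versions


def describe_environment(package_names):
    """Build a plain-text environment summary for pasting into an issue."""
    lines = [
        "platform: " + platform.platform(),
        "python: " + sys.version.split()[0],
    ]
    for name, version in collect_versions(package_names).items():
        lines.append(name + " " + version)
    return "\n".join(lines)


# Package names taken from the report in this thread.
PACKAGES = [
    "paddlenlp",
    "paddlepaddle-gpu",
    "fast-tokenizer-python",
    "fastdeploy-gpu-python",
    "fastdeploy-tools",
]
```

Calling `print(describe_environment(PACKAGES))` on both the working Windows machine and the failing CentOS machine gives two lists that can be compared line by line.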
Duplicate issues
Error description
1. Error when deploying with FastDeploy (error only)
Segmentation fault
After checking, the crash happens at this import: from paddlenlp.prompt import PromptDataCollatorWithPadding, UTCTemplate
Steps to reproduce & code
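One way to narrow down a crash like this is to run the suspect import in a child interpreter with Python's built-in `faulthandler` enabled, so that a segmentation fault prints the Python-level stack instead of exiting silently. The sketch below is only a diagnostic aid, not a fix, and the helper name `probe_import` is hypothetical.

```python
import signal
import subprocess
import sys


def probe_import(statement, timeout=300):
    """Run one Python statement in a child interpreter with faulthandler on.

    Returns (returncode, stderr). On POSIX, a negative return code means the
    child was killed by a signal, for example -signal.SIGSEGV on a segfault.
    """
    proc = subprocess.run(
        [sys.executable, "-X", "faulthandler", "-c", statement],
        capture_output=True,
        text=True,
        timeout=timeout,
    )
    return proc.returncode, proc.stderr


def crashed_with_segfault(returncode):
    """True if the child process was terminated by SIGSEGV (POSIX only)."""
    return returncode == -int(signal.SIGSEGV)
```

For the import reported here, one would call `probe_import("from paddlenlp.prompt import PromptDataCollatorWithPadding, UTCTemplate")` and inspect the returned stderr. If the child segfaults, the `faulthandler` dump shows which Python frame was executing when the native crash occurred, which helps tell whether the fault originates in paddlenlp itself or in one of the modules it imports.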
https://github.com/PaddlePaddle/PaddleNLP/blob/develop/applications/zero_shot_text_classification
One more note: on Windows the problem does not occur and everything runs normally. On Linux (CentOS), it fails on GPU (V100, T4) as well as on CPU with:
Segmentation fault