Error when running Paddle Serving

zouxiaoshi commented 2 years ago

Problem:

Q1:

Running the following command fails:

export SERVING_BIN=/usr/local/serving_bin/serving
python -m paddle_serving_server.serve \
--model ./serving_server \
--thread 8 --port 10010 \
--gpu_ids 0

Error message:

Error Message Summary:
----------------------
NotFoundError: Cannot open file ./serving_server/__model__, please confirm whether the file is normal.
  [Hint: Expected static_cast<bool>(fin.is_open()) == true, but received static_cast<bool>(fin.is_open()):0 != true:1.] (at /paddle/paddle/fluid/inference/api/analysis_predictor.cc:1119)

I then converted the model with the following command:

python -m paddle_serving_client.convert --dirname . \
                                         --model_filename model.pdmodel          \
                                         --params_filename model.pdiparams       \
                                         --serving_server ./serving_server/ \
                                         --serving_client ./serving_client/

This produced the following files:

.
├── model.pdiparams
├── model.pdmodel
├── serving_server_conf.prototxt
└── serving_server_conf.stream.prototxt
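
The Q1 error names ./serving_server/__model__, while the conversion output above contains model.pdmodel and model.pdiparams. As a rough aid for checking which file naming a model directory actually uses before starting the server, here is a minimal sketch. The function name describe_model_dir and its return labels are made up for illustration and are not part of Paddle Serving; the sketch only inspects file names and makes no claim about how the serving binary chooses a loader.

```python
from pathlib import Path


def describe_model_dir(model_dir):
    """Classify the file naming found in a model directory.

    The labels are illustrative only (not Paddle Serving terminology):
      "pdmodel-pdiparams": a *.pdmodel file and a *.pdiparams file are present
      "model-and-params":  __model__ and __params__ are present
      "model-only":        __model__ is present without __params__
      "unknown":           none of the above
    """
    names = {p.name for p in Path(model_dir).iterdir() if p.is_file()}
    has_pdmodel = any(n.endswith(".pdmodel") for n in names)
    has_pdiparams = any(n.endswith(".pdiparams") for n in names)
    if has_pdmodel and has_pdiparams:
        return "pdmodel-pdiparams"
    if "__model__" in names and "__params__" in names:
        return "model-and-params"
    if "__model__" in names:
        return "model-only"
    return "unknown"
```

For the listing above, describe_model_dir("./serving_server") would return "pdmodel-pdiparams", which does not match the __model__ file the error message asks for.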

Q2:

I forcibly renamed model.pdmodel (mv model.pdmodel __model__) and then started the Paddle Serving service, which produced the following errors:

SOLOv2 model

Error Message Summary:
----------------------
UnavailableError: Load operator fail to open file ./serving_server/sync_batch_norm_48.w_1, please check whether the model file is complete or damaged.
  [Hint: Expected static_cast<bool>(fin) == true, but received static_cast<bool>(fin):0 != true:1.] (at /paddle/paddle/fluid/operators/load_op.h:41)
  [operator < load > error]

YOLOv3 model

Error Message Summary:
----------------------
UnavailableError: Load operator fail to open file ./serving_server/batch_norm_41.b_0, please check whether the model file is complete or damaged.
  [Hint: Expected static_cast<bool>(fin) == true, but received static_cast<bool>(fin):0 != true:1.] (at /paddle/paddle/fluid/operators/load_op.h:41)
  [operator < load > error]
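
Each error above names one file that could not be opened. When comparing several such logs against a directory listing, it may help to pull that path out automatically. The sketch below does this for the two message wordings quoted in this thread; the helper name missing_file is hypothetical, and the pattern is an assumption based only on these messages, so other Paddle errors may not match.

```python
import re

# Matches the two "open file" wordings quoted in this thread:
#   "Cannot open file <path>, ..." and "fail to open file <path>, ..."
_MISSING_FILE = re.compile(r"(?:Cannot open file|fail to open file) (\S+?),")


def missing_file(error_text):
    """Return the path named in an 'open file' error message, or None."""
    match = _MISSING_FILE.search(error_text)
    return match.group(1) if match else None
```

The returned path can then be checked with os.path.exists to confirm whether the file is really absent from the model directory.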

The YOLOv3 model runs fine in the following environment:

paddle-serving-app        0.6.1
paddle-serving-client     0.6.1
paddle-serving-server-gpu 0.6.1.post102
paddlepaddle-gpu          2.1.0

Environment

paddle-serving-app        0.7.0
paddle-serving-client     0.7.0
paddle-serving-server-gpu 0.7.0.post102
paddlepaddle-gpu          2.2.0

cuda 10.2
Tesla V100
python 3.8
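
To make the difference between the two environments explicit, the package versions listed above can be compared programmatically. This is a minimal sketch: the version numbers are copied from this post, while the helper names version_changes and installed_versions are made up for illustration. installed_versions relies on the standard-library importlib.metadata module (available from Python 3.8) and reports None for packages that are not installed.

```python
from importlib import metadata

# Versions copied from the post above.
working = {
    "paddle-serving-app": "0.6.1",
    "paddle-serving-client": "0.6.1",
    "paddle-serving-server-gpu": "0.6.1.post102",
    "paddlepaddle-gpu": "2.1.0",
}
failing = {
    "paddle-serving-app": "0.7.0",
    "paddle-serving-client": "0.7.0",
    "paddle-serving-server-gpu": "0.7.0.post102",
    "paddlepaddle-gpu": "2.2.0",
}


def version_changes(old, new):
    """Map each package whose version differs to an (old, new) pair."""
    names = sorted(set(old) | set(new))
    return {n: (old.get(n), new.get(n)) for n in names if old.get(n) != new.get(n)}


def installed_versions(names):
    """Look up the versions installed in the current interpreter, None if absent."""
    found = {}
    for name in names:
        try:
            found[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            found[name] = None
    return found


for name, (before, after) in version_changes(working, failing).items():
    print(f"{name}: {before} -> {after}")
```

Calling installed_versions(failing) inside the affected environment would show whether the installed packages actually match the versions listed here.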

bjjwwang commented 2 years ago

Where did this SERVING_BIN come from?

TeslaZhao commented 2 years ago

fail to open file ./serving_server/batch_norm_41.b_0

Judging from the error message, is your model stored as multiple separate parameter files?

zouxiaoshi commented 2 years ago

Where did this SERVING_BIN come from?

It comes from https://github.com/PaddlePaddle/Serving/blob/74a03152480ecd2ad7029873b92a4a71991b168e/tools/dockerfiles/build_scripts/install_whl.sh#L46, with serving_version in that script replaced by 0.7.0.

zouxiaoshi commented 2 years ago

fail to open file ./serving_server/batch_norm_41.b_0 Judging from the error message, is your model stored as multiple separate parameter files?

Hello. I used the YOLO model scripts from PaddleDetection. After training, the exported files are:

.
├── infer_cfg.yml
├── model.pdiparams
├── model.pdiparams.info
└── model.pdmodel

I then ran the following command to generate serving_server and serving_client:

python -m paddle_serving_client.convert --dirname . \
                                         --model_filename model.pdmodel          \
                                         --params_filename model.pdiparams       \
                                         --serving_server ./serving_server/ \
                                         --serving_client ./serving_client/

The contents of serving_server are:

.
├── fluid_time_file
├── model.pdiparams
├── model.pdmodel
├── serving_server_conf.prototxt
└── serving_server_conf.stream.prototxt

Starting the service then failed with this error:

Error Message Summary:
----------------------
NotFoundError: Cannot open file ./serving_server/__model__, please confirm whether the file is normal.
  [Hint: Expected static_cast<bool>(fin.is_open()) == true, but received static_cast<bool>(fin.is_open()):0 != true:1.] (at /paddle/paddle/fluid/inference/api/analysis_predictor.cc:1119)

I renamed the model file (model.pdmodel ==> __model__) and ran it again, which produced the following error:

Error Message Summary:
----------------------
UnavailableError: Load operator fail to open file ./serving_server/batch_norm_41.b_0, please check whether the model file is complete or damaged.
  [Hint: Expected static_cast<bool>(fin) == true, but received static_cast<bool>(fin):0 != true:1.] (at /paddle/paddle/fluid/operators/load_op.h:41)
  [operator < load > error]

bjjwwang commented 2 years ago

OK, understood. I'll try to reproduce this on my side and report back in about two hours.

bjjwwang commented 2 years ago

Sorry, I couldn't reproduce it here. I suspect it may be a SERVING_BIN version problem. Could you unset SERVING_BIN and run it again?

python -m paddle_serving_server.serve \
--model ./serving_server \
--thread 8 --port 10010 \
--gpu_ids 0
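
The same experiment can be run without touching the current shell session by launching the server from Python with SERVING_BIN dropped from the child environment. This is only a sketch: the flags are the ones quoted in this thread, the helper names env_without, serve_command, and run_server are hypothetical, and run_server assumes paddle_serving_server is installed in the interpreter that runs the script.

```python
import os
import subprocess
import sys


def env_without(name, base=None):
    """Return a copy of an environment mapping with one variable removed."""
    env = dict(os.environ if base is None else base)
    env.pop(name, None)
    return env


def serve_command(model_dir="./serving_server", port=10010, threads=8, gpu_ids="0"):
    """Build the serve command quoted in this thread as an argument list."""
    return [
        sys.executable, "-m", "paddle_serving_server.serve",
        "--model", model_dir,
        "--thread", str(threads),
        "--port", str(port),
        "--gpu_ids", gpu_ids,
    ]


def run_server():
    """Start the server without SERVING_BIN; blocks until the server exits."""
    return subprocess.run(serve_command(), env=env_without("SERVING_BIN"), check=True)
```

Calling run_server() starts the service with every other environment variable unchanged, so any difference in behavior can be attributed to SERVING_BIN.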

wenjia322 commented 11 months ago

@zouxiaoshi @bjjwwang Sorry to bother you. I've run into the same problem. How did you resolve it at the time?

My steps were:

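I set up the environment following sections 1.2 and 2.1 of the installation guide, and all of the environment checks in section 3 passed.
I downloaded the PaddleOCR model following the model conversion guide, but the converted ppocr_det_v3_serving folder contains inference.pdmodel and inference.pdiparams rather than files named __model__ and __params__. Why is that?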

When I run:

python3 -m paddle_serving_server.serve --model ppocr_det_v3_serving  --port 8181

the following error is reported:

----------------------
Error Message Summary:
----------------------
NotFoundError: Cannot open file ppocr_det_v3_serving/__model__, please confirm whether the file is normal.
[Hint: Expected static_cast<bool>(fin.is_open()) == true, but received static_cast<bool>(fin.is_open()):0 != true:1.] (at /paddle/paddle/fluid/inference/api/analysis_predictor.cc:1452)

How can this be resolved?

PaddlePaddle / Serving

Error when running Paddle Serving #1535
