What does your model configuration look like (config.pbtxt)?
Closing. Please provide the requested information and re-open if you are still hitting the issue.
This is my config.pbtxt:
name: "arcfaceint8"
platform: "tensorflow_graphdef"
max_batch_size: 16
input [
  {
    name: "Placeholder"
    data_type: TYPE_FP32
    dims: [ 112, 112, 3 ]
  }
]
output [
  {
    name: "embd_extractor/BatchNorm_1/Reshape_1"
    data_type: TYPE_FP32
    dims: [ 512 ]
  }
]
dynamic_batching {
  preferred_batch_size: [ 8, 16 ]
  max_queue_delay_microseconds: 100
}
Did you generate the TF-TRT model using the same version of TensorRT as is being used by TRTIS? The easiest way to do this is to use the TensorRT container from the same release as the TRTIS you are using (for example, 19.07).
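One quick way to check, assuming a TF 1.14+ build where trt_convert exposes these version helpers, is to print the TensorRT version TensorFlow was linked against versus the one loaded at runtime; both should match the TensorRT release used inside your TRTIS container:

# Minimal version check; an assumption that your TF build ships these helpers.
from tensorflow.python.compiler.tensorrt import trt_convert as trt

print("linked TensorRT:", trt.get_linked_tensorrt_version())
print("loaded TensorRT:", trt.get_loaded_tensorrt_version())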
Any updates on this issue? I have the same issue on tensorflow/serving:1.14.0-gpu and nvcr.io/nvidia/tensorflow:19.10-py3.
I got the same error when using TensorFlow 1.15 + TF-TRT 5.1.5. Training and inference work fine; here are some logs from inference.
But when I deploy the TF-TRT INT8-optimized model with some warm-up data, this error happens.
Deploying the FP32 or FP16 optimized model works fine. Here is the log for the FP16 deployment.
This bug is really strange.
@zhangqijun Sorry to bother you, but do you have a solution to this problem?
@minhdeal Sorry to bother you, but do you have a solution to this problem?
I use a TF-TRT INT8-optimized model to start a server with nvcr.io/nvidia/tensorrtserver:19.07-py3. When I run inference with simple_client.py, I get this error and the server goes down. But inference with simple_client.py on the same model at FP32 or FP16 precision works correctly, and inference with the same INT8-optimized model on nvcr.io/nvidia/tensorrtserver:19.02-py3 also works. Sorry about my Chinglish.
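For reference, the inference call looks roughly like this (a sketch modeled on simple_client.py, assuming the 19.07 tensorrtserver Python client API; the model and tensor names come from the config.pbtxt above):

import numpy as np
from tensorrtserver.api import InferContext, ProtocolType

# Connect over HTTP; -1 selects the latest version of "arcfaceint8".
ctx = InferContext("localhost:8000", ProtocolType.HTTP, "arcfaceint8", -1)

# One 112x112x3 FP32 image, matching the input dims in config.pbtxt.
img = np.zeros((112, 112, 3), dtype=np.float32)
result = ctx.run(
    {"Placeholder": [img]},
    {"embd_extractor/BatchNorm_1/Reshape_1": InferContext.ResultFormat.RAW},
    batch_size=1)
print(result["embd_extractor/BatchNorm_1/Reshape_1"][0].shape)  # (512,)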
The .pb model is generated by this code:
import os
import cv2
import numpy as np
import pandas as pd
import tensorflow as tf
import tensorflow.contrib.tensorrt as trt

config = tf.ConfigProto()

# Load the frozen FP32 graph and build the TF-TRT INT8 calibration graph.
with tf.Graph().as_default():
    with tf.Session(config=config) as sess:
        with tf.gfile.GFile("arcface.pb", "rb") as f:
            graph_def = tf.GraphDef()
            graph_def.ParseFromString(f.read())
        trt_graph = trt.create_inference_graph(
            input_graph_def=graph_def,
            outputs=['embd_extractor/BatchNorm_1/Reshape_1'],
            max_batch_size=8,
            max_workspace_size_bytes=2 << 20,
            precision_mode="int8")

# Feed representative images through the calibration graph to collect ranges.
with tf.Session(graph=tf.Graph(), config=config) as sess:
    tf.import_graph_def(trt_graph, return_elements=['embd_extractor/BatchNorm_1/Reshape_1'])
    images = tf.get_default_graph().get_tensor_by_name("import/Placeholder:0")
    output_tensor = tf.get_default_graph().get_tensor_by_name("import/embd_extractor/BatchNorm_1/Reshape_1:0")
    df = pd.read_csv("/media/ssd/predev/face_verifacation/dairy_test/dairy_verifacation_align.csv")
    for i in range(len(df)):
        img_path = os.path.join("/media/ssd/predev/face_verifacation/dairy_test/images_align/img", df.loc[i, "A1"])
        img = cv2.imread(img_path).astype(np.float32)[:, :, ::-1]  # BGR -> RGB
        print(sess.run(output_tensor, feed_dict={images: [img]}).shape)

# Convert the calibrated graph to the final INT8 inference graph and save it.
trt_int8_calibrated_graph = trt.calib_graph_to_infer_graph(trt_graph, is_dynamic_op=True)
tf.train.write_graph(trt_int8_calibrated_graph, './', 'int8model.graphdef', as_text=False)
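Note that TF-TRT INT8 is a two-phase process: create_inference_graph with precision_mode="int8" produces a calibration graph, the loop over real images collects the activation ranges, and calib_graph_to_infer_graph then converts the result into the final INT8 inference graph. The saved file goes into the TRTIS model repository next to the config.pbtxt above, renamed to model.graphdef (the default filename TRTIS expects for the tensorflow_graphdef platform), e.g.:

models/
  arcfaceint8/
    config.pbtxt
    1/
      model.graphdef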