isarsoft / yolov4-triton-tensorrt

This repository deploys YOLOv4 as an optimized TensorRT engine to Triton Inference Server
http://www.isarsoft.com
Other
276 stars 63 forks source link

unexpected inference input 'data' #54

Closed ontheway16 closed 2 years ago

ontheway16 commented 2 years ago

Just built everything from scratch, and set the batch to 16 in main.cpp, but getting the following error;

+------------------+---------+---------------------------------------------------------------------------------------------+
| Model            | Version | Status                                                                                      |
+------------------+---------+---------------------------------------------------------------------------------------------+
| resnet50_pytorch | 1       | READY                                                                                       |
| yolov4           | 1       | UNAVAILABLE: Invalid argument: unexpected inference input 'data', allowed inputs are: input |
+------------------+---------+---------------------------------------------------------------------------------------------+

I1013 07:49:41.680190 1 plan_backend.cc:365] Creating instance yolov4_0_0_gpu0 on GPU 0 (6.1) using model.plan
I1013 07:49:41.686045 1 logging.cc:52] Allocated persistent device memory of size 422353920
I1013 07:49:41.693108 1 logging.cc:52] Allocated activation device memory of size 2401337344
I1013 07:49:41.693283 1 logging.cc:52] Assigning persistent memory blocks for various profiles
I1013 07:49:41.693315 1 plan_backend.cc:608] Detected input as execution binding for yolov4
I1013 07:49:41.693322 1 plan_backend.cc:608] Detected detections as execution binding for yolov4
I1013 07:49:41.693330 1 plan_backend.cc:161] ~PlanBackend::Context 
E1013 07:49:41.747686 1 model_repository_manager.cc:1242] failed to load 'yolov4' version 1: Invalid argument: unexpected inference input 'data', allowed inputs are: input
I1013 07:49:41.747724 1 model_repository_manager.cc:1008] TriggerNextAction() 'yolov4' version 1: 0
I1013 07:49:41.747732 1 model_repository_manager.cc:1023] no next action, trigger OnComplete()
I1013 07:49:41.747806 1 model_repository_manager.cc:612] VersionStates() 'yolov4'
I1013 07:49:41.960221 1 dynamic_batch_scheduler.cc:230] Starting dynamic-batch scheduler thread 0 at nice 5...
ontheway16 commented 2 years ago

Ok solved,

name: "yolov4"
platform: "tensorrt_plan"
max_batch_size: 1
input [
  {
    name: "input"
    data_type: TYPE_FP32
    format: FORMAT_NCHW
    dims: [ 3, 608, 608 ]
  }
]
output [
  {
    name: "detections"
    data_type: TYPE_FP32
    dims: [159201, 1, 1]
  }
]
philipp-schmidt commented 2 years ago

Yes, the newest version has changes, you need to adapt a few things or check out release 1.3.