grimoire / torch2trt_dynamic

A pytorch to tensorrt convert with dynamic shape support
MIT License
254 stars 34 forks

c++ inference #4

Open lucasjinreal opened 3 years ago

lucasjinreal commented 3 years ago

Hi, I have converted the engine file, but when I try to run inference with the generated TRT engine file, I get an error:

[image: screenshot of the error]

Do you know the best way to find these plugins? I do have your amirstan plugins installed, but I don't know how to use them in C++. Do I simply link against the library?

grimoire commented 3 years ago

Hi. Linking the lib should be enough; if that doesn't work, invoke initLibAmirstanInferPlugins() from amirInferPlugin.h to load the plugins manually. Read this for details.
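To make the two steps concrete, here is a minimal C++ sketch of what grimoire describes: link the plugin library, call initLibAmirstanInferPlugins() before deserializing, then load the engine as usual. The engine path `model.engine` is a hypothetical placeholder, and the exact ILogger signature depends on your TensorRT version (TensorRT 8 requires `noexcept`; drop it on TensorRT 7):

```cpp
// Sketch: C++ inference with an engine that uses amirstan plugins.
// Assumes you link -lamirstan_plugin alongside the usual -lnvinfer.
#include <NvInfer.h>
#include <amirInferPlugin.h>  // declares initLibAmirstanInferPlugins()
#include <cstdio>
#include <fstream>
#include <vector>

class Logger : public nvinfer1::ILogger {
    void log(Severity severity, const char* msg) noexcept override {
        if (severity <= Severity::kWARNING) std::printf("%s\n", msg);
    }
};

int main() {
    // Register the custom plugin creators BEFORE deserializing the engine;
    // otherwise TensorRT cannot resolve the plugin layers by name.
    initLibAmirstanInferPlugins();

    Logger logger;
    std::ifstream file("model.engine", std::ios::binary);  // hypothetical path
    std::vector<char> blob((std::istreambuf_iterator<char>(file)),
                           std::istreambuf_iterator<char>());

    auto* runtime = nvinfer1::createInferRuntime(logger);
    auto* engine  = runtime->deserializeCudaEngine(blob.data(), blob.size());
    // ... create an execution context and run inference ...
    return engine != nullptr ? 0 : 1;
}
```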

lucasjinreal commented 3 years ago

@grimoire I want to consult you on another question. For models that don't need plugins at all, such as ResNet-50, which is a completely standard model, is it possible to generate the engine without the plugins? I mean, for end users, I want the engine to depend on plugins only if it really needs them; for normal models, users should be able to run inference directly with the stock TensorRT lib.

grimoire commented 3 years ago

Errr... Actually, TensorRT does not have a GAP (global average pooling) implementation. That's why ResNet needs plugins. Here are the layers TensorRT provides: https://docs.nvidia.com/deeplearning/tensorrt/api/python_api/infer/Graph/Layers.html#ipoolinglayer

You can use normal average pooling (if you don't need dynamic shapes) or a reduce instead of GAP.

Read __init__.py. These are the converters that need plugins; avoid them if you don't want to use plugins.


try:
    # custom plugin support
    from .GroupNorm import *
    from .repeat import *
    from .expand import *
    from .gather import *
    from .adaptive_avg_pool2d import *
    from .adaptive_max_pool2d import *
    from .AdaptiveAvgPool2d import *
    from .AdaptiveMaxPool2d import *
    from .meshgrid import convert_meshgrid
    from .grid_sample import convert_grid_sample
    from .flip import convert_flip
    from .cummax import convert_cummax
    from .cummin import convert_cummin
    from .cumsum import convert_cumsum
    from .cumprod import convert_cumprod
    from .expand_as import convert_expand_as
    from .deform_conv2d import convert_deform_conv2d
    from .nms import convert_nms
    from .roi_align import convert_roi_align, convert_RoiAlign
    from .roi_pool import convert_roi_pool, convert_RoIPool
except:
    print("plugin not found.")
lucasjinreal commented 3 years ago

@grimoire Did you mean that if the model doesn't use a plugin, torch2trt will not generate an engine with plugins? AFAIK, converting ResNet with onnx2trt doesn't invoke any plugin in its engine.

grimoire commented 3 years ago

In this issue:

If you use TensorRT 7.*, the onnx operator GlobalAveragePool is implemented using IReduceLayer

IReduceLayer can reduce along given dims; that's why onnx -> tensorrt doesn't need plugins.

But this repo also needs to support mmdetection-to-tensorrt, which needs AdaptivePooling to do downsampling (pool size != 1), and IReduceLayer cannot help me with that task. Still, reducing plugin usage is a good point; I will add a special case to GAP for pool size = 1 in the future.

lucasjinreal commented 3 years ago

@grimoire I am not asking specifically about ResNet; I just took it as an example. I want to know whether it is possible to build an engine with this tool whose engine doesn't invoke plugins, if the model doesn't need any (take VGG as an example).

grimoire commented 3 years ago

If the model does not use the layers I mentioned above, the answer is yes.