lix19937 / tensorrt-insight

A deep look into TensorRT.
Topics: asp, ptq, qat, tensorrt
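| topic | subject | notes |
| --- | --- | --- |
| overview | Overview | |
| layout | Memory layout | |
| compute_graph_optimize | Computation graph optimization | |
| dynamic_shape | Dynamic shapes | |
| plugin | Plugins | |
| calibration | Calibration | |
| asp | Sparsity | |
| qat | Quantization-aware training | |
| trtexec | Helper tool | |
| runtime | Runtime | |
| inferflow | Model scheduling | |
| mps | MPS | |
| deploy | ONNX-based deployment workflow, TensorRT tool usage | |
| py-tensorrt | Python TensorRT wrapper | Walkthrough of tensorrt `__init__` |
| cookbook | Cookbook | |
| developer_guide | Developer guide | |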
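
The deploy and trtexec topics above cover turning an ONNX model into a TensorRT engine. As a rough illustration only (not taken from this repository), the sketch below assembles a `trtexec` command line in Python. The helper name `build_trtexec_command` and the file names `model.onnx` / `model.engine` are hypothetical; the flags `--onnx`, `--saveEngine`, `--fp16`, and `--int8` are standard `trtexec` options. The script only builds and prints the command, it does not run it.

```python
import shlex
from typing import List


def build_trtexec_command(
    onnx_path: str,
    engine_path: str,
    fp16: bool = False,
    int8: bool = False,
) -> List[str]:
    """Assemble a trtexec argument list (hypothetical helper, for illustration)."""
    cmd = [
        "trtexec",
        f"--onnx={onnx_path}",          # input ONNX model
        f"--saveEngine={engine_path}",  # where the serialized engine is written
    ]
    if fp16:
        cmd.append("--fp16")  # allow FP16 kernels
    if int8:
        cmd.append("--int8")  # allow INT8 kernels (needs calibration or Q/DQ nodes)
    return cmd


if __name__ == "__main__":
    # Placeholder file names; replace with real paths before running trtexec.
    command = build_trtexec_command("model.onnx", "model.engine", fp16=True)
    print(shlex.join(command))
```

Returning a list rather than a single string keeps the command safe to pass to `subprocess.run` without shell quoting issues, assuming `trtexec` is on the `PATH`.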

TensorRT migration notes between versions
https://docs.nvidia.com/deeplearning/tensorrt/migration-guide/index.html

Ref

https://docs.nvidia.com/deeplearning/tensorrt/archives/
https://developer.nvidia.com/search?page=1&sort=relevance&term=
https://github.com/HeKun-NVIDIA/TensorRT-Developer_Guide_in_Chinese/tree/main

https://developer.nvidia.com/zh-cn/blog/nvidia-gpu-fp8-training-inference/