issues
search
lix19937
/
tensorrt-insight
Deep insight tensorrt, including but not limited to qat, ptq, plugin, triton_inference, cuda
12
stars
0
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
哪些onnx op 会产生trt shuffle
#60
lix19937
opened
1 day ago
0
占用算力计算
#59
lix19937
opened
2 days ago
0
cudaMemcpy consuming CPU resources ?
#58
lix19937
opened
1 week ago
1
trt8510 not support topk with input is int or int64
#57
lix19937
opened
1 week ago
1
depthwise separable convolutions profile 深度可分离卷积评测
#56
lix19937
opened
1 week ago
0
how to inverse a permutation
#55
lix19937
opened
1 week ago
0
onnx models-with-external-data
#54
lix19937
opened
1 week ago
0
torch.onnx.errors.UnsupportedOperatorError: Exporting the operator 'aten::__iand_' to ONNX opset version 17 is not supported
#53
lix19937
opened
1 week ago
1
No importer registered for op: Inverse. Attempting to import as plugin, torch.inverse not support to onnx
#52
lix19937
opened
1 week ago
0
/usr/local/cuda-11.4/targets/aarch64-linux/include/crt/sm_80_rt.hpp(141): error: more than one instance of overloaded function "__nv_associate_access_property_impl" has "C" linkage
#51
lix19937
opened
2 weeks ago
0
bf16
#50
lix19937
opened
3 weeks ago
0
ASP for mm-x (mmcv mmdet mmseg) KeyError: <class 'mmcv.cnn.bricks.wrappers.Linear'>
#49
lix19937
opened
3 weeks ago
0
torch.randint usage in msda plugin
#48
lix19937
opened
1 month ago
0
RuntimeError: input_shape_value == reshape_value || input_shape_value == 1 || reshape_value == 1 INTERNAL ASSERT FAILED at "../torch/
#47
lix19937
opened
1 month ago
0
. vs source vs bash
#46
lix19937
opened
1 month ago
0
cuda kernel fp32 to fp16 precision loss ?
#45
lix19937
opened
1 month ago
0
How to determine if the torch layer can export an independent onnx node ?
#44
lix19937
opened
1 month ago
2
LayerNormalization support
#43
lix19937
opened
1 month ago
0
Unable to locate package <package>
#42
lix19937
opened
2 months ago
0
CUDA error code=35(cudaErrorInsufficientDriver) "cudaStreamCreateWithPriority(&stream, flags, priority)"
#41
lix19937
opened
2 months ago
0
trt cuda ctx vs engine 生命周期问题
#40
lix19937
opened
2 months ago
0
How to enable CUDAGraph for operations with dynamic control flow ?
#39
lix19937
opened
2 months ago
0
error while loading shared libraries: /usr/local/cuda/lib64/libcublasLt.so.11: file too short
#38
lix19937
opened
2 months ago
0
CUDA error code=35(cudaErrorInsufficientDriver) "cudaSetDevice(device)"
#37
lix19937
opened
2 months ago
1
CUDA error code=999(cudaErrorUnknown)
#36
lix19937
opened
3 months ago
0
nvcc fatal : Unknown option ‘fPIC’
#35
lix19937
opened
3 months ago
0
[runner.cpp::execute::718] Error Code 1: Myelin (Final synchronize failed (700))
#34
lix19937
opened
4 months ago
1
How to fully utilize GPU when use trt infer ?
#33
lix19937
opened
4 months ago
0
[runner.cpp::execute::718] Error Code 1: Myelin (Final synchronize failed (700))
#32
lix19937
closed
4 months ago
1
onnx 经过工具进行折叠优化后size变大分析
#31
lix19937
opened
4 months ago
0
h100 vs a100
#30
lix19937
opened
4 months ago
0
bug list
#29
lix19937
opened
4 months ago
0
some problems of QAT sample
#28
lix19937
opened
4 months ago
3
Floating point computing capacity not match with Orin-x's datasheet
#27
lix19937
opened
4 months ago
1
Only 11 arm cores and 4MB L3 cache observed in orin-Devkit
#26
lix19937
opened
4 months ago
1
nvidia tensorrt faq
#25
lix19937
opened
4 months ago
0
NVOnline - 1_19_2023 5_06_12 AM
#24
lix19937
opened
4 months ago
0
[06/27/2024-11:48:30] [E] Error[4]: [graphShapeAnalyzer.cpp::analyzeShapes::1872] Error Code 4: Miscellaneous (IShuffleLayer Reshape_966: reshape wildcard -1 has no integer solution. Reshaping [6,1000,8,32] to [6,334,-1].)
#23
lix19937
opened
4 months ago
0
Cuda failure: operation failed due to a previous error during capture
#22
lix19937
opened
4 months ago
3
Cuda Runtime (invalid resource handle)
#21
lix19937
opened
5 months ago
1
How to optimize slice
#20
lix19937
opened
5 months ago
1
trtexec build + infer passed, but arise error free(): invalid pointer
#19
lix19937
opened
5 months ago
0
reducesum+nonzero impl
#18
lix19937
opened
5 months ago
0
nonzero impl
#17
lix19937
opened
5 months ago
1
shell Syntax error: end of file unexpected (expecting “then“)
#16
lix19937
opened
5 months ago
0
squeeze(-1) 可能导致if 操作产生 可通过onnx 看到
#15
lix19937
opened
5 months ago
1
6060上直接更换trt8.6.0报错
#14
lix19937
opened
5 months ago
1
交叉编译时候 Relocations in generic ELF (EM: 62)
#13
lix19937
opened
5 months ago
1
TensorRT conversion fails with DCHECK(!i->is_use_only())
#12
lix19937
opened
6 months ago
1
drive os 6060 trt8510 support grid_sample or not ?
#11
lix19937
opened
6 months ago
1
Next