Closed Bobo-y closed 8 months ago
The reason is roughly as follows: although the weights of the above layers are sparse, TRT found that implementing without sparsity is better than using sparsity. Therefore, even if the weights of some layers are sparse, sparse implementation will still not be used.
Can I ask what settings are needed to see this output? I set . /trtexec --onnx=model_weights_sparse.onnx --saveEngine=model_weights_sparse.trt --sparsity=enable --fp16 >result-model_weights_sparse-fp16.txt 2>&1
. But I don't see a similar output. Thank you!
Can I ask what settings are needed to see this output? I set
. /trtexec --onnx=model_weights_sparse.onnx --saveEngine=model_weights_sparse.trt --sparsity=enable --fp16 >result-model_weights_sparse-fp16.txt 2>&1
. But I don't see a similar output. Thank you!
may be use --verbose
Thank you.
above layer all traing with sparsity mask, but when convert to trt engine, only 3 layer can run with sparsity.