FAILED models/test_weight_stripped_engine.py::TestWeightStrippedEngine::test_two_TRTRuntime_in_refitting - AssertionError: False is not true : TorchTensorRTModule outputs don't match with the original model. Cosine sim score: 0.0 Threshold: 0.99
Bug Description
PR https://github.com/pytorch/TensorRT/pull/3167 adds support for weight-stripped engines. It works for `PythonTorchTensorRTModule` but not for `TorchTensorRTModule`.
I observed the issue in this test: https://github.com/pytorch/TensorRT/blob/76bdf5e0f0e5e0e31d5bc4cbf1bedfa5f4f4ea32/tests/py/dynamo/models/test_weight_stripped_engine.py#L487-L523
The CI test fails with the assertion error quoted at the top of this report (cosine similarity score 0.0 against a threshold of 0.99).
I printed `refitted_output` when using `TorchTensorRTModule`, and it is all zeros, so it seems the refit did not succeed.
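
To illustrate why an all-zero output produces exactly the reported score, here is a minimal pure-Python sketch of a cosine-similarity check. It is not the helper the test suite uses: the function `cosine_similarity`, the `eps` guard, and the sample values in `reference_output` are assumptions made up for illustration. Only the 0.99 threshold and the all-zero `refitted_output` come from the report.

```python
import math


def cosine_similarity(a, b, eps=1e-8):
    """Cosine similarity of two flat sequences (hypothetical helper).

    eps guards against division by zero when either vector has zero norm.
    """
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / max(norm_a * norm_b, eps)


COSINE_THRESHOLD = 0.99  # threshold quoted in the CI error message

# Made-up stand-in for the original model's output (assumption).
reference_output = [0.3, -1.2, 0.8, 2.1]
# All zeros, as observed for the refitted TorchTensorRTModule.
refitted_output = [0.0, 0.0, 0.0, 0.0]

score = cosine_similarity(reference_output, refitted_output)
all_zero = all(v == 0.0 for v in refitted_output)

print(f"score={score}, all_zero={all_zero}, passes={score >= COSINE_THRESHOLD}")
```

A zero vector has a zero dot product with any reference, so the score is 0.0 regardless of what the original model returns. That is consistent with the weights never being written into the engine, rather than with a small numerical drift after refitting.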