Open ReginaZh opened 4 days ago
@ReginaZh Could you try to use the latest main branch? We have fixed it.
@ReginaZh This issue is fixed, but the fix has not been merged into main yet. Until then, you can try this hot fix:
Add `from collections import Counter` to https://github.com/NVIDIA/TensorRT-LLM/blob/main/tensorrt_llm/models/modeling_utils.py, then add the following after this line:
```python
repeated_ptrs = [
    key for key, value in dict(Counter(weights_ptrs)).items()
    if value > 1
]
for key in weights:
    if weights[key].data_ptr() in repeated_ptrs:
        weights[key] = weights[key].clone().detach()
```
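To make the logic of the hot fix easier to follow, here is a minimal standalone sketch of the same idea. It rests on two assumptions: `weights_ptrs` holds one `data_ptr()` value per entry in `weights` (its definition is not shown in the snippet), and `weights` is a dict of tensor-like objects. `clone_shared_weights` and `StandInTensor` are hypothetical names used only for illustration; the stand-in class mimics the three `torch.Tensor` methods used, so the sketch runs without PyTorch.

```python
from collections import Counter


def clone_shared_weights(weights):
    """Give every weight that shares storage with another weight its own copy.

    `weights` maps names to tensor-like objects exposing data_ptr(),
    clone() and detach(), as torch.Tensor does.
    """
    # Assumption: one data pointer per entry in `weights`.
    weights_ptrs = [w.data_ptr() for w in weights.values()]
    # Pointers that occur more than once belong to shared (e.g. tied) weights.
    repeated_ptrs = {
        ptr for ptr, count in Counter(weights_ptrs).items() if count > 1
    }
    # Reassigning existing keys does not change the dict's size,
    # so this is safe to do while iterating over it.
    for key in weights:
        if weights[key].data_ptr() in repeated_ptrs:
            weights[key] = weights[key].clone().detach()
    return weights


class StandInTensor:
    """Hypothetical stand-in for torch.Tensor (illustration only)."""

    _next_ptr = 0

    def __init__(self, values):
        self.values = list(values)
        StandInTensor._next_ptr += 1
        self._ptr = StandInTensor._next_ptr  # unique fake "address"

    def data_ptr(self):
        return self._ptr

    def clone(self):
        return StandInTensor(self.values)  # new object, new pointer

    def detach(self):
        return self


shared = StandInTensor([1.0, 2.0])
weights = {
    "embedding.weight": shared,
    "lm_head.weight": shared,  # same object, as with tied weights
    "norm.weight": StandInTensor([0.5]),
}
norm_before = weights["norm.weight"]
clone_shared_weights(weights)
```

Note that every entry sharing a pointer is cloned, including the first one, while weights with a unique pointer are left untouched.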
System Info
A100
Who can help?
No response
Information
Tasks
`examples` folder (such as GLUE/SQuAD, ...)
Reproduction
Expected behavior
Successfully convert and save model checkpoints
actual behavior
additional notes
transformers version: 4.42.3
TensorRT-LLM version: "0.15.0.dev2024111200"