Open shekhars-li opened 7 months ago
Update: I see the latest release 0.7.1 does not support FalconForCausalLM
in MODEL_MAP yet. I do not have an option to compile from source as I can only push a precompiled docker image and not run the compilation on the cluster with A100. Can you please create a new release with the latest changes that support the FalconForCausalLM
architecture too?
As a final attempt, I tried to install unreleased version myself
pip install tensorrt-llm==0.9.0.dev2024020600 --extra-index-url https://pypi.nvidia.com
And I am unable to install also:
INFO: pip is looking at multiple versions of tensorrt-llm to determine which version is compatible with other requirements. This could take a while.
ERROR: Cannot install tensorrt-llm because these package versions have conflicting dependencies.
The conflict is caused by:
nvidia-ammo 0.7.3 depends on torchprofile>=0.0.4
nvidia-ammo 0.7.2 depends on torchprofile>=0.0.4
nvidia-ammo 0.7.1 depends on onnxruntime>=1.16.1
nvidia-ammo 0.7.0 depends on onnxruntime>=1.16.1
To fix this you could try to:
1. loosen the range of package versions you've specified
2. remove package versions to allow pip attempt to solve the dependency conflict
System Info
Who can help?
@kaiyux @byshiue
Information
Tasks
examples
folder (such as GLUE/SQuAD, ...)Reproduction
Convert HF weights:
Compile engine:
Expected behavior
Engine compiles successfullly
actual behavior
trtllm-build returns
additional notes
I am following very simple standard script from the repo. The weights are HF weights. The build is simple too. I already have pods with all the dependencies installed. I verified tensorrt-llm can be loaded/used in python repr.