Open WhatGhost opened 3 days ago
it seams that the lite is supported in the last commit: https://github.com/NVIDIA/TensorRT-LLM/blob/main/tensorrt_llm/models/deepseek_v2/convert.py#L355, you can try it.
it seams that the lite is supported in the last commit: https://github.com/NVIDIA/TensorRT-LLM/blob/main/tensorrt_llm/models/deepseek_v2/convert.py#L355, you can try it.
Thanks! i see it. I will try the latest commit
System Info
the offical docker env in docker/Dockerfile.multi of commit "c629546"
Who can help?
@byshiue @ncomly-nvidia I try to convert deepseek-v2-lite
and met the following error. It seems that the q_lora_rank in Lite is None and is different from DeepSeek-V2
I try to add the following code to tensorrt_llm/layers/attention.py and met new error
new error
It seems trtllm does not support Deepseek-V2-Lite.How can i solve this error.
Thanks very much!
Information
Tasks
examples
folder (such as GLUE/SQuAD, ...)Reproduction
see above
Expected behavior
see above
actual behavior
see above
additional notes
nothing