Franc-Z / QWen1.5_TensorRT-LLM

Optimize QWen1.5 models with TensorRT-LLM
Apache License 2.0
15 stars 3 forks source link

support CodeQwen1.5-7b-chat? #4

Open elegant-bot opened 4 months ago

elegant-bot commented 4 months ago

get this error: AssertError:QWen uses MHA

Franc-Z commented 4 months ago

Could you give your building script ?