Closed StableFluffy closed 1 month ago
https://github.com/vllm-project/vllm/blob/main/vllm/model_executor/models/qwen2_moe.py
Qwen2MoE is implemented in vLLM.
No response
It's added in the dev branch (RC 0.5.3). Will close this issue when merged into main.
🚀 The feature, motivation and pitch
https://github.com/vllm-project/vllm/blob/main/vllm/model_executor/models/qwen2_moe.py
Qwen2MoE is implemented in vLLM.
Alternatives
No response
Additional context
No response