PygmalionAI / aphrodite-engine

PygmalionAI's large-scale inference engine
https://pygmalion.chat
GNU Affero General Public License v3.0
722 stars, 85 forks

[Feature]: Add support for Qwen2MoE #387

Closed: StableFluffy closed this issue 1 month ago

StableFluffy commented 2 months ago

🚀 The feature, motivation and pitch

https://github.com/vllm-project/vllm/blob/main/vllm/model_executor/models/qwen2_moe.py

Qwen2MoE is already implemented in vLLM (linked above). Please add support for it in Aphrodite.

Alternatives

No response

Additional context

No response

AlpinDale commented 2 months ago

It's added in the dev branch (RC 0.5.3). Will close this issue when merged into main.