DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
982
stars
48
forks
source link
Can you provide the inference version of DeepSeek based on vllm, deepspeed and tensorrt-llm #23
Closed
Eutenacity closed 7 months ago
Can you provide the inference version of DeepSeek based on vllm, deepspeed and tensorrt-llm