Open: Minami-su opened 4 months ago
DeepSeek V2 is a state-of-the-art MoE (mixture-of-experts) model. Are there any plans to support it?