Closed by johndun 9 months ago
Thanks for pointing out this issue and the workaround. I'll take a look today.
OK, it turns out that we should do exactly what you proposed. The Mistral config does use `MistralForCausalLM`, so we should look for this class instead of using a hard-coded mapping. I'll file a PR for it now and make you a co-author. Thanks!
`get_model_cls_by_arch_name`, introduced in the Dynamic model class loading PR, removes the hard-coded mapping between `MistralForCausalLM` and `LlamaForCausalLM`, which causes issues when trying to host the Mistral-7b model locally as of sglang version 0.1.9. I have tested that adding the following simple `models/mistral.py` file allows hosting the Mistral-7b model.
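The file itself did not survive in this thread, so the block below is not the author's `models/mistral.py`. It is only a self-contained sketch of the idea discussed here: when model classes are resolved by architecture name rather than through a hard-coded mapping, a thin `MistralForCausalLM` class that reuses the Llama implementation is enough for the lookup to succeed. The stand-in `LlamaForCausalLM`, the `MODEL_REGISTRY` dict, and `resolve_model_cls` are all hypothetical names for illustration and are not sglang's actual API.

```python
# Illustrative sketch only: every name here is a stand-in, not sglang code.


class LlamaForCausalLM:
    """Placeholder for an existing Llama model implementation."""

    def __init__(self, config=None):
        self.config = config


class MistralForCausalLM(LlamaForCausalLM):
    """Thin subclass: exposes the Llama implementation under the
    architecture name that the Mistral config reports."""


# Hypothetical registry keyed by class name, standing in for whatever
# the dynamic loader discovers when it scans the model modules.
MODEL_REGISTRY = {
    cls.__name__: cls for cls in (LlamaForCausalLM, MistralForCausalLM)
}


def resolve_model_cls(arch_name, registry=MODEL_REGISTRY):
    """Return the class registered under arch_name, or raise ValueError."""
    try:
        return registry[arch_name]
    except KeyError:
        raise ValueError(f"Unsupported architecture: {arch_name!r}") from None


if __name__ == "__main__":
    # With the Mistral class registered, the name-based lookup succeeds.
    print(resolve_model_cls("MistralForCausalLM").__name__)

    # Without it, the same lookup fails, mirroring the reported problem.
    llama_only = {"LlamaForCausalLM": LlamaForCausalLM}
    try:
        resolve_model_cls("MistralForCausalLM", registry=llama_only)
    except ValueError as err:
        print(err)
```

Subclassing rather than aliasing keeps the class name equal to the architecture string in the config, which is what a name-based lookup would match on under these assumptions.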