Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
Note that the issue tracker is NOT the place for general support.
I deployed a model, but encountered the problem of garbled Chinese, what is the reason? such as:
this is my model.json
this is huggingface url: https://huggingface.co/FlagAlpha/Llama2-Chinese-7b-Chat