Open flexwang opened 12 months ago
My understanding is that we have to build a fastAPI wrapper, and during intialized phase we call client = mii.client("mistralai/Mistral-7B-v0.1") and we implement a handler to call client.generate.
client = mii.client("mistralai/Mistral-7B-v0.1")
client.generate
You can use the RESTFul API
Also, I added a PR #317 (still in progress) to implement an OpenAI-compatible RESTful API
My understanding is that we have to build a fastAPI wrapper, and during intialized phase we call
client = mii.client("mistralai/Mistral-7B-v0.1")
and we implement a handler to callclient.generate
.