mlc-ai / web-llm

High-performance In-browser LLM Inference Engine
https://webllm.mlc.ai
Apache License 2.0
12.29k stars 776 forks source link

[Tracking][WebLLM] Function calling (beta) and Embeddings #526

Open CharlieFRuan opened 1 month ago

CharlieFRuan commented 1 month ago

This issue tracks various action items we would like to complete with regard to the features function calling and embeddings.

Function calling (beta)

We are calling it beta because multiple iterations may be needed for function calling. It may be hard to conform different open-source models' function calling formats to OpenAI API. We will try to make each iteration non-breaking.

Embedding, Multi-model Engine, Concurrency

CharlieFRuan commented 1 month ago

Some future TODOs for embeddings: