We plan to use llama-api-server and llama-rag-server as the underlying wasm for a local web server. Therefore, I refactored the Backend code by defining a Trait to support multiple Backend implementations.
Currently, the api-server.wasm cannot be stopped, so the logic in use is still the original setup.
We plan to use llama-api-server and llama-rag-server as the underlying wasm for a local web server. Therefore, I refactored the Backend code by defining a Trait to support multiple Backend implementations.
Currently, the api-server.wasm cannot be stopped, so the logic in use is still the original setup.