-
Please add basic performance stats: prompt processing tokens/s, generation tokens/s behind a key like -vs
Also a mode for debugging LLM API requests (log as json) would be useful behind a key like -v…
-
### What happened?
Since the commit #8006 GGML is now compiled as Dynamic library (vs static library, before).
I can't find any option to reintroduce the previous mode. There is a GGML_STATIC opti…
-
We need a mechanism that recognizes common entries from different data sources with different data quality and combines them.
## Starting situation
We assume that we have already translated data…
-
c.c @fakezeta
**LocalAI version:**
quay.io/go-skynet/local-ai@sha256:4e4e427433285b056f32bfaa313ec0e75aeacb5b5c8c273953f9d2242fb55a60
**Environment, CPU architecture, OS, and Version:**
Linux…
-
Hi,
I'm using Langflow to create a ChatBot based on Mistral 7B, but i can't find any documentation or example of the module "Hugging Face API" on Langflow, and what are the exact values to put in End…
-
As https://github.com/ggerganov/llama.cpp/pull/6829 (great job llama.cpp!) is in, should be possible to extend our grpc server to distribute the workload to workers.
From a quick look the upstream im…
-
**LocalAI version:**
v1.25.0
**Environment, CPU architecture, OS, and Version:**
Linux hostname 5.15.0-78-generic #85-Ubuntu SMP Fri Jul 7 15:25:09 UTC 2023 x86_64 x86_64 x86_64 GNU/Linux
…
-
**LocalAI version:**
v2.12.4-aio-gpu-nvidia-cuda-12
**Environment, CPU architecture, OS, and Version:**
Linux giancubuntu 5.15.0-105-generic #115-Ubuntu SMP Mon Apr 15 09:52:04 UTC 2024 x86_6…
-
### Self Checks
- [X] This is only for bug report, if you would like to ask a quesion, please head to [Discussions](https://github.com/langgenius/dify/discussions/categories/general).
- [X] I have se…
-
### Checklist
- [X] I've searched for similar issues and couldn't find anything matching
- [X] I've included steps to reproduce the behavior
### Affected Components
- [ ] K8sGPT (CLI)
- [X]…