-
I upgraded to the last version of privateGPT and the ingestion speed is much slower than in previous versions. It is so slow to the point of being unusable.
I use the recommended ollama possibility.…
-
- Base repository is forked from: https://github.com/get-convex/turbo-expo-nextjs-clerk-convex-monorepo/
- Web UI is written in typescript, and is React native.
- Backend database is powered by Conv…
-
### Willingness to contribute
Yes. I can contribute this feature independently.
### Proposal Summary
SageMaker JumpStart provides an easy way to deploy LLM endpoints on the SageMaker managed …
-
What tools can I use?
-
Checklist
- [X] Create `gateway/internal/provider/huggingface/huggingface.go` ✓ https://github.com/uniAIDevs/ai/commit/60911bdcddb9e8d725b85d2cf0f8421262fce932 [Edit](https://github.com/uniAIDevs…
-
#### Description:
I am experiencing an unexpected spike in GPU memory usage when loading the `Meta-Llama-3.1-8B-Instruct-AWQ-INT4` model using the vLLM framework. Initially, the GPU memory usage is…
-
### Your current environment
```text
PyTorch version: 2.3.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
OS: Ubuntu 22.04.4 LTS (x86_64)
GCC ve…
-
user group
https://github.com/open-webui/open-webui/issues/1096
- [x] Admin members can create user groups and assign user group specific settings (e.g. which models they have access to, user pe…
tjbck updated
15 hours ago
-
It seems like these conversions will be useful to users, so I think we should integrate some of the existing solutions into the library. @ThibaultLemaire has created bindings from Rust streams to asyn…
-
I have modified the code in these areas of the program
```
const openai = new OpenAI({
apiKey: process.env.OPENAI_API_KEY,
baseURL: 'my api url',
});
/openv0/server/modules/multipass…