-
During the hackathon I just picked a model without knowing too much about an ideal model fit. Think better and change if needed
-
This thread is dedicated to discussing the setup of the webui on AMD GPUs.
You are welcome to ask questions as well as share your experiences, tips, and insights to make the process easier for all…
-
Hi there, impressive work. Tested in on Linux and the VRAM and speeds with higher context is impressive (tested on exllamav2)
I've tried to do the same on Windows for exllamav2, but I have issues w…
-
That's the question, I can't use OpenAI and I would love to run this BabyAGI over the GPU in my local computer with some models like WizardLM or Gpt4-x-Vicuna, both quantized.
Do you plan to make a…
-
Besides the official OpenAI API, there are also other providers like Azure OpenAI. In some regions where the official API is not accessible, users heavily rely on these third-party APIs. Adding suppor…
-
Hi, i'm having difficulties loading this on the current versions of aiogram and oogabooga. I tried installing this as a standalone app and it didn't work because the cmd kept crashing because it wante…
-
cuda: 35tokens/s
triton: 5tokens/s
I used ooba's webui only for cuda, because I've been unable to get triton to work with ooba's webui, I made sure i used the same parameters as in the command for…
-
oobabooga extension support
this sure would make it way easier to adapt to model changes and such
-
### Describe the bug
llama.cpp models always gives exactly the same output (compared in winmerge to be sure), like they ignore any sampling options and seed. Sometimes the first output after loadin…
-
Jarvis is the built-in AI Agent in OpenDAN and serves as the primary operational channel for users to access various AI capabilities. Today, we have released the Jarvis Discord bot. With it, you will…