-
I've spent hours struggling to get all this to work. I would really appreciate any help anyone can offer.
I'm running in a Windows 10 environment.
I've tried running `npx dalai llama install 7B …
-
Running `npx dalai llama install 7B` fails with:
`PS C:\Users\xxxx\dalai\llama\build\Release> [System.Console]::OutputEncoding=[System.Console]::InputEncoding=[System.Text.Encoding]::UTF8; ./quanti…
-
**LocalAI version:**
Latest
**Environment, CPU architecture, OS, and Version:**
EC2
**Describe the bug**
I get a gRPC connection error when running the cuda12 image. But when runni…
-
The instructions didn't say that you need to install Python in Termux. I worked out how to do it after getting the missing python3 message.
But I still have trouble running altaera.
I get the error …
-
### Describe the bug
After updating, I can no longer load any GGUF models, including ones that used to work. They fail with the error "OSError: exception: access violation reading 0x0000000000000000".
### I…
-
I am using a Linux system with a GPU and installed ctransformers-0.2.14 using pip. It all installed fine. But now when I try to run the
GGML model quantized by @TheBloke (TheBloke/upstage-llama-30b-…
-
I found ckpt versions of the Segmind Distilled Diffusion models (https://github.com/segmind/distill-sd, https://huggingface.co/segmind):
https://huggingface.co/ClashSAN/small-sd/resolve/main/smallSDd…
-
GPU acceleration was enabled on the T4 server:
CMAKE_ARGS="-DGGML_CUDA=ON" pip install -U chatglm-cpp
CMAKE_ARGS="-DGGML_CUDA=ON" pip install 'chatglm-cpp[api]'
Starting the quantized model:
MODEL=/home/ops/chatglm/chatglm.cpp/models/chatglm3-q8-0…
-
Hi there, thanks for the great library!! :bow:
Motivated by a pretty specific application (making alpaca results "verifiable" by running on Cartesi, which sits on top of a blockchain), I'm trying to com…
-
We followed the guide to set up IPEX-LLM[CPP] and Intel oneAPI, but when we load the Phi3 model onto the Intel GPU, a "SYCL error" always occurs. We also used main.exe to load the model manually, s…