-
When I try to run the chat example, I get an error.
ruby chat.rb --model /playingwithai/models/llama-2-7b-chat.Q8_0.gguf
llama_model_loader: loaded meta data with 19 key-value pairs and 291…
-
Following the instructions in the README, I get all the way to running the model. Then `./bin/gpt-2 -m models/gpt-2-117M/ggml-model.bin -p "This is an example"` gives me this output:
main: seed = …
-
I've tried several ggml bins (some from TheBloke, some from GPT4All), and it seems like this code only works with models not labeled as 4-bit. Or am I missing something?
-
I have a 3090 GPU; I converted falcon-40b-instruct and quantized it with Q3_K. But when I run the test, prediction is 3x slower than reported, so I checked the GPU and CPU usage, but GPU utilization is …
-
I'm trying to implement batched BERT inference based on the https://github.com/skeskinen/bert.cpp project. I'm running into the following assert error:
https://github.com/ggerganov/ggml/blob/3dd91c…
-
I'm noticing with v0.3.2 my CPU is getting slaughtered. The UI revamp is worse than the previous iteration, with GPU offload now hidden on the "My Models" page, but even with all the layers assigned to the GPU …
-
Is there any way to run it in 4 GB or less of VRAM?
ggml, or gptq?
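As a rough sanity check, the weights of a 7B model at 4-bit quantization should just about fit — a back-of-the-envelope sketch (the 0.5 GB overhead figure is an assumption; actual KV-cache and context costs vary with settings):

```python
def model_vram_gb(n_params_billion, bits_per_weight, overhead_gb=0.5):
    """Rough VRAM estimate: raw weight bytes plus a flat overhead guess.

    overhead_gb is a placeholder for KV cache, activations, and runtime
    buffers; it is not an exact figure.
    """
    weight_bytes = n_params_billion * 1e9 * bits_per_weight / 8
    return weight_bytes / 1024**3 + overhead_gb

# A 7B model at 4 bits per weight (e.g. a ggml q4_0 file):
print(f"{model_vram_gb(7, 4):.2f} GB")  # ~3.76 GB, so 4 GB is tight but plausible
```

By the same arithmetic, 8-bit 7B weights alone are ~6.5 GiB, which is why the 4-bit variants are the ones people run on small cards.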
-
I am trying to execute the following script:
from llama_cpp import Llama
llm = Llama(model_path="~/llama-2-7b.ggmlv3.q8_0.bin", n_gqa=8)
output = llm("Q: Name the planets in the solar sy…
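One thing worth noting: Python does not expand `~` in plain strings, so `model_path="~/..."` is handed to the loader verbatim and the file will not be found. A minimal sketch of a fix, assuming llama-cpp-python is installed (the lazy import is just so the helper is testable without it):

```python
import os

def load_model(path, **kwargs):
    """Expand a leading ~ before handing the path to llama-cpp-python.

    Llama() does not expand ~ itself, so "~/model.bin" fails unless
    the caller expands it first.
    """
    from llama_cpp import Llama  # imported lazily; requires llama-cpp-python
    return Llama(model_path=os.path.expanduser(path), **kwargs)

# Usage (requires the model file to exist at that path):
# llm = load_model("~/llama-2-7b.ggmlv3.q8_0.bin")
```

Also, if I recall correctly, `n_gqa=8` is the grouped-query-attention setting for the 70B llama-2 models; a 7B model should not need it.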
-
Hello!
I built libwhisper.so and libggml.so on Linux with `make libwhisper.so`.
I have a Spring Boot application; I put the native libs into src/main/resources/lib and set the System property …
-
Using the command `$ CC="/opt/rocm/llvm/bin/clang" CXX="/opt/rocm/llvm/bin/clang++" CT_HIPBLAS=1 pip install ctransformers --no-binary ctransformers` I am unable to compile ctransformers for ROCm. I'v…