-
Hi,
I see you have provided an example for Mistral models, which I was able to build successfully. However, when I try to benchmark these models using GPTSessionBenchmark, I get errors like:
`[TensorRT-LLM][ERR…
-
# Prerequisites
Please answer the following questions for yourself before submitting an issue.
- [x] I am running the latest code. Development is very rapid so there are no tagged versions as of…
-
### System Info / 系統信息
transformers: 4.44.0
llama.cpp: latest
Hi, when I try to create a GGUF file I get this error:
```
Traceback (most recent call last):
  File "/home/david/llm/llama.cpp/convert…
```
-
# Expected Behavior
Once I set the necessary environment variable (`export LLAMA_CPP_LIB=/most recent build/libllama.so`), the code should execute without any error.
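For reference, a minimal sketch of pointing llama-cpp-python at a locally built `libllama.so` — note there must be no spaces around `=` in the `export`, and the build path below is hypothetical (substitute your own build directory):

```shell
# Point llama-cpp-python at a locally built shared library.
# /opt/llama.cpp/build/libllama.so is a hypothetical path -- use your own.
export LLAMA_CPP_LIB=/opt/llama.cpp/build/libllama.so

# Verify the variable is visible to child processes (e.g. the Python interpreter).
echo "$LLAMA_CPP_LIB"
```

The variable must be exported in the same shell (or shell profile) that later launches Python, otherwise the bindings will fall back to their bundled library.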
# Current Behavior
I created a…
-
```
Collecting FlashRank==0.2.5 (from -r requirements.txt (line 35))
Using cached FlashRank-0.2.5-py3-none-any.whl.metadata (11 kB)
```
```
INFO: pip is looking at multiple versions of flashra…
-
### What happened?
I converted the CodeLlama-7B-instruction model to GGUF format using llama.cpp, but encountered issues with the model's output when loading the converted GGUF file. The model outputs tex…
-
- [x] Use `llama_decode` instead of deprecated `llama_eval` in `Llama` class
- [ ] Implement batched inference support for `generate` and `create_completion` methods in `Llama` class
- [ ] Add suppo…
-
lava-cli.dir\linkLibs.rsp
C:\w64devkit\bin/ld.exe: C:/w64devkit/bin/../lib/gcc/x86_64-w64-mingw32/13.2.0/../../../../x86_64-w64-mingw32/lib/../lib/libpthread.a(libwinpthread_la-thread.o):thread…
-
```
Starting LOLLMS Web UI...
(ASCII-art "LOLLMS" startup banner, truncated)
-
## Describe the bug
When I run this command:
```bash
cargo run --bin mistralrs-server --release --features "cuda" -- -i gguf -m /external/bradley/llama.cpp/models -f llama-31-70B-Q4-K-M.gguf
`…