-
I get this error when I try to run a query:
Truncation was not explicitly activated but `max_length` is provided a specific value, please use `truncation=True` to explicitly truncate examples to max…
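This warning typically comes from a Hugging Face `transformers` tokenizer (an assumption; the call site isn't shown here): `max_length` was passed without explicitly enabling truncation. The real fix is simply adding `truncation=True` to the tokenizer call that supplies `max_length`. An illustrative pure-Python stand-in for that behavior (not the actual library API):

```python
def tokenize(tokens, max_length=None, truncation=False):
    """Illustrative stand-in for a tokenizer's truncation logic."""
    if max_length is not None and not truncation:
        # This is the situation the warning describes: a length limit
        # was given, but truncation was never explicitly enabled, so
        # the library warns instead of silently clipping.
        print("Truncation was not explicitly activated but "
              "`max_length` is provided a specific value")
        return tokens
    if truncation and max_length is not None:
        return tokens[:max_length]  # clip to the requested length
    return tokens

print(tokenize(list(range(10)), max_length=4, truncation=True))  # → [0, 1, 2, 3]
```

With `truncation=True` the input is clipped to `max_length` and the warning goes away.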
-
~/llama-node/packages/llama-cpp$ node example/mycode.ts
llama.cpp: loading model from /llama-node/packages/llama-cpp/ggml-vic7b-uncensored-q5_1.bin
llama_model_load_internal: format = ggjt v2 (…
-
Can someone help me configure this?
Using Python 3.11
ROCm Version 5.5.1
× Building wheel for llama-cpp-python (pyproject.toml) did not run successfully.
│ exit code: 1
╰─> [55 lines o…
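When the wheel build fails on a ROCm box, the usual cause is that the build isn't pointed at the HIP toolchain. A hedged sketch of the install command (the exact CMake flag name and compiler paths vary between llama-cpp-python versions and ROCm installs, so check the project's README for your release):

```shell
# Assumption: this llama-cpp-python version accepts a hipBLAS toggle via
# CMAKE_ARGS, and ROCm's clang lives under /opt/rocm/llvm/bin.
CMAKE_ARGS="-DLLAMA_HIPBLAS=on" \
CC=/opt/rocm/llvm/bin/clang CXX=/opt/rocm/llvm/bin/clang++ \
  pip install llama-cpp-python --no-cache-dir --verbose
```

`--verbose` keeps the full CMake output visible, which makes the underlying compiler error easier to find than the truncated 55-line summary.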
-
-
# Prerequisites
Please answer the following questions for yourself before submitting an issue.
- [x] I am running the latest code. Development is very rapid so there are no tagged versions as of…
-
# Feature Description
Please provide a detailed written description of what you were trying to do, and what you expected `llama.cpp` to do as an enhancement.
# Motivation
It sounds like it's …
-
Create a struct `ggml_metal_locals` and populate using `GGML_TENSOR_LOCALS` similar to what we do in `ggml.c`:
https://github.com/ggerganov/llama.cpp/blob/3b4bab6a38502d9e68587c2c19f26472480ec4dd/g…
-
I am not able to find much on batching support, but it appears that the downstream llama.cpp supports it.
https://github.com/ggerganov/llama.cpp/issues/4372
Any plans to expose this feature in k…
sirmo updated 5 months ago
-
The current state of the testing framework is pretty bad: we have a few simple test tools in [tests](https://github.com/ggerganov/ggml/tree/master/tests), but these are not maintained properly and ar…