-
see #27
https://ai.google.dev/gemma/docs?hl=en
https://www.kaggle.com/models/google/gemma
Gemma on Vertex AI Model Garden
https://console.cloud.google.com/vertex-ai/publishers/google/model-gard…
-
I'm using a server running Ubuntu 20.04.6 LTS with a V100 GPU. I'm not an admin, so I can't install the CUDA toolkit at the system level. I installed PyTorch (with conda), which bundles its own cudatoolkit. I have no…
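One quick way to confirm that the conda environment really ships its own CUDA runtime (so no system-level install is needed) is to look for `libcudart` inside the environment prefix. This is a minimal stdlib sketch assuming the usual conda `<env>/lib` layout; the helper name is illustrative, not part of any tool:

```python
# Sketch: locate CUDA runtime libraries that conda installed inside the
# active environment. Assumes the conventional <env>/lib layout; not
# specific to any particular PyTorch build.
import glob
import os
import sys


def find_env_cuda_libs(prefix: str = sys.prefix) -> list:
    """Return any libcudart shared objects shipped inside the env."""
    pattern = os.path.join(prefix, "lib", "libcudart.so*")
    return sorted(glob.glob(pattern))


if __name__ == "__main__":
    libs = find_env_cuda_libs()
    print(libs or "no bundled libcudart found in this environment")
```

If the list is non-empty, PyTorch can run CUDA kernels without any system-wide cudatoolkit.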
-
Just about to add these models to the list.
-
The quantized example was compiled using `cargo build --example quantized -r --features metal`.
Unsure of: how many layers are accelerated / how many threads are used / the clearly different sampling stages.
..yet I pres…
-
- [x] Use `llama_decode` instead of deprecated `llama_eval` in `Llama` class
- [ ] Implement batched inference support for `generate` and `create_completion` methods in `Llama` class
- [ ] Add suppo…
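As a sketch of the bookkeeping that batched inference involves, the snippet below flattens several token sequences into a single batch of (token, position, sequence-id) triples, in the spirit of llama.cpp's `llama_batch`. All names here (`Batch`, `build_batch`) are hypothetical and illustrative, not the real `llama-cpp-python` API:

```python
# Illustrative-only sketch of building one decode batch from several
# sequences; Batch and build_batch are hypothetical, not llama.cpp API.
from dataclasses import dataclass, field
from typing import List


@dataclass
class Batch:
    tokens: List[int] = field(default_factory=list)
    positions: List[int] = field(default_factory=list)
    seq_ids: List[int] = field(default_factory=list)


def build_batch(sequences: List[List[int]]) -> Batch:
    """Flatten several prompts into one batch, tagging each token with its
    sequence id and position so a single decode call can serve all of them."""
    batch = Batch()
    for seq_id, tokens in enumerate(sequences):
        for pos, tok in enumerate(tokens):
            batch.tokens.append(tok)
            batch.positions.append(pos)
            batch.seq_ids.append(seq_id)
    return batch


batch = build_batch([[1, 2, 3], [4, 5]])
print(batch.tokens)   # [1, 2, 3, 4, 5]
print(batch.seq_ids)  # [0, 0, 0, 1, 1]
```

The per-token sequence id is what lets the backend keep separate KV-cache entries for each prompt while decoding them in one pass.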
-
## Motivation
As seen on https://github.com/ggerganov/llama.cpp/issues/4216 , one of the important tasks is to refactor / clean up the server code so that it's easier to maintain. However, without a…
-
It does not start with the Llama 3.1 model. Is it possible to make changes so that it works with Llama 3.1? This is now the model with the most tokens, and it will potentially be used everywhere.
-
**LocalAI version:**
```
v1.25.0-cublas-cuda12-ffmpeg
```
**Environment, CPU architecture, OS, and Version:**
```
# uname -a
Linux localai-ix-chart-f8bbbb7c7-x6xx9 6.1.42-production+truen…
-
# Prerequisites
Please answer the following questions for yourself before submitting an issue.
- [x] I am running the latest code. Development is very rapid so there are no tagged versions as of…
-
The first problem is that port 443 is usually reserved. I edited index.js to use port 8080.
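The reason 443 fails for a non-root process is that Linux treats ports below 1024 as privileged (binding them requires root or `CAP_NET_BIND_SERVICE`), while 8080 is unprivileged. A tiny check, with the conventional 1024 threshold hardcoded:

```python
# On Linux, binding a port below 1024 normally requires root (or the
# CAP_NET_BIND_SERVICE capability); that is why 443 fails for a normal
# user while 8080 works.
def is_privileged_port(port: int) -> bool:
    return 0 < port < 1024


print(is_privileged_port(443))   # True  -> needs elevated rights
print(is_privileged_port(8080))  # False -> fine for an unprivileged server
```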
The next problem is that it crashes on the first request:
```
/src/gpt-llama.cpp > npm start
> gpt-llama.cpp@0.1.9 star…