-
Thanks for creating this awesome project.
I was trying to play around with it.
I have a couple of PDFs that I wanted to use (mostly around 300 pages each).
I have a laptop RTX 3080 with 16GB …
-
I am getting this error:
```
llama.cpp: loading model from /Documents/Proj/delta/llama-2-7b-chat/ggml-model-q5_1.bin
error loading model: unrecognized tensor type 14
llama_init_from_file: failed…
-
I followed the instructions in README.md. It built successfully, I guess.
But when I run `wasmedge rag-api-server.wasm -h`, I get the following errors:
```
[2024-05-29 18:44:18.672] [error] instan…
-
I just tried the new large-v3-turbo model on translating a Japanese anime video into English. Instead of English, it gave the subtitles in Japanese, with each subtitle taking a block of 30 seconds in t…
-
### What happened?
I can no longer build llama.cpp with hipBLAS enabled. The following Dockerfile can be used to reproduce the issue:
```
FROM rocm/pytorch
ARG ROCM_TARGET_LST=/root/gfx
RUN…
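```

For comparison, a hipBLAS build that worked on earlier source trees looked roughly like the following. The flag names (`LLAMA_HIPBLAS`, `AMDGPU_TARGETS`) and the `gfx1030` target are assumptions that have changed across releases, so verify them against the current CMakeLists:

```shell
# Sketch only: build llama.cpp with hipBLAS/ROCm support on an older tree.
# Flag names below have been renamed in newer releases; gfx1030 is a placeholder target.
git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp
cmake -B build \
      -DLLAMA_HIPBLAS=ON \
      -DAMDGPU_TARGETS=gfx1030 \
      -DCMAKE_C_COMPILER=/opt/rocm/llvm/bin/clang \
      -DCMAKE_CXX_COMPILER=/opt/rocm/llvm/bin/clang++
cmake --build build --config Release -- -j
```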
-
This is just an idea for you. Most modern smartphones come with some form of AI accelerator. I am aware GGML-based projects like llama.cpp can compile and run on mobile devices, but there is probably …
-
Hi,
I am running llama-cpp-python on a Surface Book 2 with an i7 and an NVIDIA GeForce GTX 1060.
I installed VC++ and CUDA drivers 12.4.
Running on Python 3.11.3.
Compiled llama using the command below on Min…
-
### Description
Hi, I am using the latest version of LLamaSharp, and my model is the Llama-3 70B GGUF version. When GpuLayerCount is between 0 and 5, although it is not very fast, I get the answer, b…
-
[GGUF](https://huggingface.co/docs/hub/en/gguf) is becoming a preferred means of distribution of FLUX fine-tunes.
Transformers recently added general support for GGUF and is slowly adding support …
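For context, GGUF files begin with a small fixed prelude that tools can sniff cheaply before committing to a full load. A minimal sketch of parsing it, assuming GGUF v2+ where the two counts are 64-bit (fields per the GGUF spec: magic, version, tensor count, metadata KV count, all little-endian):

```python
import struct

GGUF_MAGIC = 0x46554747  # the bytes b"GGUF" read as a little-endian uint32

def read_gguf_header(buf: bytes) -> dict:
    """Parse the fixed-size GGUF prelude: magic, version, tensor count, metadata KV count."""
    magic, version = struct.unpack_from("<II", buf, 0)
    if magic != GGUF_MAGIC:
        raise ValueError("not a GGUF file")
    n_tensors, n_kv = struct.unpack_from("<QQ", buf, 8)
    return {"version": version, "n_tensors": n_tensors, "n_kv": n_kv}
```

The variable-length metadata key/value section follows immediately after these 24 bytes; real readers such as the `gguf` Python package parse it in full.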
-
# Prerequisites
Please answer the following questions for yourself before submitting an issue.
- [x] I am running the latest code. Development is very rapid so there are no tagged versions as of…