-
Hi there, I am following the instructions to get CoreML working on an Apple Silicon M1.
After getting everything running and trying to transcribe the jfk sample, I only get a wrong transcription:
```
[00:0…
-
# Prerequisites
Before submitting your question, please ensure the following:
- [x] I am running the latest version of PowerInfer. Development is rapid, and as of now, there are no tagged versio…
-
When running `docker run localagi/gpt4all-cli:main repl`, I am getting this error:
```
Traceback (most recent call last):
File "/cli/app.py", line 118, in <module>
app()
File "/cli/app.p…
fogs · updated 6 months ago
-
I have downloaded Cerebras-GPT-1.3B from Cerebras here: https://huggingface.co/cerebras/Cerebras-GPT-1.3B
After converting the model weights to GGML with
```
python3 ./examples/gpt-2/convert-cerebras-to-gg…
-
I tried running whisper.cpp on a ThinkPad X220 today, and the program crashed with SIGILL.
Is there some inline assembly assuming a newer CPU?
```
% gdb bin/main
[...]
(gdb) run -m model-medium…
-
### What is the issue?
I have some issues compiling the latest Ollama on an ARM NVIDIA Jetson platform; the CUDA version is 11.2 with JetPack 5.1.2.
ggml-quants.c: In function ‘ggml_vec_dot_q4_0_q8_0’:
…
-
- [x] Fix the current runtime errors.
- [x] Fix inference-accuracy issues [the LLaMA-family FP16 accuracy issue is resolved].
- [x] Use a memory pool (buddy system) to manage temporarily used device or host memory. We also need to release memory promptly once it is no longer in use (release on stream synchronization, release when dst is reused; this should be refactored into something more elegant) to avoid OOM. Only a single GPU is supported for now; if memory becomes a problem, consider running synchronously first.
- [ ] Custom operators, us…
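The buddy-system item in the checklist above can be sketched as a minimal host-side pool in C. All names, the block sizes, and the sized `pool_free` interface (caller passes the order back) are illustrative assumptions, not the project's actual allocator:

```c
#include <stddef.h>
#include <string.h>

#define MIN_ORDER 5                       /* smallest block: 32 B */
#define MAX_ORDER 12                      /* whole arena:  4096 B */
#define ARENA_SIZE (1u << MAX_ORDER)

static unsigned char arena[ARENA_SIZE];
static void *free_lists[MAX_ORDER + 1];   /* one free list per order */

static void pool_init(void) {
    memset(free_lists, 0, sizeof free_lists);
    *(void **)arena = NULL;               /* arena starts as one free block */
    free_lists[MAX_ORDER] = arena;
}

/* Allocate a block of 2^order bytes, splitting larger blocks as needed. */
static void *pool_alloc(int order) {
    int o = order;
    while (o <= MAX_ORDER && free_lists[o] == NULL)
        o++;                              /* smallest free block that fits */
    if (o > MAX_ORDER)
        return NULL;                      /* pool exhausted: would OOM     */
    unsigned char *block = free_lists[o];
    free_lists[o] = *(void **)block;      /* pop it from its free list     */
    while (o > order) {                   /* split until it fits exactly   */
        o--;
        unsigned char *buddy = block + (1u << o);
        *(void **)buddy = free_lists[o];
        free_lists[o] = buddy;
    }
    return block;
}

/* Return a block; coalesce with its buddy whenever the buddy is free. */
static void pool_free(void *p, int order) {
    unsigned char *block = p;
    while (order < MAX_ORDER) {
        unsigned char *buddy =
            arena + (((size_t)(block - arena)) ^ (1u << order));
        void **prev = &free_lists[order];
        void *cur = free_lists[order];
        while (cur != NULL && cur != (void *)buddy) {
            prev = (void **)cur;          /* walk this order's free list */
            cur = *(void **)cur;
        }
        if (cur == NULL)
            break;                        /* buddy still in use: stop merging */
        *prev = *(void **)cur;            /* unlink the buddy                 */
        if (buddy < block)
            block = buddy;                /* merged block starts at lower half */
        order++;
    }
    *(void **)block = free_lists[order];
    free_lists[order] = block;
}
```

Freeing on stream synchronization would then amount to queueing `(ptr, order)` pairs per stream and draining the queue with `pool_free` when the sync point is reached.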
-
I've been following along with your speed increases on Whisper using ggml, which have been amazing.
It would be interesting to see how Stable Diffusion runs on CPUs using ggml.
Here are current benchmarks…
-
I would love to see MLIR support. MLIR has a built-in Vulkan runner as well as a SPIR-V CPU runner. It seems like this was up-voted, but I don't see any discussion on why CUDA or OpenCL was added t…
-
Got this while running from the main branch in Podman AI Lab:
```
llama_model_loader: loaded meta data with 25 key-value pairs and 291 tensors from /granite-7b-lab-Q4_K_M.gguf (version GGUF V3 (lates…