-
I've been using unstructured for a while on a 100% CPU machine. I've noticed a lot of NVIDIA files (over 2 GB) in my venv folder coming from PyTorch (possibly one of unstructured's dependencies).
Can I in…
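If the goal is to drop the bundled CUDA libraries, one common workaround (a sketch, not an unstructured-specific fix; `--index-url` points at PyTorch's official CPU-only wheel index) is to reinstall PyTorch from that index:

```shell
# Remove the CUDA-enabled build (the leftover nvidia-* packages may need
# a separate `pip uninstall` as well)
pip uninstall -y torch
# Reinstall from PyTorch's CPU-only wheel index, which has no nvidia-* dependencies
pip install torch --index-url https://download.pytorch.org/whl/cpu
```

This keeps the same `torch` API while shrinking the install by the size of the CUDA runtime wheels.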
-
I have been finetuning a model based on `Meta-Llama-3-8B` using `finetune`. The model was downloaded from the `meta-llama` Hugging Face. I am running macOS on Apple Silicon. I recently updated llama.c…
-
### What happened?
I tried to finetune a llama-like model using `./llama-finetune`.
1. The program works **fine** when I use CPU only.
2. The program causes **segmentation fault** when I use GPU offl…
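Since only the GPU-offload path crashes, a backtrace would help narrow down where the segfault happens. A generic debugging sketch (the arguments after `--args` are placeholders for the actual finetune invocation, which is an assumption here):

```shell
# Run the crashing command under gdb and print a backtrace on the segfault.
# --batch exits after the -ex commands; --args passes everything after it
# to the program unchanged.
gdb --batch -ex run -ex bt --args ./llama-finetune <your-usual-finetune-args>
```

Building with debug symbols first makes the resulting backtrace far more readable.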
-
On the firefly board:
The default operating mode of the CPU is interactive, with a frequency of 408000. The default operating mode of the NPU is rknpu_ondemand, with a frequency of 1000000000. The defaul…
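The governors above can be inspected and switched through sysfs. A sketch, assuming the standard cpufreq layout (the NPU devfreq path varies by board and kernel, so the `fdab0000.npu` node below is an assumption):

```shell
# Current CPU governor and frequency (in kHz) via the standard cpufreq interface
cat /sys/devices/system/cpu/cpu0/cpufreq/scaling_governor
cat /sys/devices/system/cpu/cpu0/cpufreq/scaling_cur_freq
# Pin the CPU to the performance governor for benchmarking
echo performance | sudo tee /sys/devices/system/cpu/cpu0/cpufreq/scaling_governor
# NPU devfreq node (path is board-specific; many rk3588 boards expose fdab0000.npu)
cat /sys/class/devfreq/fdab0000.npu/governor
```

Pinning governors this way rules out frequency scaling as a variable when comparing inference numbers.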
-
Trying to do inference on an Arc GPU machine; I have followed these guidelines:
```
https://github.com/intel-analytics/ipex-llm/tree/main/python/llm/example/GPU/Pipeline-Parallel-Inference
and run_mi…
-
### System Info
- CPU architecture: x86_64
- CPU memory size: 128G
- GPU name: NVIDIA GeForce GTX 1660S
- GPU memory size: 6G
- TensorRT-LLM branch: main
- TensorRT-LLM commit: 9691e12
- Contai…
-
I was trying to migrate from MLC-LLM to onnxruntime to run Phi-3 on an Orange Pi 5, but I realized that among ALL your execution providers there isn't a single one that takes advantage of the GPU or NPU…
-
### System Info
gpu:
```
$ nvidia-smi
Mon Apr 22 17:00:40 2024
+---------------------------------------------------------------------------------------+
| NVIDIA-SMI 535.161.08 …
-
### System Info
- CPU architecture: x86_64
- GPU properties
- GPU name: NVIDIA A100
- GPU memory size: 40G
- Libraries
- TensorRT-LLM branch or tag: main
- TensorRT-LLM commit: 5d8ca2…
-
### What is the issue?
After upgrading from version 0.1.43 to 0.1.45 I get out-of-memory errors. I also tried
Set-ItemProperty -Path 'HKCU:\Environment' -Name 'OLLAMA_SCHED_SPREAD' -Value 1
and
Set-It…