-
Hi, I'm having an issue connecting to external LLMs.
Environment of the server hosting the remote LLM:
- AMD 7950X3D
- 64 GB RAM
- 2x 7900 XTX
- Using LM Studio for hosting the LLM server
Environment Cli…
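For reference, LM Studio serves an OpenAI-compatible HTTP API (port 1234 by default), so a remote client just needs the server's address. A minimal sketch that only builds the request, without sending it — the host, port, and model name here are assumptions, point them at the machine actually running LM Studio:

```python
import json


def build_chat_request(host: str, prompt: str, port: int = 1234) -> tuple[str, bytes]:
    """Return the URL and JSON body for an OpenAI-style chat-completion call."""
    url = f"http://{host}:{port}/v1/chat/completions"
    body = json.dumps({
        "model": "local-model",  # placeholder; LM Studio serves its loaded model
        "messages": [{"role": "user", "content": prompt}],
    }).encode("utf-8")
    return url, body


# Example: address of the hosting machine is hypothetical.
url, body = build_chat_request("192.168.1.10", "Hello")
print(url)
```

Sending `body` to `url` with any HTTP client (e.g. `urllib.request` or `requests`) should return a chat completion if the server is reachable; connection errors at that step point to networking/firewall rather than the model.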
-
### System Info
x86_64
755G
nvidia T4
ubuntu 22.04
trtllm version: https://github.com/NVIDIA/TensorRT-LLM/archive/9691e12bce7ae1c126c435a049eb516eb119486c.zip
pip install tensorrt-llm==0.11…
-
### What happened?
I am using llama-2-7b-chat.Q4_K_M.gguf and trying to run it with llama.cpp,
but I am not getting the actual output. The output is just `#`, not any meaningful string.
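Garbage output like a lone `#` can also come from a corrupted or partially downloaded model file, so it is worth ruling that out before debugging the runtime. Per the GGUF format, a valid file starts with the 4-byte magic `GGUF` followed by a little-endian uint32 version. A small sketch (the dummy header at the end is only for demonstration; run the check against your actual `.gguf` path):

```python
import os
import struct
import tempfile


def check_gguf(path: str) -> int:
    """Return the GGUF version, or raise if the magic bytes are wrong."""
    with open(path, "rb") as f:
        magic = f.read(4)
        if magic != b"GGUF":
            raise ValueError(f"not a GGUF file (magic={magic!r})")
        (version,) = struct.unpack("<I", f.read(4))
    return version


# Demo on a synthetic header; locally, pass llama-2-7b-chat.Q4_K_M.gguf instead.
with tempfile.NamedTemporaryFile(delete=False, suffix=".gguf") as tmp:
    tmp.write(b"GGUF" + struct.pack("<I", 3))
print(check_gguf(tmp.name))  # → 3
os.unlink(tmp.name)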
### Nam…
-
Update: I used to run Ollama on this Chromebook when TinyLlama came out, and it ran great.
### What is the issue?
![image](https://github.com/ollama/ollama/assets/13264408/e37d1a70-8d92-4281-88…
-
### System Info
4*NVIDIA L20
### Who can help?
_No response_
### Information
- [X] The official example scripts
- [ ] My own modified scripts
### Tasks
- [ ] An officially suppor…
-
Hi,
I am unable to import `LlamaCpp` from IPEX-LLM.
Code: `from ipex_llm.langchain.llms import LlamaCpp`
Error:
Cell In[5], [line 1](vscode-notebook-cell:?execution_count=5&line=1)
----> [1](vscode-note…
-
When I run `pip install ipex-llm[cpp]` and then `init-ollama.bat`, it runs on the CPU:
" ... msg="inference compute" id=0 library=cpu compute="" driver=0.0 name="" total="31.6 GiB" ... "
But when "pip install …
-
### System Info
- CPU architecture: amd64
- Operating System: Windows 11
- Python version: 3.11.5
- TensorRT-LLM version: 0.10.0
- CUDA version: 12.5
- torch version: 2.2.0+cu121
### Who can help?
_…
-
### System Info
- CPU architecture: x86_64
- CPU/Host memory size: 32 GB
- GPU name: L4 (g2-standard-8 on GCP)
- GPU memory size: 24 GB
- TensorRT-LLM branch or tag (e.g., main, v0.10.0)
- Nvi…
-
### Your current environment
```text
GPU 0: NVIDIA H100 80GB HBM3
GPU 1: NVIDIA H100 80GB HBM3
GPU 2: NVIDIA H100 80GB HBM3
GPU 3: NVIDIA H100 80GB HBM3
GPU 4: NVIDIA H100 80GB HBM3
GPU 5: NV…