-
During model inference, the model weights are frozen and do not change between iterations. The CPU prefers a special weight layout to accelerate execution, so we need to prepack the model weights before model…
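
A minimal sketch of the prepacking idea, assuming a hypothetical `prepack_weight` helper and block size; real CPU kernels (e.g. oneDNN) choose their own blocked layouts, so this only illustrates the one-time reordering done before inference:

```python
import torch

def prepack_weight(weight: torch.Tensor, block: int = 16) -> torch.Tensor:
    """Reorder a [out_features, in_features] weight into [out_blocks, in_features, block]
    so a GEMM micro-kernel reads a contiguous block of output channels at a time."""
    out_features, in_features = weight.shape
    assert out_features % block == 0, "pad out_features to a multiple of block first"
    packed = (
        weight.reshape(out_features // block, block, in_features)
              .transpose(1, 2)   # -> [out_blocks, in_features, block]
              .contiguous()      # materialize the new layout once, up front
    )
    return packed

# Done once at model load time; every inference step then reuses w_packed.
w = torch.randn(4096, 4096)
w_packed = prepack_weight(w)
```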
-
### Your current environment
```text
Collecting environment information...
WARNING 07-22 09:16:28 _custom_ops.py:14] Failed to import from vllm._C with ModuleNotFoundError("No module named 'vllm._C…
-
### Checked other resources
- [X] I added a very descriptive title to this issue.
- [X] I searched the LangChain documentation with the integrated search.
- [X] I used the GitHub search to find a sim…
-
### Your current environment
Why is it important:
This is a prerequisite to the work on enabling torch.compile on vLLM; we need to be able to build vLLM with nightly so that we can iterate on chan…
-
I found that the latest open-source LLM from Google, Gemma, has two versions of its model structure:
1. https://github.com/google/gemma_pytorch/blob/main/gemma/model_xla.py
2. https://github.com/google/gemma_…
-
### 🚀 The feature, motivation and pitch
[vLLM](https://github.com/vllm-project/vllm) is a high-throughput and memory-efficient inference and serving engine for LLMs. We would like to use `torch.compi…
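
For context, a minimal sketch of what applying `torch.compile` to a model's forward pass looks like; the `TinyBlock` module is a made-up stand-in, and the actual vLLM integration points are not shown here:

```python
import torch

class TinyBlock(torch.nn.Module):
    """Stand-in module; vLLM's real model layers are much more involved."""
    def __init__(self, dim: int = 64):
        super().__init__()
        self.proj = torch.nn.Linear(dim, dim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return torch.nn.functional.gelu(self.proj(x))

model = TinyBlock()
compiled = torch.compile(model)        # traces and fuses the forward graph
out = compiled(torch.randn(8, 64))     # first call compiles; later calls reuse the compiled graph
```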
-
```
D:\MiniCPM\venv\Lib\site-packages\torch\_tensor.py:962: UserWarning: The operator 'aten::pow.Scalar_out' is not currently supported on the ocl backend. Please open an issue at for requesting supp…
-
INFO 02-07 11:14:13 llm_engine.py:70] Initializing an LLM engine with config: model='/root/local_model_root/model/llama-2-7b/modelscope/Llama-2-7b-chat-ms', tokenizer='/root/local_model_root/model/lla…
-
**LocalAI version:**
`local-ai` 2.1.0 (TrueCharts chart Version: 6.6.1)
Cublas Cuda 11 + FFmpeg image
**Environment, CPU architecture, OS, and Version:**
uname -a
```
Linux truenas 6.1.63-prod…
-
### Your current environment
```text
PyTorch version: 2.3.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
OS: Ubuntu 22.04.3 LTS (x86_64)
GCC ve…