-
### Your current environment
Python 3.8
L20 * 4 (GPUs)
vLLM 0.5.4
### Model Input Dumps
_No response_
### 🐛 Describe the bug
$python -m vllm.entrypoints.api_server --model='/mntfn/yanyi/Qwen2-…
-
Hi,
After building vLLM from source, the following error occurs when running multi-GPU inference using a local Ray instance:
```
File "vllm/vllm/model_executor/layers/quantization/awq.py", lin…
-
### Your current environment
The output of `python collect_env.py`.
```text
Collecting environment information...
PyTorch version: 2.4.0+cu121
Is debug build: False
CUDA used to build PyT…
-
### Your current environment
The output of `python collect_env.py`
```text
# For security purposes, please feel free to check the contents of collect_env.py before running it.
python collect_e…
-
Get implementation steps for an Issue Ops command like `/ai` that reads the comments of an issue or PR and replies with an LLM-generated answer, similar to the functionality of ActionAgents.
-
### Your current environment
```text
The output of `python collect_env.py`
```
Collecting environment information...
WARNING 07-08 14:14:25 _custom_ops.py:14] Failed to import from vllm._C with M…
-
We need to create unit tests for each completed ticket. For now we will use [Gtest](https://github.com/google/googletest) to write the unit tests.
Unit tests can run locally and be added to the CI pipeline.
When building in debug mode wil…
-
It would be good to support the Dify API to handle all the LLM Ops and RAG.
From: https://docs.dify.ai/
Dify is an open-source large language model (LLM) application development platform. It combines t…
-
### Description of the bug:
I downloaded the `microsoft/Phi-3.5-mini-instruct` from Hugging Face and ran the [convert_phi3_to_tflite.py](https://github.com/google-ai-edge/ai-edge-torch/blob/main/ai_…
-
### Your current environment
```text
Collecting environment information...
PyTorch version: 2.2.1+cpu
Is debug build: False
CUDA used to build PyTorch: None
ROCM used to build PyTorch: N/A
OS…