-
### My environment setup
1st environment (running on ec2 `g6.4xlarge`)
```
[2024-06-01T10:14:23Z] Collecting environment information...
[2024-06-01T10:14:26Z] PyTorch version: 2.3.0+cu121
[2024-0…
khluu updated 3 weeks ago
-
I ran the first command provided (to sanity-check my setup, since I usually get very large output errors for bigger models like LLMs), and I got an output validation error.
I've made sur…
-
Users are seeking assistance or guidance on how to properly set up and configure the LLM function to run on Mac systems. They may be facing difficulties in installing dependencies, configuring environ…
-
I don't know how my VS Code is configured, but I need to paste the dotenv snippet into every notebook to have my OPENAI key loaded.
```
%pip install python-dotenv
from dotenv import load_dotenv
load_dote…
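For context, what `load_dotenv()` does can be sketched with the standard library alone. This is a simplified stand-in (the real python-dotenv also handles quoting, `export` prefixes, and variable interpolation; the function name here is illustrative, not the library's API):

```python
import os

def load_env_file(path=".env"):
    # Minimal stand-in for python-dotenv's load_dotenv(): read KEY=VALUE
    # lines from a file and export them into os.environ.
    with open(path) as f:
        for raw in f:
            line = raw.strip()
            # Skip blank lines, comments, and lines without an assignment.
            if not line or line.startswith("#") or "=" not in line:
                continue
            key, _, value = line.partition("=")
            # Like load_dotenv's default, don't clobber variables that are
            # already set in the environment.
            os.environ.setdefault(key.strip(), value.strip().strip('"'))
```

With the real library, calling `load_dotenv()` once at the top of each notebook makes `os.getenv("OPENAI_API_KEY")` return the value stored in `.env`.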
-
Recently, initial Mamba support (CPU-only) has been introduced in #5328 by @compilade
In order to support running these models efficiently on the GPU, we seem to be lacking kernel implementations …
-
Hi,
I am new to ggml, but what you have built is really good! Thanks a lot for that.
I was wondering if you could give me some pointers on how to add a custom kernel for the GEMM/Matmul ops in the di…
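For anyone exploring this, it helps to pin down what such a kernel must compute. Below is a pure-Python reference GEMM (an illustration only, not ggml's API or kernel interface) that a custom implementation's output can be checked against:

```python
def matmul_ref(a, b):
    # Naive reference GEMM: C[i][j] = sum_k A[i][k] * B[k][j].
    # a is an m x k matrix, b is k x n, both given as nested lists.
    m, k = len(a), len(a[0])
    k2, n = len(b), len(b[0])
    assert k == k2, "inner dimensions must match"
    c = [[0.0] * n for _ in range(m)]
    for i in range(m):
        for p in range(k):        # i-k-j loop order reuses a[i][p]
            aip = a[i][p]
            for j in range(n):
                c[i][j] += aip * b[p][j]
    return c
```

A custom kernel can then be validated by comparing its output against this reference element-wise, within a small floating-point tolerance.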
-
### Your current environment
The output of `python collect_env.py`
```text
Collecting environment information...
PyTorch version: 2.4.0+cu121
Is debug build: False
CUDA used to build PyTorch…
-
### Your current environment
```text
PyTorch version: 2.3.1+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
OS: Ubuntu 20.04.6 LTS (x86_64)
GCC ve…
-
### Your current environment
```text
# Using pip install vllm
vllm==v0.5.1
```
### 🐛 Describe the bug
```text
# My python script to test long text
def run_Mixtral():
tokenizer = AutoTok…
-
### 🐛 Describe the bug
this code is slightly modified from [async llm engine test](https://github.com/vllm-project/vllm/blob/4cf256ae7f8b0be8f06f6b85821e55d4f5bdaa13/tests/async_engine/test_async_llm_…