-
### Your current environment
H100 (but I believe it happens on any machine)
### 🐛 Describe the bug
```
--enable-chunked-prefill --max-num-batched-tokens 2048 --kv-cache-dtype "fp8"
```
S…
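For context, these flags would typically be passed to vLLM's OpenAI-compatible server; a minimal launch sketch, assuming the `vllm.entrypoints.openai.api_server` entry point and a placeholder model name (the correct flag spelling is `--max-num-batched-tokens`):

```shell
# Sketch: launch vLLM's OpenAI-compatible server with chunked prefill
# and an fp8 KV cache. MODEL is a placeholder, not from the original report.
MODEL=meta-llama/Llama-2-7b-hf
python -m vllm.entrypoints.openai.api_server \
    --model "$MODEL" \
    --enable-chunked-prefill \
    --max-num-batched-tokens 2048 \
    --kv-cache-dtype fp8
```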
-
## Version of crewai
```
crewai==0.28.8
crewai_tools==0.1.6
```
## Code implementation
```
from langchain_groq import ChatGroq
llm = ChatGroq(temperature=0, model_name="llama3-70b-8192")
f…
-
### ⚠️ This issue respects the following points: ⚠️
- [X] This is a **bug**, not a question or a configuration/webserver/proxy issue.
- [ ] This issue is **not** already reported on [Github](https…
-
### Your current environment
```text
Address sizes: 43 bits physical, 48 bits virtual
CPU(s): 128
On-line CPU(s) list: 0-127
Thread(s) per c…
-
For `XX` in [13b, 13b-chat, 30b-v3, 30b-chat-v3]:
Check upon issue creation:
* [x] The model has not been evaluated yet and doesn't show up on the [CoT Leaderboard](https://huggingface.co/space…
-
**Describe the bug**
When attempting to compress the Meta-Llama/Llama-2-13b-chat-hf model to W8A8 using a combination of GPTQ and SmoothQuant algorithms on an NVIDIA A800 GPU with 80GB of VRAM, I enc…
-
This line of code throws an error:
```
File "/data4/kaisi/RETA-LLM/indexer/index_baichuan.py", line 116, in build_model
model = self.llm.llm_engine.workers[0].model
AttributeError: 'Worker' object has no attribute 'mo…
-
Error message:
> Traceback (most recent call last):
> File "/xx/MiniCPM-V/finetune/finetune.py", line 124, in
> train()
> File "/xx/MiniCPM-V/finetune/finetune.py", line 119, in train
> tra…
-
### Your current environment
I don't know how to run it inside Docker.
### 🐛 Describe the bug
Simply run the following command
`docker run --runtime nvidia --gpus all -v ~/.cache/huggingface:/root/.ca…
-
I use a gRPC server with multithreading to run inference, but I get the following error:
File "/usr/local/lib/python3.8/site-packages/vllm/entrypoints/llm.py", line 130, in generate
return self._run_engine(us…
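A likely cause is that an in-process `LLM.generate` call is not safe to invoke concurrently from multiple gRPC handler threads. A common workaround is to serialize access with a lock; a minimal sketch, where `fake_generate` is a stand-in for the real `llm.generate` call:

```python
import threading

# Assumption: the engine must only be entered by one thread at a time,
# so all handler threads funnel through a single lock.
_lock = threading.Lock()

def fake_generate(prompt: str) -> str:
    # Stand-in for llm.generate(prompt); the real engine would go here.
    return f"completion for {prompt!r}"

def generate_serialized(prompt: str) -> str:
    # Only one thread at a time reaches the engine.
    with _lock:
        return fake_generate(prompt)

# Simulate several gRPC handler threads issuing requests concurrently.
results = []
threads = [
    threading.Thread(target=lambda p=p: results.append(generate_serialized(p)))
    for p in ("a", "b", "c", "d")
]
for t in threads:
    t.start()
for t in threads:
    t.join()
```

This trades throughput for safety; a queue-based single consumer thread is an equivalent design with the same effect.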