-
How do I run the ChatGPT plugin locally? Does it work, or is it still being developed?
-
After running the code below, is there an API (maybe something like `llm.terminate`) to kill the LLM and release the GPU memory?
```python
from vllm import LLM, SamplingParams

prompts = [
    "Hello, my name is",
    "The pres…
-
### Prerequisites
- [X] I have searched the [issues](https://github.com/open-compass/opencompass/issues/) and [discussions](https://github.com/open-compass/opencompass/discussions) but did not get the help I expected.
- [X] The bug is present in the [latest version](https://github.com/open-com…
-
### Your current environment
```text
Collecting environment information...
PyTorch version: 2.3.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
…
-
I am facing difficulties specifying GPU usage for different models in an LLM inference pipeline using vLLM. Specifically, I have 4 RTX 4090 GPUs available, and I aim to run an LLM with a size of 42GB …
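A quick back-of-the-envelope check (illustrative arithmetic only, not a vLLM API) of how many 24 GB RTX 4090s are needed just to hold 42 GB of weights, which maps to vLLM's `tensor_parallel_size`:

```python
import math

def min_tensor_parallel(model_gb: float, gpu_gb: float,
                        usable_frac: float = 0.9) -> int:
    """Smallest GPU count whose combined usable memory holds the weights.

    usable_frac leaves headroom for KV cache and activations (an assumed
    rule of thumb, not a vLLM constant).
    """
    return math.ceil(model_gb / (gpu_gb * usable_frac))

# 42 GB of weights on 24 GB cards: each card usably holds ~21.6 GB,
# so at least two GPUs are needed.
print(min_tensor_parallel(42, 24))  # -> 2
```

In vLLM this would translate to constructing the engine with `tensor_parallel_size=2`, restricting each engine process to its own GPUs via `CUDA_VISIBLE_DEVICES`, and leaving the remaining GPUs free for other models.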
-
![image](https://github.com/InternLM/xtuner/assets/145842232/83f12831-573f-4a42-8f19-905e8a5d57e6)
How do I solve this problem? The error is shown above, and the config is attached below.
# Copyri…
-
nproc_per_node=4
CUDA_VISIBLE_DEVICES=0,1,2,3 \
NPROC_PER_NODE=$nproc_per_node \
swift sft \
--model_id_or_path "AI-ModelScope/llava-v1.6-mistral-7b" \
--template_type "llava-mistral-inst…
-
### Reminder
- [X] I have read the README and searched the existing issues.
### System Info
pass
### Reproduction
```
CUDA_VISIBLE_DEVICES="0,1,2,3,4,5,6,7" accelerate launch \
--config_fil…
-
### Your current environment
```text
GPU 0: NVIDIA H100 80GB HBM3
GPU 1: NVIDIA H100 80GB HBM3
GPU 2: NVIDIA H100 80GB HBM3
GPU 3: NVIDIA H100 80GB HBM3
GPU 4: NVIDIA H100 80GB HBM3
GPU 5: NV…
-
### Your current environment
Environment:
torch 2.3.0
vllm 0.5.0.post1
transformers 4.41.2
Main error:
A smaller MoE model, '/data/models/qwen/qwen1.5-2.7Bmoe', runs without issues;
larger ones raise the error shown at the bottom.
Code:
from vllm.engine.arg_ut…