-
### Your current environment
The output of `python collect_env.py`
```text
vLLM version: 0.5.5
NCCL: 2.20.5
GPU: Tesla V100-SXM2-32GB
CUDA version: 12.6
Driver version: 560.28.03
`…
-
### 🚀 The feature, motivation and pitch
**Overview**
The goal of this RFC is to discuss the integration of distributed inference into TorchChat. Distributed inference leverages tensor parallelism …
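To make the tensor-parallel idea concrete, here is a minimal sketch (a NumPy stand-in, not TorchChat's actual implementation) of column-splitting a linear layer's weight across two workers and gathering the partial outputs:

```python
import numpy as np

# Tensor parallelism in miniature: the weight matrix of a linear layer is
# split column-wise across "devices"; each device computes a partial output
# from the same input, and concatenating the shards gives the full result.
rng = np.random.default_rng(0)
x = rng.standard_normal((4, 8))    # batch of activations
W = rng.standard_normal((8, 16))   # full weight matrix

# Shard the weight across two workers (column parallelism).
W0, W1 = np.split(W, 2, axis=1)

# Each worker multiplies the same input by its own shard.
y0 = x @ W0
y1 = x @ W1

# Gather: the concatenated shards match the unsharded computation.
y = np.concatenate([y0, y1], axis=1)
assert np.allclose(y, x @ W)
```

In a real deployment each shard lives on a separate GPU and the gather happens over NCCL, but the arithmetic is exactly this.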
-
I wonder if I might be doing something wrong, but it appears that all of my model evaluations are running serially when I believe they should be running in parallel.
If I define my `allocation…
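For comparison, a minimal sketch of dispatching independent evaluations in parallel rather than serially, using Python's `concurrent.futures` (the `evaluate` function and its inputs are hypothetical stand-ins, not the reporter's actual code):

```python
from concurrent.futures import ThreadPoolExecutor

def evaluate(config):
    # Hypothetical stand-in for a single model evaluation.
    return config * config

configs = [1, 2, 3, 4]

# map fans the calls out across the pool and returns results in input order.
with ThreadPoolExecutor(max_workers=4) as pool:
    results = list(pool.map(evaluate, configs))
# results == [1, 4, 9, 16]
```

For CPU-bound evaluations, `ProcessPoolExecutor` has the same interface and sidesteps the GIL.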
-
### Your current environment
Name: vllm
Version: 0.6.3.post2.dev171+g890ca360
### Model Input Dumps
_No response_
### 🐛 Describe the bug
I used the interface from this vllm repository …
-
The Bedrock SDK client cannot accept the `tool_choice` param for `disable_parallel_tool_use`. I am on the latest Anthropic SDK 0.32.1 and Bedrock package 0.11.2.
### Current Behavior:
Supplying the field retur…
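For context, a sketch of the request shape involved (the tool definition is a made-up example; `disable_parallel_tool_use` is the field documented in the Anthropic Messages API under `tool_choice`):

```python
# Request payload exercising the failing field. Per the Anthropic Messages
# API, disable_parallel_tool_use is nested inside tool_choice; passed as
# client.messages.create(**request) on an AnthropicBedrock client, this is
# the call shape that fails.
request = {
    "model": "anthropic.claude-3-5-sonnet-20240620-v1:0",
    "max_tokens": 256,
    "tools": [{
        "name": "get_weather",  # hypothetical tool for illustration
        "description": "Get the current weather for a city.",
        "input_schema": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    }],
    "tool_choice": {"type": "auto", "disable_parallel_tool_use": True},
    "messages": [{"role": "user", "content": "What is the weather in Paris?"}],
}
```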
-
## Background
*relevant information and motivation for this task*
See #159.
See https://aaltoscicomp.github.io/python-for-scicomp/parallel/ and let us know if it's useful.
## Task
Compute…
-
### Your current environment
The output of `python collect_env.py`
```text
PyTorch version: 2.4.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A…
-
### System Info
Sagemaker Docker images:
```shell
763104351884.dkr.ecr.us-east-2.amazonaws.com/huggingface-pytorch-training-neuronx:1.13.1-transformers4.36.2-neuronx-py310-sdk2.18.0-ubuntu20.04
…
-
### Model Series
Qwen2.5
### What are the models used?
Qwen2.5-72B-Instruct
### What is the scenario where the problem happened?
vllm
### Is this a known issue?
- [X] I have fo…
-
Hi akash-aky, first of all, thank you for creating `Exile`; it's an amazing library! I recently ran into some problems using it to execute this application in `parallel`. Here are my debug results:
`…