-
Does DeepSpeed support hybrid parallelism, e.g. `data_parallel + pipeline_parallel + tensor_parallel`?
Can you show me an example of how to use these parallelisms together?
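DeepSpeed's pipeline engine does accept a three-axis (pipeline x model x data) topology. Below is a minimal sketch, not a complete training script: it assumes 8 GPUs launched with the `deepspeed` launcher, the toy `Linear` stack and config values are illustrative, and the model-parallel (`num_mp`) axis only takes real effect when the layers themselves are Megatron-style tensor-parallel modules.
```python
import torch
import deepspeed
from deepspeed.pipe import PipelineModule
from deepspeed.runtime.pipe.topology import PipeModelDataParallelTopology

deepspeed.init_distributed()

# 2 pipeline stages x 2 model(tensor)-parallel ranks x 2 data-parallel
# replicas = 8 GPUs. Plain nn.Linear layers are simply replicated along the
# model axis; real tensor parallelism needs Megatron-style layers.
topo = PipeModelDataParallelTopology(num_pp=2, num_mp=2, num_dp=2)

# Toy layer stack; a real model would use LayerSpec / tensor-parallel blocks.
model = PipelineModule(
    layers=[torch.nn.Linear(512, 512) for _ in range(8)],
    topology=topo,
    loss_fn=torch.nn.MSELoss(),
)

engine, _, _, _ = deepspeed.initialize(
    model=model,
    model_parameters=model.parameters(),
    config={
        "train_batch_size": 16,  # = micro_batch (2) x grad_accum (4) x dp (2)
        "train_micro_batch_size_per_gpu": 2,
        "optimizer": {"type": "Adam", "params": {"lr": 1e-3}},
    },
)
# engine.train_batch(data_iter=...) then drives one pipelined training step.
```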
-
Hi @shawntan, great work on Scatter MoE. As newer models scale up in the number of parameters they use, I wanted to ask a question about what you put in the README: *does not include any additional …
-
Is there any way to perform tensor parallelism across multiple nodes instead of just within a single node? Any tips would be helpful!
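Nothing in PyTorch's `DeviceMesh`/DTensor machinery restricts a tensor-parallel group to a single node; TP is just communication-heavy, so it is usually kept on fast intra-node links. A minimal sketch, assuming PyTorch >= 2.2 and a `torchrun --nnodes=2 --nproc_per_node=8` launch (the mesh size and the toy `Linear` are placeholders):
```python
import torch
from torch.distributed.device_mesh import init_device_mesh
from torch.distributed.tensor.parallel import ColwiseParallel, parallelize_module

# One flat 16-way tensor-parallel mesh that spans both nodes (2 x 8 GPUs).
mesh = init_device_mesh("cuda", (16,), mesh_dim_names=("tp",))

# Shard a toy layer column-wise across all 16 ranks, cross-node included.
layer = parallelize_module(torch.nn.Linear(1024, 1024), mesh, ColwiseParallel())
```
Expect the all-gather/all-reduce traffic to cross the inter-node interconnect, so throughput will depend heavily on your network fabric.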
-
Scaling models requires that they be trained in data-parallel, pipeline-parallel, or tensor-parallel regimes. The last two, both forms of "model parallelism", require a single model to be shared across GPUs. Thi…
-
Arraymancer has become a key piece of the Nim ecosystem. Unfortunately, I do not have the time to develop it further, for several reasons:
- family: birth of a family member, death of hobby time.
- competin…
-
### 🐛 Describe the bug
```python
import os
os.environ['NCCL_DEBUG'] = 'WARN'  # surface NCCL warnings in the logs

import torch
from torch import nn
from torch import distributed as dist
from torch.distributed.device_mesh import …
-
I don't see a backward-pass speedup using NATTEN, even with a kernel size only half the input size when calling na3d(). I'm not sure if that's expected. Could anyone help clarify or confirm? Thanks!
-
### Your current environment
```text
PyTorch version: 2.1.2+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
OS: Ubuntu 20.04.6 LTS (x86_64)
GCC ve…
-
### Question
Say I have a cluster with 8 GPUs but only 12 GB of VRAM each; can I still train LLaVA?
It seems that DeepSpeed can do various kinds of model parallelism (tensor parallelism, pipelining, etc.).
I won…
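For fitting a model onto many small cards, DeepSpeed's usual answer is ZeRO-3 sharding (optionally with CPU offload) rather than tensor or pipeline parallelism, since it partitions parameters, gradients, and optimizer state across all 8 GPUs. A hedged sketch of such a config; the batch sizes and offload choices are illustrative, not tuned for LLaVA:
```python
# Typically written to ds_config.json and passed via --deepspeed ds_config.json.
ds_config = {
    "train_micro_batch_size_per_gpu": 1,
    "gradient_accumulation_steps": 16,
    "bf16": {"enabled": True},
    "zero_optimization": {
        "stage": 3,                             # shard params/grads/optimizer state
        "offload_param": {"device": "cpu"},     # spill parameters to host RAM
        "offload_optimizer": {"device": "cpu"}, # spill optimizer state to host RAM
    },
}
```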
-
Hi, I want to run one LLM across multiple machines.
Within one node, I want to use tensor parallelism for speedup.
Across multiple nodes, I want to use pipeline parallelism.
Is this supported? If s…
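This intra-node TP / inter-node PP layout is exactly what a 2-D process-group grid expresses, and frameworks such as Megatron-LM and vLLM (`--tensor-parallel-size` together with `--pipeline-parallel-size`) build equivalent groupings internally. A minimal sketch of the grouping itself with PyTorch's `DeviceMesh`, assuming 2 nodes x 8 GPUs under `torchrun`:
```python
import torch.distributed as dist
from torch.distributed.device_mesh import init_device_mesh

# Shape (2, 8): the outer "pp" axis crosses nodes, the inner "tp" axis stays
# within a node (torchrun assigns ranks 0-7 to node 0 and 8-15 to node 1).
mesh = init_device_mesh("cuda", (2, 8), mesh_dim_names=("pp", "tp"))

tp_group = mesh.get_group("tp")  # 8 intra-node ranks: tensor parallelism
pp_group = mesh.get_group("pp")  # 2 cross-node ranks: pipeline parallelism
print(dist.get_rank(), dist.get_rank(tp_group), dist.get_rank(pp_group))
```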