bytedance / byteps

A high performance and generic framework for distributed DNN training

Is model parallelism supported for PyTorch? #382

Open liaopeiyuan opened 3 years ago

liaopeiyuan commented 3 years ago

If I write my own multi-GPU model or use torch.distributed.pipeline.sync.Pipe, would multi-node training still work with BytePS?
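For concreteness, here is a minimal sketch of the second setup the question describes: an nn.Sequential split across two local GPUs and wrapped in torch.distributed.pipeline.sync.Pipe (available in PyTorch 1.8+). The device placement, layer sizes, and chunk count are illustrative assumptions, not part of the question.

```python
# Illustrative pipeline-parallel setup with Pipe (assumes two local GPUs).
import os
import torch
import torch.nn as nn
from torch.distributed import rpc
from torch.distributed.pipeline.sync import Pipe

# Pipe requires the RPC framework to be initialized, even in a single process.
os.environ.setdefault("MASTER_ADDR", "localhost")
os.environ.setdefault("MASTER_PORT", "29500")
rpc.init_rpc("worker", rank=0, world_size=1)

# Stage 1 on GPU 0, stage 2 on GPU 1; Pipe infers devices from the parameters.
stage1 = nn.Sequential(nn.Linear(1024, 4096), nn.ReLU()).to("cuda:0")
stage2 = nn.Sequential(nn.Linear(4096, 10)).to("cuda:1")
model = Pipe(nn.Sequential(stage1, stage2), chunks=8)

x = torch.randn(64, 1024, device="cuda:0")
out = model(x).local_value()  # forward returns an RRef; fetch the local tensor
```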

ymjiang commented 3 years ago

We are working on supporting model parallelism. For now, you can still use BytePS to optimize the allreduce primitive in your code.
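To make the suggested workaround concrete, below is a hedged sketch of BytePS's Horovod-style PyTorch integration: initialize BytePS, broadcast initial state from rank 0, and wrap your optimizer in bps.DistributedOptimizer so gradients are synchronized via push_pull (BytePS's allreduce) during step(). The stand-in Linear model and learning rate are placeholders for your own code.

```python
# Sketch of using BytePS's allreduce-style primitives in PyTorch.
# The Linear model and hyperparameters are illustrative assumptions.
import torch
import byteps.torch as bps

bps.init()
torch.cuda.set_device(bps.local_rank())

model = torch.nn.Linear(1024, 10).cuda()  # stand-in for your own model
optimizer = torch.optim.SGD(model.parameters(), lr=0.01 * bps.size())

# Broadcast initial parameters and optimizer state from rank 0
# so all workers start from the same point.
bps.broadcast_parameters(model.state_dict(), root_rank=0)
bps.broadcast_optimizer_state(optimizer, root_rank=0)

# DistributedOptimizer averages gradients across workers via push_pull
# (BytePS's allreduce) when optimizer.step() is called.
optimizer = bps.DistributedOptimizer(
    optimizer, named_parameters=model.named_parameters())

# For finer control, individual tensors can also be reduced directly,
# e.g. averaged = bps.push_pull(tensor, average=True, name="my_tensor")
```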