cisco-open / pymultiworld

A framework for PyTorch to enable fault management for collective communication libraries (CCL) such as NCCL
Apache License 2.0
15 stars 4 forks source link

refactor: update collective operations' signatures #31

Closed myungjin closed 2 months ago

myungjin commented 2 months ago

Description

To make collective operations' signatures consistent with torch's, our signatures are restructured.

Type of Change

Checklist