cisco-open / pymultiworld

A framework for PyTorch to enable fault management for collective communication libraries (CCL) such as NCCL
Apache License 2.0
16 stars, 4 forks

fix: error handle in initial configuration #89

Closed myungjin closed 2 months ago

myungjin commented 2 months ago

Description

When the post-installation script runs during installation, the initialization check file (init.txt) does not exist yet, so the script raises an exception and the installation fails. This change catches the error and suppresses it, which is safe because the error occurs only during installation.
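The pattern described above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: the helper name `read_init_marker` and the use of a temporary directory are assumptions made for the example, and only the file name init.txt comes from the description.

```python
import tempfile
from pathlib import Path
from typing import Optional


def read_init_marker(marker_path: Path) -> Optional[str]:
    """Hypothetical helper: return the marker's contents, or None if it is missing."""
    try:
        return marker_path.read_text()
    except FileNotFoundError:
        # Assumed to be expected during installation, when the marker
        # file has not been created yet, so the error is suppressed.
        return None


# Demonstrate both cases in a throwaway directory.
with tempfile.TemporaryDirectory() as tmp:
    marker = Path(tmp) / "init.txt"
    missing = read_init_marker(marker)   # file absent: no exception raised
    marker.write_text("initialized")
    present = read_init_marker(marker)   # file present: contents returned
```

Catching only `FileNotFoundError`, rather than a bare `except`, keeps unrelated failures such as permission errors visible.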

Type of Change

Checklist