bytedance / byteps

A high performance and generic framework for distributed DNN training
Other
3.62k stars 487 forks source link

ps-lite: disable ucx error handling by default #424

Closed pleasantrabbit closed 2 years ago

pleasantrabbit commented 2 years ago

disable ucx signal handlers so that some faulty user code can still run even if some child process of the program encounters a segfault.

Signed-off-by: Yulu Jia yulu.jia@bytedance.com