intel / torch-ccl

oneCCL Bindings for Pytorch*
BSD 3-Clause "New" or "Revised" License
86 stars 25 forks source link

Improve simple demo for multi-nodes with README and minor changes #63

Closed louie-tsai closed 4 months ago

louie-tsai commented 6 months ago

Improve the demo for multi-nodes run with README for multi-nodes instructions over ethernet. also changed the fixed master IP in demo.py for multi-nodes run

aice-support commented 6 months ago

@chengjunlu @zhuhong61 @liangan1 Could you help to review and merge if it looks to you?

jingxu10 commented 5 months ago

verified its functionality?

louie-tsai commented 5 months ago

verified its functionality?

verified with ipex llm dockerfile and https://github.com/intel/ai-containers/pull/61

jingxu10 commented 5 months ago

Looks good to me. @chengjunlu @zhuhong61 @liangan1 Could you take a review?