aws / aws-ofi-nccl

This is a plugin which lets EC2 developers use libfabric as network provider while running NCCL applications.
Apache License 2.0
147 stars 56 forks source link

feat: Region-based tuner support for P5en #704

Closed arunkarthik-akkart closed 1 day ago

arunkarthik-akkart commented 1 week ago

This patch has the below changes for the region based tuner for p5en

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

sunkuamzn commented 5 days ago

bot:aws:retest

a-szegel commented 2 days ago

bot:aws:retest ... infra issues, cluster disk space full

a-szegel commented 2 days ago

bot:aws:retest