aws-samples / awsome-distributed-training

Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.
MIT No Attribution
134 stars 57 forks source link

Efa node exporter eks #350

Closed awsankur closed 3 weeks ago

awsankur commented 3 weeks ago

Issue #, if available:

Description of changes:

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.