aws-samples / awsome-distributed-training

Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.
MIT No Attribution
134 stars 57 forks source link

Eks examples - FSDP example and documentation added #362

Open iankouls-aws opened 1 week ago

iankouls-aws commented 1 week ago

Issue #, if available:

Description of changes:

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.