instructlab / training

InstructLab Training Library - Efficient Fine-Tuning with Message-Format Data
https://pypi.org/project/instructlab-training/
Apache License 2.0
21 stars 45 forks source link

[FSDP] [EPIC] Determine how well FSDP works on Intel Gaudi and AMD MI #202

Open RobotSail opened 2 months ago

RobotSail commented 2 months ago

We want to verify that FSDP works in the following scenarios:

JamesKunstle commented 1 month ago

We have validated that FSDP works fine on AMD cards. Haven't gotten anything to run correctly on Gaudi cards.