huggingface / accelerate

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
https://huggingface.co/docs/accelerate
Apache License 2.0
7.78k stars 943 forks source link

Accelerate - Vertex AI integration #1459

Open xenia-kra opened 1 year ago

xenia-kra commented 1 year ago

I can see that the "AWS sagemaker" option exists when selecting an environment for the training. What about Vertex AI - is it supported for the distributed scenario? How should I trigger the process - is it done through the notebook? How should the config look like?

github-actions[bot] commented 1 year ago

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.