Intel® Gaudi® AI Accelerator Examples for Training and Inference
Model List and Performance Data
Please visit this page for performance information.
This repository is a collection of models that have been ported to run on the Intel Gaudi AI accelerator. They are intended as examples and are reasonably optimized for performance while remaining easy to read.
Computer Vision
Natural Language Processing
Audio
| Models | Framework | Validated on Gaudi | Validated on Gaudi 2 |
| --- | --- | --- | --- |
| Wav2Vec2ForCTC | PyTorch | Inference | Inference |
Generative Models
MLPerf™ Training 4.0
| Models | Framework | Validated on Gaudi | Validated on Gaudi 2 |
| --- | --- | --- | --- |
| GPT3 | PyTorch | - | Training |
| Llama 70B LoRA | PyTorch | - | Training |
MLPerf™ Inference 4.0
MLPerf™ is a trademark and service mark of MLCommons Association in the United States and other countries. All rights reserved. Unauthorized use is strictly prohibited.
Reporting Bugs/Feature Requests
We welcome you to use the GitHub issue tracker to report bugs or suggest features.
When filing an issue, please check existing open, or recently closed, issues to make sure somebody else hasn't already
reported the issue. Please try to include as much information as you can. Details like these are incredibly useful:
- A reproducible test case or series of steps
- The version of our code being used
- Any modifications you've made relevant to the bug
- Anything unusual about your environment or deployment
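
One way to gather the environment details listed above is a small script whose output can be pasted into an issue. This is only a minimal sketch, not a tool shipped with this repository: the function name `collect_environment_report` is hypothetical, and the default package list (`torch` only) is an assumption that you would likely extend with whatever packages your setup uses.

```python
import platform
import sys
from importlib import metadata


def collect_environment_report(packages=("torch",)):
    """Return a dict of environment details that may help when filing an issue.

    `packages` is an illustrative default; pass the distribution names
    that are relevant to your own setup.
    """
    report = {
        "python": sys.version.split()[0],
        "platform": platform.platform(),
    }
    for name in packages:
        try:
            # Look up the installed version of each distribution by name.
            report[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            report[name] = "not installed"
    return report


if __name__ == "__main__":
    for key, value in collect_environment_report().items():
        print(f"{key}: {value}")
```

Running the script prints one `key: value` line per item. Add the version of this repository's code you are using and any local modifications by hand, since the script does not detect them.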
Community
Hugging Face
Megatron-DeepSpeed
DeepSpeed-Chat
Fairseq