-
### Priority
Undecided
### OS type
Ubuntu
### Hardware type
Gaudi2
### Installation method
- [X] Pull docker images from hub.docker.com
- [ ] Build docker images from source
### Deploy method
…
-
### System Info
```shell
The k8s sample example for LoRA works fine with the Llama 3 8B model, but with the 70B model it fails with an out-of-memory (OOM) error.
Total number of GPUs: 8 x Gaudi3 GPUs
Dataset: databr…
```
-
**Feature Overview (aka. Goal Summary)**
Implement Intel Gaudi support in the InstructLab project, so that Gaudi 2 and Gaudi 3 can be used for SDG, evaluation, and training.
**Goals (aka. expected user out…
ktam3 updated 3 weeks ago
-
### System Info
```shell
vault.habana.ai/gaudi-docker/1.17.0/ubuntu22.04/habanalabs/pytorch-installer-2.3.1:latest
```
### Information
- [X] The official example scripts
- [ ] My own modified scri…
-
### 📚 The doc issue
The link to [Intel Gaudi Software Stack Verification](https://docs.habana.ai/en/latest/Installation_Guide/SW_Verification.html#platform-upgrade) is broken in the Requirements and …
-
- The Gaudi reference installation guide is only linked, with no version specified, which results in compatibility issues.
- Consider using an operator to provision and manage Gaudi drivers of clus…
-
### System Info
```shell
HL-SMI Version: hl-1.17.0-fw-51.3.0
Driver Version: 1.17.0-28a11ca
Docker image: vault.habana.ai/gaudi-docker/1.17.0/ubuntu22.04/habanalabs/pytorch-installer-2.3.…
```
-
### System Info
Optimum Habana: 1.10.4
Synapse: 1.14.0
Dockerfile:
```
FROM vault.habana.ai/gaudi-docker/1.14.0/ubuntu22.04/habanalabs/pytorch-installer-2.1.1:latest
# Installs pdsh and upg…
```
-
### Your current environment
The output of `python collect_env.py`
```text
Collecting environment information...
/usr/lib64/python3.11/inspect.py:389: FutureWarning: `torch.distributed.reduce_…
```
-
### System Info
```shell
Docker image: pytorch-installer-2.3.1:1.17.0-417
optimum-habana: main branch
```
### Information
- [ ] The official example scripts
- [X] My own modified script…