-
### Priority
P3-Medium
### OS type
Ubuntu
### Hardware type
Xeon-ICX
### Installation method
- [ ] Pull docker images from hub.docker.com
- [ ] Build docker images from source
…
-
### System Info
```shell
Google Colab (CPU runtime)
```
### Information
- [X] The official example scripts
- [ ] My own modified scripts
### Tasks
- [X] An officially supported task in the `exam…
-
```
cd /root/workspace/github/optimum-habana/examples/text-generation/
python run_generation.py \
--model_name_or_path /root/workspace/model/meta-llama/Llama-3.1-8B/ \
--use_hpu_graphs \
--use_kv…
-
### 🚀 The feature, motivation and pitch
```
INFO 08-26 07:31:47 habana_model_runner.py:1192] [Warmup][Prompt][1/56] batch_size:64 seq_len:1024 free_mem:13.93 GiB
INFO 08-26 07:32:25 habana_model_…
-
### System Info
```shell
Image: vault.habana.ai/gaudi-docker/1.17.1/ubuntu22.04/habanalabs/pytorch-installer-2.3.1:latest
hardware: Habana Labs Gaudi HL205 Mezzanine Card with HL-2000 AI Training …
-
### Your current environment
### Environment Details
Running in a Kubernetes environment with Habana Gaudi2 accelerators:
- **Hardware**: Habana Gaudi2 accelerators
- **Deployment**: Kubernetes …
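As a quick sanity check in this kind of deployment, the Gaudi2 cards should show up as allocatable resources on the worker nodes; the `habana.ai/gaudi` resource name below is an assumption based on the standard Habana Kubernetes device plugin:
```shell
# Hypothetical check that the Habana device plugin is advertising Gaudi2 devices
# on each node (the habana.ai/gaudi resource name is assumed, not taken from this issue).
kubectl describe nodes | grep -i "habana.ai/gaudi"
```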
-
### Your current environment
Offline inference of Llama-3-8B with benchmark_latency.py, swept over 1, 2, and 4 cards, gives the following results:
The optimum-habana results, for comparison:
The results show that on 1 card…
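For reference, a sweep like this can be reproduced with a loop over tensor-parallel sizes; the flag names below follow the upstream vLLM `benchmarks/benchmark_latency.py` script, and the model name, sequence lengths, and iteration count are placeholders, not the exact settings used for the numbers above:
```shell
# Sketch of the 1/2/4-card latency sweep (placeholder model and lengths;
# flag names taken from the upstream vLLM benchmark_latency.py script).
for tp in 1 2 4; do
  python benchmarks/benchmark_latency.py \
    --model meta-llama/Meta-Llama-3-8B \
    --tensor-parallel-size ${tp} \
    --input-len 128 \
    --output-len 128 \
    --batch-size 1 \
    --num-iters 10
done
```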
-
### System Info
Optimum Habana: 1.10.4
Synapse: 1.14.0
Dockerfile:
```
FROM vault.habana.ai/gaudi-docker/1.14.0/ubuntu22.04/habanalabs/pytorch-installer-2.1.1:latest
# Installs pdsh and upg…
-
### System Info
```shell
HL-SMI Version: hl-1.17.0-fw-51.3.0
Driver Version: 1.17.0-28a11ca
Docker image: vault.habana.ai/gaudi-docker/1.17.0/ubuntu22.04/habanalabs/pytorch-installer-2.3.…
-
### System Info
```shell
optimum 1.21.4
optimum-habana 1.14.0.dev0
transformers 4.45.2
+------------------------------------------------------------------…