-
### System Info
```shell
+-----------------------------------------------------------------------------+
| HL-SMI Version: hl-1.17.0-fw-51.1.0 |
| Driver Ver…
```
-
Launch command:
`python api_v2.py -a 0.0.0.0 -p 7860 -c GPT_SoVITS/configs/tts_infer.yaml`
tts_infer.yaml:
```yaml
custom:
  bert_base_path: GPT_SoVITS/pretrained_models/chinese-roberta-wwm-ext-large
  cnh…
```
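Once the server above is running, it can be queried over HTTP. A minimal sketch of building such a request follows; the endpoint path (`/tts`) and parameter names (`text`, `text_lang`) are assumptions about `api_v2.py`'s interface and may differ in your version:

```python
import urllib.parse

# Hypothetical request against the server launched above on port 7860.
# "/tts", "text", and "text_lang" are assumed names, not confirmed API.
base_url = "http://127.0.0.1:7860"
query = urllib.parse.urlencode({"text": "hello world", "text_lang": "en"})
request_url = f"{base_url}/tts?{query}"
print(request_url)
```

The actual routes and parameters are defined in `api_v2.py`, so check that file for the authoritative interface.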
-
Integrate Intel/Habana HPU kernel support for uint4 inference on Habana HPUs. This was merged into AutoGPTQ (https://github.com/AutoGPTQ/AutoGPTQ/pull/689/files), but there are no CI tests and we have no …
-
**Feature Overview (aka. Goal Summary)**
Implement Intel Gaudi support in the InstructLab project, so that Gaudi 2 and Gaudi 3 can be used for SDG, evaluation, and training.
**Goals (aka. expected user out…
-
I want to use model.predict in a loop.
It keeps printing this:
```text
GPU available: True (cuda), used: True
TPU available: False, using: 0 TPU cores
IPU available: False, using: 0 IPUs
HPU available:…
```
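These accelerator-availability lines are emitted by Lightning's logger at INFO level, so one common workaround is to raise that logger's threshold before entering the predict loop. A minimal sketch, assuming the logger names below match your installed release (older versions log under `pytorch_lightning`, newer ones under `lightning.pytorch`):

```python
import logging

# Silence the "GPU available / TPU available / ..." INFO lines by raising
# the threshold on Lightning's loggers. The logger names are assumptions
# that depend on the installed Lightning version.
for name in ("pytorch_lightning", "lightning.pytorch"):
    logging.getLogger(name).setLevel(logging.WARNING)
```

Run this once before constructing the `Trainer`; it does not affect your own application logging.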
-
### System Info
```shell
HL-SMI Version: hl-1.17.0-fw-51.3.0
Driver Version: 1.17.0-28a11ca
Docker image: vault.habana.ai/gaudi-docker/1.17.0/ubuntu22.04/habanalabs/pytorch-installer-2.3.…
```
-
### Your current environment
```text
PyTorch version: 2.2.0a0+git8964477
Is debug build: False
CUDA used to build PyTorch: None
ROCM used to build PyTorch: N/A
OS: Ubuntu 22.04.4 LTS (x86_64)
…
```
-
### Your current environment
```text
Collecting environment information...
WARNING 09-11 21:43:04 _custom_ops.py:14] Failed to import from vllm._C with ModuleNotFoundError("No module named 'vllm.…
```
-
### Feature request
Current Whisper inference works well with a specified language. However, it does not support passing `language=None`, which should trigger automatic language detection. A `RuntimeError` i…
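The requested behavior can be sketched as a simple fallback: when no language is given, detect one instead of raising. `detect_language` and `run_inference` below are hypothetical callables standing in for the model's real detection and decoding steps, not the library's actual API:

```python
# Hedged sketch of the requested fallback behavior. The two callables are
# hypothetical placeholders for the model's detection and decoding steps.
def transcribe(audio, language, detect_language, run_inference):
    if language is None:
        language = detect_language(audio)  # auto-detect when unspecified
    return run_inference(audio, language)
```

With this shape, `language=None` never reaches the decoder; it is always resolved to a concrete language first.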
-
### Anything you want to discuss about vllm.
While implementing disaggregated prefill, we found an error when loading weights from safetensors files. We have filed a JIRA ticket [(HS-3164)](https…