-
### 🐛 Describe the bug
When using compile mode, i am getting graph breaks due with following errors
"[rank0]:I0702 10:23:29.307000 139872906987520 torch/_dynamo/variables/higher_order_ops.py:468] …
-
### Description
Hello Ray maintainers and community,
we've been using Ray for our works and find it to be a valuable tool for scalable and distributed machine learning. I believe it would be ben…
-
### Model description
https://github.com/huggingface/text-generation-inference/pull/1709
Since the TGI has done the LlaVa support. Would like to know if there is any timeline for the LlaVa support o…
-
- Here is my code which raise an error
```
att_map = self.sigmoid(x_spatial) # bs*1*7*7
x_combine = x_down4 * att_map + x_down4 # bs*512*7*7
x = self.bn2(x_combine) # bs*512*7*7
```
`Runtim…
-
-
### Name of Feature or Improvement
I'd like to change from a hardcoding of `nvidia.com/gpu` to instead having a dict or something of resources. There are other accelerators and it'd be nice to spec…
-
T2V is planned to enable inferencing LLMs like Stable Diffusion on CPU/GPU, and training on Habana Gaudi/DG2, as well as improving the Generated Video quality, like more realistic frame, and coherency…
-
I ran the infernce of Falcon-7b and neural-chat-7b-v3-1 models on ray server with below command
python inference/serve.py --config_file inference/models/neural-chat-7b-v3-1.yaml --simple
python infe…
-
### System Info
```shell
optimum-habana==1.10.4
docker: vault.habana.ai/gaudi-docker/1.14.0/ubuntu22.04/habanalabs/pytorch-installer-2.1.1:latest
```
### Information
- [ ] The official example sc…
-
### System Info
```shell
optimum-habana 1.8.0
docker vault.habana.ai/gaudi-docker/1.12.0/ubuntu22.04/habanalabs/pytorch-installer-2.0.1:latest
Synapse Version 1.12.0-480
https://github…