-
**server:** inf2.8xlarge
**vllm version**: 0.6.3.post2.dev77+g2394962d.neuron215
_Description_
Hello! I am trying to run the code below (the code was taken [here](https://docs.vllm.ai/en/v0.4.1/…
-
### System Info
```shell
System Configuration: Single node Habana Gaudi setup
Firmware Version: hl-1.15.0-fw-48.2.1.1
Software Stack: Synapse AI 1.15
```
### Information
- [ ] The official examp…
-
## 🐛 Bug
When running UVA training on an edge-centric dataset and sampling on a subset of indices, these lines require 5x the size of the indices tensor worth of GPU memory, causing OOMs with m…
-
### Checklist
- [X] 1. I have searched related issues but cannot get the expected help.
- [ ] 2. The bug has not been fixed in the latest version.
- [ ] 3. Please note that if the bug-related issue y…
-
Hi
I have a heap dump file that I'm trying to convert into the PerfView gcdump format.
The heap dump file is 3.6 GB.
The error message is as follows:
> Sampled Graph node count 252ÿ151 (re…
-
The profiler seems to consume 838MB of memory with a profile that has 6000 tracks. I can't upload the profile as it's from a corp workstation, but I'll try to recreate one at home later (I believe any…
-
### Your current environment
The output of `python collect_env.py`
```text
PyTorch version: 2.3.1+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N…
-
Thanks for the book. When running this notebook, the program fails on the training step:
`TypeError: An op outside of the function building code is being passed
a "Graph" tensor. It is possible to h…
-
Hello, I found that in the definition of `conv_2d`, src/util.py, [kernel_initializer](https://github.com/VXallset/deep-high-resolution-net.TensorFlow/blob/d885abc6f8699f5dfd09b270170f3c68fbf32ac2/src…
-
Hi Fan-Yun,
Thanks for sharing your code. Do you plan to release the code with mini-batch training and negative sampling for large graphs? Thank you.