-
1. The assignment of `g_total_cuda_cores` in libvgpu/src/multiprocess/multiprocess_utilization_watcher.c is `g_total_cuda_cores = g_max_thread_per_sm * g_sm_num * FACTOR;`. Why is `FACTOR` set direc…
-
Allow more than one scheduler to run without problems under the current `kue` concurrency setup.
-
Work on the distribution of the example. Need to enable configuration of the scheduler.
- Put in some safety checks (RAM vs. chunk size vs. number of processes vs. number of cores) -> Post a warning messa…
-
In a CI job (link may be bad): https://buildomat.eng.oxide.computer/wg/0/artefact/01J9KZDBT7Q76BMAZ3NFT2EM6G/JHuMk44VV03fuovquSYDTdL9uCEruARhbYRD7WhRUTif7Lam/01J9KZDV6QXGVBQG0ZHA6JNXTH/01J9M3TRMJHBD7KTG…
-
**Version**
`1.29.1`
**Platform**
`4.14.322-244.536.amzn2.x86_64`
**Description**
We are observing a panic in `StateCell::mark_pending`
https://github.com/tokio-rs/tokio/blob/a6be73eecbb2646…
-
### Describe the bug
The program process crashes when the window is closed on macOS.
### Reproduction
_No response_
### Expected behavior
_No response_
### macOS core dump
```
{"app_name":"clas…
```
-
### 🚀 The feature, motivation and pitch
I'm trying to write an inductor lowering for `torch._cslt_sparse_mm`, an aten op that takes two optional tensor kwargs, `bias` and `alpha`.
https://git…
-
## Packages
Scylla version: `6.2.0-20241013.b8a9fd4e49e8` with build-id `a61f658b0408ba10663812f7a3b4d6aea7714fac`
Kernel Version: `6.8.0-1016-aws`
Scylla Manager Agent 3.3.3-0.20240912.924034e0d
…
-
### Your current environment
vllm 0.6.0
### 🐛 Describe the bug
When I try this:
vllm serve neuralmagic/Meta-Llama-3.1-70B-Instruct-FP8 --host 0.0.0.0 --port 8000 --tensor-parallel-size 8 --se…
-
### Describe the bug
ConnectionError: Tried to launch distributed communication on port 29401, but another process is utilizing it. Please specify a different port (such as using the --main_process_p…
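For anyone hitting the error above, a minimal sketch of how one might check whether a port is already taken and ask the OS for an unused one. The helper names `port_in_use` and `find_free_port` are hypothetical and not part of any launcher or library; the sketch uses only Python's standard `socket` module and assumes the processes communicate over the loopback interface.

```python
import socket


def port_in_use(port: int, host: str = "127.0.0.1") -> bool:
    """Return True if something is already listening on (host, port).

    Hypothetical helper for illustration only.
    """
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as s:
        # connect_ex returns 0 when the connection succeeds,
        # i.e. when another process is listening on that port.
        return s.connect_ex((host, port)) == 0


def find_free_port(host: str = "127.0.0.1") -> int:
    """Ask the OS for a TCP port that is unused right now.

    Hypothetical helper for illustration only.
    """
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as s:
        s.bind((host, 0))  # port 0 lets the OS choose a free port
        return s.getsockname()[1]


if __name__ == "__main__":
    print("29401 in use:", port_in_use(29401))
    print("suggested free port:", find_free_port())
```

The port returned by `find_free_port` can then be passed through the option named in the error message. Note that another process could claim the port between the check and the launch, so this narrows the problem rather than ruling it out.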