-
### Describe the problem the feature is intended to solve
I have several models loaded and am not sure how to tell whether TensorFlow still has any memory left. I can check using `nvidia-smi` how much me…
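For reference, the device-level numbers that `nvidia-smi` reports can be read programmatically. The sketch below is an illustration only: the helper names `parse_gpu_memory` and `query_gpu_memory` are made up here, and it assumes every queried field comes back as a plain integer in MiB (some devices report `[N/A]`, which this does not handle). It shows memory as seen by the driver; since TensorFlow by default reserves most of a GPU's memory up front, these figures do not reveal how much of that reservation is actually in use.

```python
import subprocess

# nvidia-smi query for per-GPU index, used memory and free memory (MiB, no header, no units).
QUERY = [
    "nvidia-smi",
    "--query-gpu=index,memory.used,memory.free",
    "--format=csv,noheader,nounits",
]


def parse_gpu_memory(csv_text):
    """Parse 'index, used, free' rows (MiB) into a list of dicts, one per GPU."""
    rows = []
    for line in csv_text.strip().splitlines():
        # Assumption: all three fields are integers; '[N/A]' would raise ValueError.
        index, used, free = (int(field.strip()) for field in line.split(","))
        rows.append({"gpu": index, "used_mib": used, "free_mib": free})
    return rows


def query_gpu_memory():
    """Run nvidia-smi and return the parsed rows (requires an NVIDIA driver)."""
    result = subprocess.run(QUERY, capture_output=True, text=True, check=True)
    return parse_gpu_memory(result.stdout)
```

Calling `query_gpu_memory()` on a machine with an NVIDIA driver returns one dict per GPU; `parse_gpu_memory` can also be fed captured output directly.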
-
We observed `distributed/diagnostics/tests/test_nvml.py::test_gpu_monitoring_recent` fail in [this gpuCI build](https://gpuci.gpuopenanalytics.com/job/dask/job/distributed/job/prb/job/distributed-prb/…
-
### Ask your question
I am running dcgm-exporter on k8s, installed via the Helm chart with default values.
The cluster has 1 master and 1 worker; only the worker has a GPU exposed as a resource.
Running a simple query:
`DCGM_…
-
To start: this is not a bug unique to librehardwaremonitor. All indications point to an API bug. I am reporting it here in the hope that the issue can be pushed upstream. Yes, I have contacted n…
ghost updated
2 years ago
-
**Is your feature request related to a problem? Please describe.**
On laptops with Nvidia discrete GPUs that have been set up to enter the d3cold power state when idle, running btop will cause the discret…
-
- there is a different acceptable vtrust range for every subnet
- monitoring of weight setting attempts of a validator
- memory usage
- swap usage
- disk space
- CPU/loadavg usage
- GPU usage
- VRAM u…
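As a rough illustration of how a few of the host-level items in this list could be sampled, here is a minimal sketch using only the Python standard library. It covers load average and disk space only; the function name `host_snapshot` and the returned keys are hypothetical, and memory, swap, GPU and VRAM figures would need other sources that are not shown.

```python
import os
import shutil


def host_snapshot(path="/"):
    """Return load average and disk usage for the filesystem containing `path`."""
    try:
        # 1-, 5- and 15-minute load averages (Unix only).
        load_1m, load_5m, load_15m = os.getloadavg()
    except OSError:
        # The load average could not be obtained on this system.
        load_1m = load_5m = load_15m = None

    disk = shutil.disk_usage(path)
    return {
        "load_1m": load_1m,
        "load_5m": load_5m,
        "load_15m": load_15m,
        "disk_free_bytes": disk.free,
        "disk_used_fraction": disk.used / disk.total,
    }
```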
-
**Describe the bug**
I compiled btop with GPU support, but the GPU stats aren't visible because btop can't find libnvidia-ml.so.
The reason …
vsey updated
8 months ago
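One way to check whether the dynamic loader can resolve the NVML library at all is to try loading it by name. This is only a diagnostic sketch, not btop's own lookup logic: the function name `loadable_libraries` and the list of candidate names are assumptions chosen for illustration.

```python
import ctypes

# Assumed candidate names: the unversioned name and the versioned soname.
CANDIDATES = ("libnvidia-ml.so", "libnvidia-ml.so.1")


def loadable_libraries(names=CANDIDATES):
    """Return the subset of `names` that the dynamic loader can open."""
    found = []
    for name in names:
        try:
            ctypes.CDLL(name)
        except OSError:
            # The loader could not find or open a library under this name.
            continue
        found.append(name)
    return found
```

An empty result from `loadable_libraries()` means neither name is on the loader's search path; a result containing only one of the names shows which spelling a program would need to request.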
-
Hi Kozlek,
Thanks for your great work. I've been using HWsensors & HWMonitor on my hack for quite a while. While HWMonitor.app works great on my Mac Pro, I didn't have the nerve to try to install compl…
ghost updated
10 years ago
-
### Describe the issue
024-09-06 15:00:41.0156984 [E:onnxruntime:Default, provider_bridge_ort.cc:1992 onnxruntime::TryGetProviderInfo_CUDA] D:\a\_work\1\s\onnxruntime\core\session\provider_bridge_o…
-
My machine recognizes the XPU and can run the samples, but it keeps crashing with the error:
2024-09-07 23:53:11.032822: I tensorflow/core/common_runtime/next_pluggable_device/next_pluggable_de…