-
THis is for discussion of [RFD 150](https://github.com/joyent/rfd/tree/master/rfd/0150) to operationalize Prometheus and Grafana in Triton and Manta.
-
With the introduction of https://github.com/elastic/logstash/pull/12307, we protect the Logstash process from crashing due to an error in a pipeline or pipeline worker.
The consequence of this is t…
-
Greetings.I installed v1.0.0 in Splunk Enterprise v9.0.0 today, but I'm afraid I can't get past this issue.
My config in /etc/apps/modinput_prometheus/local/inputs.conf is
```
[prometheus://kfk-a…
-
### Relevant telegraf.conf
```toml
# Configuration for telegraf agent
[agent]
## Default data collection interval for all inputs
interval = "5m"
## Rounds collection interval to 'interval'
…
-
```
python3 scripts/launch_triton_server.py --model_repo=/tensorrt_llm_backend/tensorrtllm_backend/triton_model_repo --world_size=1
root@ts-6ef92b20444c49e5b8ac415dd78856ff-launcher:/tensorrt_llm_b…
-
Team,
Since the day I have updated the AKS to v1.25.2, I can see huge spikes and node memory pressure issues.
Pods are going in evicted state and nodes are always consuming 135 to 140% of memor…
-
### Bug description
In loop mode, the `timeout` and `autodetection_retry` seem to do nothing.
`nvidia-smi` can sometimes take more than 15 seconds on some of our servers (even with persistence m…
-
Tracking updates of help.netflix.com
-
### Bug description
The plugin dosn´t add new metrics to netdata and seems crashing.
```sh
cd /usr/libexec/netdata/plugins.d/
sudo -u netdata -s
./go.d.plugin -d -m sensors
# Printed Logs:
…
-
**Describe the bug**
A clear and concise description of what the bug is.
**To Reproduce**
Steps to reproduce the behavior:
1. Go to '...'
2. Click on '....'
3. Scroll down to '....'
4. See er…