-
When PCR starts, 8 workers in the destination cluster are allocated for each node in the source cluster. Each worker can consume a few hundred MiB of unaccounted memory. There a few scenarios where th…
-
Hi,
I have a multi node setup with multiple GPU. I was able to get the cluster but I don't see the remaining GPU's from each nodes. How do I do that. Also observed below error while using llama…
-
HEAD: dc535978b9f5d97cf5c5d3ffa1448197d88294b5
## Description
The code that generates tokens reads existing token ring information as follows
```
- name: Get existing tokens
block:
- name: Get…
-
## Overview
We need a loose, multi-cluster diagram for a geo-distributed 10k user architecture as the first step in executing a 10k user reference architecture. This diagram would not go directly into…
-
This issues been filed to examine how best to support the `inference-service-test` plugin in ES|QL mixed version testing.
The ES|QL CSV and REST tests run with a variety of modes (see `x-pack/plugin/…
-
How to use RayVecEnv in cluster? I want to run my rl code using multi-nodes training, I'm new to ray, is there some demos scripts?
-
_This issue was originally opened by @mmccord-mdbuyline as hashicorp/terraform#23660. It was migrated here as a result of the [provider split](https://www.hashicorp.com/blog/upcoming-provider-changes-…
ghost updated
2 months ago
-
**What I'd like:**
NVIDIA time-slicing landed (see #2347) in [Bottlerocket 1.25](https://github.com/bottlerocket-os/bottlerocket/blob/develop/CHANGELOG.md#v1250-2024-10-15). While a step forwar…
-
### What problem are you trying to solve?
We have a clustered service, which at boot time requires all the cluster nodes to establish connections with each other prior to the service (or any of the P…
-
### Problem Description
If I add a NodeFeature to a module that is not in the list [here](https://github.com/elastic/elasticsearch/blob/main/x-pack/plugin/ml/qa/native-multi-node-tests/src/javaRestTe…