-
I'm using Nomad to deploy a cluster of rethinkdb containers. I tried several different Docker images and Nomad configs but each container comes up as a leader and never joins the cluster/leader. The l…
-
#### [FEA] API to write dask dataframes to local storage of each node in multi-node cluster
### Example requested API:
```python
df.to_parquet(xxx, write_locally_per_node=True)
```
**Pleas…
-
Add [`ClusterEnvironment`](https://lightning.ai/docs/pytorch/stable/api_references.html#environments) to support distributed training on Vertex AI.
In particular, it seems `os.environ["WORLD_SIZE"]…
-
### Describe the feature
Failures on 6.2.0
Java:
```
glide.SharedCommandTests.hrandfieldBinary(BaseClient)
glide.SharedCommandTests.zrangestore_by_lex(BaseClient)
glide.SharedCommandTests.…
-
### Terraform Core Version
1.7.5
### AWS Provider Version
5.44.0
### Affected Resource(s)
aws_elasticache_replication_group
### Expected Behavior
When setting availability_zones to a different …
-
When PCR starts, 8 workers in the destination cluster are allocated for each node in the source cluster. Each worker can consume a few hundred MiB of unaccounted memory. There a few scenarios where th…
-
HEAD: dc535978b9f5d97cf5c5d3ffa1448197d88294b5
## Description
The code that generates tokens reads existing token ring information as follows
```
- name: Get existing tokens
block:
- name: Get…
-
Hi,
I have a multi node setup with multiple GPU. I was able to get the cluster but I don't see the remaining GPU's from each nodes. How do I do that. Also observed below error while using llama…
-
## Overview
We need a loose, multi-cluster diagram for a geo-distributed 10k user architecture as the first step in executing a 10k user reference architecture. This diagram would not go directly into…
-
This issues been filed to examine how best to support the `inference-service-test` plugin in ES|QL mixed version testing.
The ES|QL CSV and REST tests run with a variety of modes (see `x-pack/plugin/…