-
MicroProfile Fault Tolerance allows users to easily apply strategies for mitigating failures in their code. It provides annotations which you can add to methods to use bulkhead, circuit breake…
-
### What you would like to be added?
Since @andreyvelich commented:
> Unfortunately, we don't have good docs right now about our ElasticPolicy: [https://github.com/kubeflow/training-operator/bl…
-
### Is your feature request related to a problem?
There doesn't seem to be a way to configure retries in the Java client. This feature is present in the [.NET client](https://opensearch.org/docs/la…
-
At present, the update scripts operate directly on their target locations, meaning that if something goes wrong, the server is broken until manual intervention.
This is not ideal. The update scripts…
-
# General Requirements
- [ ] compile-time enabled
- [ ] simulating failures
# Functionality
Register callbacks for
- [ ] communicator revoked
- [ ] failure
- [ ] both
-
I would like to report an issue on page https://opensource.docs.scylladb.com/branch-5.4/architecture/architecture-fault-tolerance
### Problem
This page is essential and at the core of the database…
-
## Description
MicroProfile Fault Tolerance update to work with MicroProfile Telemetry Metrics as well as MicroProfile Metrics
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -…
-
How to properly test fault tolerance of ZFS and what utilities do you use for this?
I want to test disk-failure / power failure tolerance of ZFS.
I tried standard "faulty" tool to emulate disk IO …
-
The Fault Tolerance 4.1 feature file needs to depend on the correct versions of other features. We noted when reviewing #28188 that it didn't, but wanted to get that PR merged in.
-
Now, the elastic scheduling in DLRover ElasticJob is suitable for asynchronous SGD of recommendation model training but not sync SGD. In a sync SGD job, the training cannot start is the number of node…