Open kentan88 opened 9 months ago
Having the same problem with OpenSearch K8s operator and I have to restart fluentd daemon set to fix the problem every time.
@kentan88
Have you tried setting reload_on_failure
to true
?
I saw this option in README, I will test it and I think this will resolve the issue :)
setting reload_on_failure
to true
did not fixed the problem
livenessProbe:
httpGet: null
initialDelaySeconds: 5
periodSeconds: 10
timeoutSeconds: 5
exec:
command:
- bash
- -c
- >
set -ex;
curl -s http://localhost:24231/metrics
| grep -E "fluentd_output_status_retry_wait|fluentd_output_status_num_errors|fluentd_output_status_retry_count"
| grep -Ev "# HELP|# TYPE"
| grep -v "0.0"
| wc -l | grep 0
I have added these values into the daemonset helm chart it should restart containers when retry or error happens
(Do not forget to install curl
in your docker image)
Steps to replicate
Provide example config and message Dockerfile
fluent.conf
Expected Behavior or What you need to ask
I'm running a local Docker which uses fluent/fluentd:v1.15-debian-1 as the base image. When I ran the container, i'm getting the following message:
I can confirm that the AWS credentials and AWS OpenSearch Serverless endpoint are correct and also reachable as I was able to send data using a ruby OpenSearch client.
Any help would be much appreciated. ...
Using Fluentd and OpenSearch plugin versions