xenit-eu / nuntio

A bridge between Docker containers and the Consul service catalog.
2 stars 0 forks source link

Failures during anti-entropy run when other nuntio instance is stopped #6

Open vierbergenlars opened 2 years ago

vierbergenlars commented 2 years ago

When there are multiple nuntio instances running in parallel on the same host (connected to the same consul node).

When one instance is stopped and it unregisters all health checks (default of nuntio.engine.shutdownMode), the other instance encounters problems when trying to update consul checks, resulting in checks not being active anymore until nuntio is restarted.

2022-02-23 12:55:31.182 ERROR 1 --- [pool-1-thread-1] e.xenit.nuntio.engine.AntiEntropyDaemon  : Exception during anti-entropy run

com.ecwid.consul.v1.OperationException: OperationException(statusCode=500, statusMessage='Internal Server Error', statusContent='CheckID "eu.xenit.nuntio.platform.docker.DockerContainerIdServiceIdentifier-fb363cdc8efdbbc416a102fd57991e184f0a2b69fae0aac68c636dfcadb082fe-0.0.0.0-tcp-80-heartbeat" does not have associated TTL')
    at com.ecwid.consul.v1.agent.AgentConsulClient.agentCheckPass(AgentConsulClient.java:211) ~[consul-api-1.4.5.jar!/:na]
    at com.ecwid.consul.v1.ConsulClient.agentCheckPass(ConsulClient.java:270) ~[consul-api-1.4.5.jar!/:na]
    at eu.xenit.nuntio.registry.consul.ConsulRegistry.lambda$updateCheck$9(ConsulRegistry.java:169) ~[nuntio-registry-consul-0.1.0.jar!/:na]
    at eu.xenit.nuntio.engine.metrics.RegistryOperationMetrics.lambda$record$0(RegistryOperationMetrics.java:27) ~[nuntio-engine-0.1.0.jar!/:na]
    at io.micrometer.core.instrument.AbstractTimer.record(AbstractTimer.java:149) ~[micrometer-core-1.7.3.jar!/:1.7.3]
    at eu.xenit.nuntio.engine.metrics.RegistryOperationMetrics.record(RegistryOperationMetrics.java:35) ~[nuntio-engine-0.1.0.jar!/:na]
    at eu.xenit.nuntio.engine.metrics.RegistryOperationMetrics.record(RegistryOperationMetrics.java:26) ~[nuntio-engine-0.1.0.jar!/:na]
    at eu.xenit.nuntio.engine.metrics.RegistryMetricsImpl.updateCheck(RegistryMetricsImpl.java:44) ~[nuntio-engine-0.1.0.jar!/:na]
    at eu.xenit.nuntio.registry.consul.ConsulRegistry.updateCheck(ConsulRegistry.java:156) ~[nuntio-registry-consul-0.1.0.jar!/:na]
    at eu.xenit.nuntio.engine.diff.DiffResolver.updateServiceChecks(DiffResolver.java:85) ~[nuntio-engine-0.1.0.jar!/:na]
    at eu.xenit.nuntio.engine.diff.DiffResolver.lambda$accept$1(DiffResolver.java:35) ~[nuntio-engine-0.1.0.jar!/:na]
    at java.base/java.util.Optional.ifPresent(Unknown Source) ~[na:na]
    at eu.xenit.nuntio.engine.diff.DiffResolver.accept(DiffResolver.java:34) ~[nuntio-engine-0.1.0.jar!/:na]
    at eu.xenit.nuntio.engine.diff.DiffResolver.accept(DiffResolver.java:19) ~[nuntio-engine-0.1.0.jar!/:na]
    at java.base/java.util.stream.ForEachOps$ForEachOp$OfRef.accept(Unknown Source) ~[na:na]
    at java.base/java.util.stream.ReferencePipeline$11$1.accept(Unknown Source) ~[na:na]
    at java.base/java.util.stream.ReferencePipeline$11$1.accept(Unknown Source) ~[na:na]
    at java.base/java.util.stream.SpinedBuffer$1Splitr.forEachRemaining(Unknown Source) ~[na:na]
    at java.base/java.util.stream.AbstractPipeline.copyInto(Unknown Source) ~[na:na]
    at java.base/java.util.stream.AbstractPipeline.wrapAndCopyInto(Unknown Source) ~[na:na]
    at java.base/java.util.stream.ForEachOps$ForEachOp.evaluateSequential(Unknown Source) ~[na:na]
    at java.base/java.util.stream.ForEachOps$ForEachOp$OfRef.evaluateSequential(Unknown Source) ~[na:na]
    at java.base/java.util.stream.AbstractPipeline.evaluate(Unknown Source) ~[na:na]
    at java.base/java.util.stream.ReferencePipeline.forEach(Unknown Source) ~[na:na]
    at eu.xenit.nuntio.engine.AntiEntropyDaemon.runAntiEntropy(AntiEntropyDaemon.java:52) ~[nuntio-engine-0.1.0.jar!/:na]
    at eu.xenit.nuntio.engine.AntiEntropyDaemon.lambda$configureTasks$3(AntiEntropyDaemon.java:70) ~[nuntio-engine-0.1.0.jar!/:na]
    at org.springframework.scheduling.support.DelegatingErrorHandlingRunnable.run(DelegatingErrorHandlingRunnable.java:54) ~[spring-context-5.3.9.jar!/:5.3.9]
    at org.springframework.scheduling.concurrent.ReschedulingRunnable.run(ReschedulingRunnable.java:95) ~[spring-context-5.3.9.jar!/:5.3.9]
    at java.base/java.util.concurrent.Executors$RunnableAdapter.call(Unknown Source) ~[na:na]
    at java.base/java.util.concurrent.FutureTask.run(Unknown Source) ~[na:na]
    at java.base/java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(Unknown Source) ~[na:na]
    at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source) ~[na:na]
    at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source) ~[na:na]
    at java.base/java.lang.Thread.run(Unknown Source) ~[na:na]