Skip to content
This repository was archived by the owner on Nov 15, 2024. It is now read-only.

Failures during anti-entropy run when other nuntio instance is stopped #6

Open
vierbergenlars opened this issue Feb 23, 2022 · 0 comments
Labels
bug Something isn't working

Comments

@vierbergenlars
Copy link
Member

When there are multiple nuntio instances running in parallel on the same host (connected to the same consul node).

When one instance is stopped and it unregisters all health checks (default of nuntio.engine.shutdownMode), the other instance encounters problems when trying to update consul checks, resulting in checks not being active anymore until nuntio is restarted.

2022-02-23 12:55:31.182 ERROR 1 --- [pool-1-thread-1] e.xenit.nuntio.engine.AntiEntropyDaemon  : Exception during anti-entropy run

com.ecwid.consul.v1.OperationException: OperationException(statusCode=500, statusMessage='Internal Server Error', statusContent='CheckID "eu.xenit.nuntio.platform.docker.DockerContainerIdServiceIdentifier-fb363cdc8efdbbc416a102fd57991e184f0a2b69fae0aac68c636dfcadb082fe-0.0.0.0-tcp-80-heartbeat" does not have associated TTL')
	at com.ecwid.consul.v1.agent.AgentConsulClient.agentCheckPass(AgentConsulClient.java:211) ~[consul-api-1.4.5.jar!/:na]
	at com.ecwid.consul.v1.ConsulClient.agentCheckPass(ConsulClient.java:270) ~[consul-api-1.4.5.jar!/:na]
	at eu.xenit.nuntio.registry.consul.ConsulRegistry.lambda$updateCheck$9(ConsulRegistry.java:169) ~[nuntio-registry-consul-0.1.0.jar!/:na]
	at eu.xenit.nuntio.engine.metrics.RegistryOperationMetrics.lambda$record$0(RegistryOperationMetrics.java:27) ~[nuntio-engine-0.1.0.jar!/:na]
	at io.micrometer.core.instrument.AbstractTimer.record(AbstractTimer.java:149) ~[micrometer-core-1.7.3.jar!/:1.7.3]
	at eu.xenit.nuntio.engine.metrics.RegistryOperationMetrics.record(RegistryOperationMetrics.java:35) ~[nuntio-engine-0.1.0.jar!/:na]
	at eu.xenit.nuntio.engine.metrics.RegistryOperationMetrics.record(RegistryOperationMetrics.java:26) ~[nuntio-engine-0.1.0.jar!/:na]
	at eu.xenit.nuntio.engine.metrics.RegistryMetricsImpl.updateCheck(RegistryMetricsImpl.java:44) ~[nuntio-engine-0.1.0.jar!/:na]
	at eu.xenit.nuntio.registry.consul.ConsulRegistry.updateCheck(ConsulRegistry.java:156) ~[nuntio-registry-consul-0.1.0.jar!/:na]
	at eu.xenit.nuntio.engine.diff.DiffResolver.updateServiceChecks(DiffResolver.java:85) ~[nuntio-engine-0.1.0.jar!/:na]
	at eu.xenit.nuntio.engine.diff.DiffResolver.lambda$accept$1(DiffResolver.java:35) ~[nuntio-engine-0.1.0.jar!/:na]
	at java.base/java.util.Optional.ifPresent(Unknown Source) ~[na:na]
	at eu.xenit.nuntio.engine.diff.DiffResolver.accept(DiffResolver.java:34) ~[nuntio-engine-0.1.0.jar!/:na]
	at eu.xenit.nuntio.engine.diff.DiffResolver.accept(DiffResolver.java:19) ~[nuntio-engine-0.1.0.jar!/:na]
	at java.base/java.util.stream.ForEachOps$ForEachOp$OfRef.accept(Unknown Source) ~[na:na]
	at java.base/java.util.stream.ReferencePipeline$11$1.accept(Unknown Source) ~[na:na]
	at java.base/java.util.stream.ReferencePipeline$11$1.accept(Unknown Source) ~[na:na]
	at java.base/java.util.stream.SpinedBuffer$1Splitr.forEachRemaining(Unknown Source) ~[na:na]
	at java.base/java.util.stream.AbstractPipeline.copyInto(Unknown Source) ~[na:na]
	at java.base/java.util.stream.AbstractPipeline.wrapAndCopyInto(Unknown Source) ~[na:na]
	at java.base/java.util.stream.ForEachOps$ForEachOp.evaluateSequential(Unknown Source) ~[na:na]
	at java.base/java.util.stream.ForEachOps$ForEachOp$OfRef.evaluateSequential(Unknown Source) ~[na:na]
	at java.base/java.util.stream.AbstractPipeline.evaluate(Unknown Source) ~[na:na]
	at java.base/java.util.stream.ReferencePipeline.forEach(Unknown Source) ~[na:na]
	at eu.xenit.nuntio.engine.AntiEntropyDaemon.runAntiEntropy(AntiEntropyDaemon.java:52) ~[nuntio-engine-0.1.0.jar!/:na]
	at eu.xenit.nuntio.engine.AntiEntropyDaemon.lambda$configureTasks$3(AntiEntropyDaemon.java:70) ~[nuntio-engine-0.1.0.jar!/:na]
	at org.springframework.scheduling.support.DelegatingErrorHandlingRunnable.run(DelegatingErrorHandlingRunnable.java:54) ~[spring-context-5.3.9.jar!/:5.3.9]
	at org.springframework.scheduling.concurrent.ReschedulingRunnable.run(ReschedulingRunnable.java:95) ~[spring-context-5.3.9.jar!/:5.3.9]
	at java.base/java.util.concurrent.Executors$RunnableAdapter.call(Unknown Source) ~[na:na]
	at java.base/java.util.concurrent.FutureTask.run(Unknown Source) ~[na:na]
	at java.base/java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(Unknown Source) ~[na:na]
	at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source) ~[na:na]
	at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source) ~[na:na]
	at java.base/java.lang.Thread.run(Unknown Source) ~[na:na]
@vierbergenlars vierbergenlars added the bug Something isn't working label Feb 23, 2022
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

1 participant