Open deedarb opened 3 years ago
The error does not occur regularly. After investigating the logs, I found that the exceptions in February appear only from the 4th until the 11th, which is strange. I have eight services running on two VMs and one Consul agent running on one of the VMs.
https://groups.google.com/g/vertx/c/3IVhc5DJNlA
There is an answer from Thomas (tsegismont) in that thread:
OK so it happens when you restart nodes, and I guess you restart a lot of them at the same time. One way to avoid losing clustered data (event bus subscriptions and haInfo) is to increase the number of backups for the corresponding multimap (vertx.subs) and map (vertx.haInfo). Beware that versions up to 3.3.3 have a bug in the default cluster setup file (https://github.com/vert-x3/vertx-hazelcast/commit/4b6e6f815a221306ae93c500b5e62d8c1452397a).
We are working on a fix in 3.4.
This could be the cause of the problem: when we update the apps, a bash script stops all of them, copies the new jars, and starts them again. Maybe old subscriptions are left behind in Consul under __vertx.subs/haInfo.
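One way to check the "old subs left in Consul" theory is to list the keys in Consul's KV store while all apps are stopped. The sketch below uses Consul's standard HTTP KV endpoint (`/v1/kv/<prefix>?recurse=true&keys=true`). The agent address and the assumption that the cluster manager stores its data under the prefixes `__vertx.subs` and `__vertx.haInfo` are guesses based on the names in this thread, not something confirmed from the consul-cluster-manager source.

```python
import json
import urllib.error
import urllib.parse
import urllib.request

# Assumption: the Consul agent listens on its default local HTTP address.
CONSUL_ADDR = "http://127.0.0.1:8500"

# Assumption: key prefixes inferred from the names mentioned in this thread;
# the real key layout used by the cluster manager may differ.
PREFIXES = ("__vertx.subs", "__vertx.haInfo")


def keys_url(prefix, base=CONSUL_ADDR):
    """Build the Consul KV URL that lists all key names under a prefix."""
    return f"{base}/v1/kv/{urllib.parse.quote(prefix)}?recurse=true&keys=true"


def list_keys(prefix, base=CONSUL_ADDR):
    """Return the key names stored under a prefix, or [] if none exist."""
    try:
        with urllib.request.urlopen(keys_url(prefix, base), timeout=5) as resp:
            return json.load(resp)
    except urllib.error.HTTPError as err:
        # Consul answers 404 when nothing is stored under the prefix.
        if err.code == 404:
            return []
        raise
```

Calling `list_keys(p)` for each `p` in `PREFIXES` right after the script has stopped every app, and again after the restart, would show whether entries from the old nodes survive. If keys are still listed while nothing is running, stale subscriptions are a plausible cause of the exception.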
I have a strange error with a clustered event bus handler (using reactiverse/consul-cluster-manager and Vert.x 3.9.5).
Where should I dig to find the cause? I asked for help in https://gitter.im/eclipse-vertx/vertx-users but got no response :| Maybe the app sometimes leaves the cluster and that causes the exception. How can I prove or fix that? Is it somehow network related? Consul and the apps are on the same machine.