OpenClovis / SAFplus-Availability-Scalability-Platform

Middleware that provides libraries, GUI, and code generator to design multi-node (clustered) applications that are highly available, redundant, and scalable. Provides sub-second node and application fault detection and failover, and useful application libraries including distributed hash tables (checkpoint), event, logging, and communications. Implements SA-Forum APIs where applicable. Used anywhere reliability is a must -- like telecom, wireless, defense and enterprise computing. Download stable release with installer from: ftp.openclovis.com
www.openclovis.com
GNU General Public License v2.0

Inconsistent cluster view after split brain restart/recovery #7

Closed karthick18 closed 12 years ago

karthick18 commented 12 years ago

During a split brain recovery test with UDP transport, one of the controllers was found to have an inconsistent cluster view after recovery: it did not see one of the nodes in the cluster. This was observed after a split brain restart triggered from the controller being restarted (last lower node id standby). The controller that got restarted had the inconsistent view. This issue is not always reproducible.

karthick18 commented 12 years ago

This has been addressed with commit 66eb942b18754b762caaa21206efa36e67170261. It has also been addressed in 6.0 with commit 61e35628b0423423309fbeb202cff5496950eff1.