OpenSIPS / opensips

OpenSIPS is a GPL implementation of a multi-functionality SIP Server that targets to deliver a high-level technical solution (performance, security and quality) to be used in professional SIP server platforms.
https://opensips.org
Other
1.26k stars 576 forks source link

clusterer: the clusterer ping process is dying if the other node is down #1609

Closed ovidiusas closed 5 years ago

ovidiusas commented 5 years ago

How to reproduce:

Example:

  modparam("clusterer", "my_node_id", 2)
  modparam("clusterer", "db_mode", 0)
  modparam("clusterer", "my_node_info", "cluster_id=1, url=bin:192.168.2.61:5566")
  modparam("clusterer", "neighbor_node_info", "cluster_id=1,node_id=1,url=bin:192.168.2.160:5566")
  modparam("clusterer", "sharing_tag", "reg_osip/1=active")

After a while, the logs will show this:

Feb  4 11:50:06 asrock o[29074]: INFO:core:probe_max_sock_buff: using snd buffer of 416 kb
Feb  4 11:50:06 asrock o[29074]: INFO:core:init_sock_keepalive: TCP keepalive enabled on socket 7
Feb  4 11:50:06 asrock o[29075]: CRITICAL:core:handle_worker: dead child 16 (EOF received), pid 29074
rvlad-patrascu commented 5 years ago

Hi, This does not reproduce on my system. Can you provide some more details? For example: any other errors in the log, how long does it take to receive that critical error, any modules configured with replication.

ovidiusas commented 5 years ago

Can't reproduce it on the latest master. Probably was fixed while fixing other issues. Closing it for now.