Open pbirkants opened 4 months ago
Please, when you have questions or problems use the community forum before opening new issues, thank you.
You can disable the health checks to avoid the bridge being removed if you do not have multiple bridges and autoscaling.
Jicofo fails to resume jvb health checks once they fail. This is fine in most environments where we use sticky-failures=true
which is why we haven't noticed before.
Thank you for reopening this issue.
I'd like to add that I'm using the defaults for any related settings for both Jicofo and JVB, which, I believe, are sticky-failures=false
.
Disabling health checks does not seem like a good solution, as that could make it difficult to detect when the bridge is actually down.
This bug affects me too. Is there currently a planned timeline for a fix?
It would be wrong to handle this bug by turning off health checks. I see that nobody is assigned to solve this. I'd like to know if there will be some movement regarding this in the near future..
Description
Bridge link between Jicofo and JVB sometimes is terminated when host is under heavy load by other processes, but later never recovers, preventing any Jitsi calls from working until manually restarted.
Current behavior
If the healthcheck takes too long, the JVB node is dropped and never resumed, even though it's running fine.
Here are the relevant Jicofo and JVB logs from the time period, nothing else was recorded before or after this (until JVB was restarted manually).
Expected Behavior
The bridge connection should be recovered automatically.
Steps to reproduce
Not sure how to reproduce this reliably, it has happened two or three times over several months during the night, when Jitsi is completely idle, but other processes running on the host are causing significant system load.
Environment details
All Jitsi components installed locally on a single server, with a single bridge used.
APT package versions (but this has happened with earlier versions, too):