Open brentgriffin opened 3 years ago
yea I was thinking retry logic on cassandra @tuxranger
for whatever reason, this has happened to me three times already today :-(
In janus.toml, could you change the logging level from info to debug? I don't believe the info level shows warning messages, which is what the retry logic messages are. I would like to double-check whether the retry logic is even running.
Setup: Janus with basic auth, using Cassandra as the persistence mechanism. Automated scripted deployments of Janus sporadically come up in a bad state: connections to the admin port are accepted, but they block until the client times out (no response is ever sent to the client). Requests through the API gateway port seem to work properly.
When the system comes up in this state, it never recovers. The only way I can get it working is to undeploy Janus and redeploy it.
Not having the admin port available prevents the loading of basic user credentials.
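Until this is fixed, a deployment script could at least detect the hung state before trying to load credentials. A minimal sketch in Go, assuming the admin API is plain HTTP; the address `http://localhost:8081/` and the helper name `adminResponds` are placeholders for illustration, not taken from the Janus docs or from my setup:

```go
package main

import (
	"fmt"
	"net/http"
	"time"
)

// adminResponds reports whether a GET to url yields any HTTP response
// within timeout. A connection that is accepted but never answered
// (the bad state described above) counts as "not responding".
func adminResponds(url string, timeout time.Duration) bool {
	// Client.Timeout bounds the whole exchange: connect, request, and response.
	client := &http.Client{Timeout: timeout}
	resp, err := client.Get(url)
	if err != nil {
		return false
	}
	resp.Body.Close()
	return true
}

func main() {
	// Hypothetical admin address; adjust to the actual deployment.
	const adminURL = "http://localhost:8081/"

	if adminResponds(adminURL, 5*time.Second) {
		fmt.Println("admin port is responding")
	} else {
		fmt.Println("admin port is not responding; redeploy may be needed")
	}
}
```

Any HTTP status code counts as "responding" here, since the symptom is that no response is sent at all.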
Frequency: No hard numbers here, but I estimate it fails once every five to six deployments.
Possible cause: Looking at the logs, I see a timeout when accessing Cassandra. Janus does not appear to ever retry the Cassandra request.
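For illustration, this is roughly the kind of retry I would expect around the initial Cassandra request. It is a generic sketch with exponential backoff, not Janus's actual code; `retry` and `errTimeout` are hypothetical names, and the failing operation is simulated rather than a real Cassandra call:

```go
package main

import (
	"errors"
	"fmt"
	"time"
)

// errTimeout stands in for a Cassandra timeout error (hypothetical).
var errTimeout = errors.New("cassandra: request timed out")

// retry calls op up to attempts times, sleeping between failures and
// doubling the delay each time. It returns nil on the first success,
// or the last error wrapped with the attempt count.
func retry(attempts int, delay time.Duration, op func() error) error {
	if attempts < 1 {
		attempts = 1
	}
	var err error
	for i := 1; i <= attempts; i++ {
		if err = op(); err == nil {
			return nil
		}
		if i < attempts {
			time.Sleep(delay)
			delay *= 2
		}
	}
	return fmt.Errorf("giving up after %d attempts: %w", attempts, err)
}

func main() {
	calls := 0
	// Simulated operation: times out twice, then succeeds.
	err := retry(5, 10*time.Millisecond, func() error {
		calls++
		if calls < 3 {
			return errTimeout
		}
		return nil
	})
	fmt.Println(calls, err) // prints "3 <nil>"
}
```

If the retries are exhausted, failing loudly (or exiting so the orchestrator restarts the process) would be preferable to leaving the admin port accepting connections that never get a response.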
Janus log when in bad state:
Janus log when the admin port works correctly: