Closed bkraul closed 4 years ago
This happened again, with the exact same situation. This time, I tried to use a docker restart mailserver
command to restart the mailserver container. It failed twice with the exact same error, but on the third time, it worked OK and it resumed operation. I am pretty sure this is supervisor crashing and not being able to be restarted. I just wish I knew how to bring it back automatically when it crashes.
Hi @bkraul ,
I usually, run the following command: docker stop mailserver && docker start mailserver
. if I made changes in the docker file then docker stop mailserver && docker rm mailserver && docker-compose up -d mailserver
I hope this helps you for restarting your mailserver container safely. Regarding the problem with the crash, are you seeing something in your host machine logs (like in the message.log, systlog)?
@sknight80 Yes, it is the same number of steps I follow. Checked the host logs. I could not find anything remotely related.
This issue has been marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
Well, not sure what more activity it can have beyond being reported. This continues to happen. I am suspecting it is load-based, as I am seeing multiple denied relay attempts when it crashes. The problem is not that any service crashes, though, the problem is that supervisor crashes and it does not listen anymore so nothing can be restarted.
Need further investigation on what could cause instabilities. Upload all information about processes and logs here before and after crashes occur. Try with another hardware & host computer too.
@bkraul can you test again with the new image under Debian Buster (use 1.1-latest tag
) ? I have updated s6 (skarnet's small supervision suite) as well.
Just updated. Will monitor and let you know. I wish this was easily reproducible but as I said, it seems load-based.
I just had a similar problem and a container restart fix it. Here's my log with real domain name replaced:
Attaching to rainloop, postfixadmin, mailserver
postfixadmin | PHP 7.3.9 Development Server started at Sun Oct 20 07:02:38 2019
postfixadmin | [Sun Oct 20 07:02:52 2019] 172.20.0.2:46092 [302]: /
postfixadmin | [Sun Oct 20 07:02:52 2019] 172.20.0.2:46118 [200]: /login.php
postfixadmin | [Sun Oct 20 07:02:52 2019] 172.20.0.2:46156 [200]: /login.php
postfixadmin | [Sun Oct 20 07:02:52 2019] 172.20.0.2:46162 [200]: /images/logo-default.png
postfixadmin | [Sun Oct 20 07:02:52 2019] 172.20.0.2:46160 [200]: /css/default.css
postfixadmin | [Sun Oct 20 07:02:52 2019] 172.20.0.2:46202 [200]: /css/calendar.css
postfixadmin | [Sun Oct 20 07:02:53 2019] 172.20.0.2:46226 [200]: /images/mail_bg.gif
postfixadmin | [Sun Oct 20 07:02:53 2019] 172.20.0.2:46252 [200]: /images/favicon.ico
postfixadmin | [Sun Oct 20 07:02:55 2019] 172.20.0.2:46496 [302]: /login.php
postfixadmin | [Sun Oct 20 07:02:55 2019] 172.20.0.2:46550 [200]: /main.php
postfixadmin | [Sun Oct 20 07:02:55 2019] 172.20.0.2:46588 [200]: /css/default.css
postfixadmin | [Sun Oct 20 07:02:55 2019] 172.20.0.2:46590 [200]: /images/logo-default.png
postfixadmin | [Sun Oct 20 07:02:56 2019] 172.20.0.2:46608 [200]: /css/calendar.css
postfixadmin | [Sun Oct 20 08:26:12 2019] 172.20.0.2:54316 [302]: /
postfixadmin | [Sun Oct 20 08:26:12 2019] 172.20.0.2:54322 [200]: /login.php
postfixadmin | [Sun Oct 20 08:26:12 2019] 172.20.0.2:54332 [200]: /images/favicon.ico
postfixadmin | [Sun Oct 20 09:59:51 2019] 172.20.0.2:40636 [302]: /
postfixadmin | [Sun Oct 20 09:59:52 2019] 172.20.0.2:40652 [200]: /login.php
postfixadmin | [Sun Oct 20 09:59:52 2019] 172.20.0.2:40670 [200]: /images/logo-default.png
postfixadmin | [Sun Oct 20 09:59:52 2019] 172.20.0.2:40672 [200]: /css/default.css
postfixadmin | [Sun Oct 20 09:59:52 2019] 172.20.0.2:40700 [200]: /login.php
postfixadmin | [Sun Oct 20 09:59:52 2019] 172.20.0.2:40728 [200]: /css/calendar.css
postfixadmin | [Sun Oct 20 09:59:52 2019] 172.20.0.2:40738 [200]: /images/mail_bg.gif
postfixadmin | [Sun Oct 20 10:00:03 2019] 172.20.0.2:41746 [302]: /login.php
postfixadmin | [Sun Oct 20 10:00:04 2019] 172.20.0.2:42024 [200]: /main.php
postfixadmin | [Sun Oct 20 10:00:04 2019] 172.20.0.2:42064 [200]: /css/default.css
postfixadmin | [Sun Oct 20 10:00:04 2019] 172.20.0.2:42066 [200]: /images/logo-default.png
postfixadmin | [Sun Oct 20 10:00:04 2019] 172.20.0.2:42080 [200]: /css/calendar.css
postfixadmin | [Sun Oct 20 10:00:09 2019] 172.20.0.2:42570 [200]: /list.php?table=admin
postfixadmin | [Sun Oct 20 10:00:09 2019] 172.20.0.2:42598 [200]: /css/default.css
postfixadmin | [Sun Oct 20 10:00:09 2019] 172.20.0.2:42600 [200]: /images/logo-default.png
postfixadmin | [Sun Oct 20 10:00:09 2019] 172.20.0.2:42620 [200]: /css/calendar.css
postfixadmin | [Sun Oct 20 10:00:17 2019] 172.20.0.2:43392 [200]: /list-virtual.php
postfixadmin | [Sun Oct 20 10:00:17 2019] 172.20.0.2:43426 [200]: /css/default.css
postfixadmin | [Sun Oct 20 10:00:17 2019] 172.20.0.2:43428 [200]: /images/logo-default.png
postfixadmin | [Sun Oct 20 10:00:17 2019] 172.20.0.2:43448 [200]: /css/calendar.css
postfixadmin | [Sun Oct 20 10:00:24 2019] 172.20.0.2:44088 [200]: /edit.php?table=mailbox&edit=myname%40mydomain.com
postfixadmin | [Sun Oct 20 10:00:24 2019] 172.20.0.2:44104 [200]: /css/default.css
postfixadmin | [Sun Oct 20 10:00:24 2019] 172.20.0.2:44106 [200]: /images/logo-default.png
postfixadmin | [Sun Oct 20 10:00:24 2019] 172.20.0.2:44118 [200]: /css/calendar.css
postfixadmin | [Sun Oct 20 10:00:28 2019] 172.20.0.2:44454 [200]: /list-virtual.php
postfixadmin | [Sun Oct 20 10:24:43 2019] 172.20.0.2:50262 [302]: /
postfixadmin | [Sun Oct 20 10:24:44 2019] 172.20.0.2:50276 [200]: /login.php
postfixadmin | [Sun Oct 20 10:24:44 2019] 172.20.0.2:50292 [200]: /css/default.css
postfixadmin | [Sun Oct 20 10:24:44 2019] 172.20.0.2:50294 [200]: /images/logo-default.png
postfixadmin | [Sun Oct 20 10:46:43 2019] 172.20.0.2:45142 [302]: /
postfixadmin | [Sun Oct 20 10:46:43 2019] 172.20.0.2:45158 [200]: /login.php
postfixadmin | [Sun Oct 20 10:46:43 2019] 172.20.0.2:45184 [200]: /images/logo-default.png
postfixadmin | [Sun Oct 20 10:46:43 2019] 172.20.0.2:45186 [200]: /css/default.css
mailserver | [INFO] MariaDB/PostgreSQL hostname not found in /etc/hosts
mailserver | [INFO] Container IP found, adding a new record in /etc/hosts
mailserver | [INFO] Redis hostname not found in /etc/hosts
mailserver | [INFO] Container IP found, adding a new record in /etc/hosts
mailserver | [INFO] Search for SSL certificates generated by Traefik
mailserver | [INFO] acme.json found with ACME v2 format, dumping into pem files
mailserver | [INFO] Let's encrypt wildcard certificate found
mailserver | [INFO] Setting up container
mailserver | [INFO] Found DKIM key pair for domain mydomain.com - skip creation
mailserver | [INFO] Debug mode is disabled
mailserver | [INFO] ClamAV is disabled, service will not start
mailserver | [INFO] Fetchmail forwarding is disabled
mailserver | [INFO] Automatic GPG encryption is disabled
mailserver | [INFO] ManageSieve protocol is enabled
mailserver | [INFO] DKIM/ARC signing is enabled
mailserver | [INFO] Greylisting policy is enabled
mailserver | [INFO] Ratelimiting policy is disabled
mailserver | [INFO] POP3 protocol is disabled
mailserver | [INFO] Unbound DNS resolver is enabled
mailserver | [INFO] clamav-unofficial-sigs is disabled (user configuration not found)
mailserver | [INFO] Finished container setup
mailserver | [INFO] Starting services
mailserver | s6-svc: fatal: unable to control /services/rsyslogd: supervisor not listening
mailserver | [INFO] Checking for watchable SSL certificates
mailserver | [INFO] Watching /etc/letsencrypt/acme
^CTraceback (most recent call last):
This issue is reproducible at my test system, but only with about 15% probability. So if I run the test suite 10 times, it normally happens once or twice.
I am not seeing anything in the logs, but that is probably because the parts responsible have logging disabled or as rsyslogd itself is not running, is not logged?
But I can see some processes not running that normally should be running in that container. These are the processes running in the normal container, but not the crashed one:
root 1385 0.0 0.0 156096 3124 ? Ssl 15:20 0:00 rsyslogd -n -f /etc/rsyslog/rsyslog.conf
postfix 1541 0.0 0.1 53092 13888 ? S 15:22 0:00 smtpd -n smtp -t inet -u -o stress= -s 2
postfix 1542 0.0 0.0 43496 7252 ? S 15:22 0:00 proxymap -t unix -u
postfix 1544 0.0 0.0 52300 9984 ? S 15:22 0:00 trivial-rewrite -n rewrite -t unix -u
postfix 1545 0.0 0.1 52536 10024 ? S 15:22 0:00 cleanup -z -t unix -u
postfix 1546 0.0 0.0 43784 7380 ? S 15:22 0:00 lmtp -t unix -u
postfix 1572 0.0 0.0 43492 7128 ? S 15:22 0:00 scache -l -t unix -u
postfix 1586 0.0 0.1 53228 13792 ? S 15:22 0:00 smtpd -n submission -t inet -u -o stress= -s 2 -o syslog_name=postfix/submission-custom -o smtpd_tls_security_level=encrypt -o smtpd_tls_ciphers=high -o smtpd_client_restrictions=permit_sasl_authenticated,reject -o cleanup_
postfix 1587 0.0 0.1 52992 10732 ? S 15:22 0:00 cleanup -z -n authclean -t unix -u -o syslog_name=postfix/authclean -o header_checks=pcre:/etc/postfix/header_checks
Maybe this helps to narrow the problem in. If someone can tell me how to enable more verbous logs, I can test that aswell.
Sidenote: I am working on a base image update to the latest version of s6 for example.
Related to your question: I don't know yet, but I can Google it
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
Classification
Please delete options that are not relevant.
Reproducibility
Random stoppage of the mailserver with the following significant log entries:
It would seem as if supervisor is crashing and not coming back.
Docker information
Description
Briefly describe the problem you are having in a few lines.
Steps to reproduce
Expected results
No unexpected supervisor crash.
Actual results
Supervisor crashes, docker container restart is needed.
Debugging information
Configuration (docker-compose.yml, traefik.toml...etc)