vincenthuynh closed this issue 5 months ago
Hi @vincenthuynh, how are you issuing the SIGHUP signal?
Are you specifying the nomad parent process ID? e.g.
➜ sudo kill -SIGHUP <pid>
Or are you using a tool like pkill to signal any process named nomad? e.g.
➜ sudo pkill -SIGHUP nomad
The latter will also send SIGHUP to the logging processes, because they are the same nomad executable, just run with different args, and that would result in the behavior you're seeing.
Hi @shoenig, Thanks for following up!
We are running systemctl kill -s HUP nomad.service, which would have the same behaviour as the latter example. We'll update our ansible playbook to specify the parent process.
I'll close this issue.
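For anyone landing here later, a minimal sketch of signalling only the parent process under systemd. The function name reload_main_pid is hypothetical, and the sketch assumes the agent runs as the main process of a unit called nomad.service and that the installed systemd supports systemctl show --value:

```shell
# Hypothetical helper: send SIGHUP only to the main PID of a systemd unit,
# leaving child processes such as logmon and docker_logging untouched.
reload_main_pid() {
  unit="$1"
  # MainPID is the PID that systemd tracks as the service's main process.
  pid="$(systemctl show --property MainPID --value "$unit")"
  # systemd reports 0 when the unit has no running main process.
  if [ -z "$pid" ] || [ "$pid" = "0" ]; then
    echo "no main PID found for $unit" >&2
    return 1
  fi
  kill -HUP "$pid"
}
```

Usage would be reload_main_pid nomad.service, run as root. As far as I know, systemctl kill can also be limited to the main process with --kill-who=main (spelled --kill-whom in newer systemd releases), which may be simpler to put in a playbook.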
Nomad version
v1.5.6
Operating system and Environment details
Debian
Issue
After rolling the TLS certificates in our Nomad cluster, all allocations have stopped logging. This is observed in the UI and using nomad alloc logs <alloc id>. The logmon and docker_logging processes disappear on client VMs when the new certificate is reloaded via SIGHUP. The workaround is to restart the allocations or tasks.
Reproduction steps
Update client TLS certificates
Expected Result
Actual Result
Observed on each client. Likely related to RPC calls that could no longer be made:
Job file (if appropriate)
Nomad Server logs (if appropriate)
Nomad Client logs (if appropriate)
The following log patterns were found on the clients: