SUSE / DeepSea

A collection of Salt files for deploying, managing and automating Ceph.
GNU General Public License v3.0
161 stars 75 forks source link

stage.0 Exception: 'getpwuid(): uid not found: #1841

Open kuijpersj opened 4 years ago

kuijpersj commented 4 years ago

Description of Issue/Question

I'm trying to expand my CEPH cluster with some new nodes. While running stage.0 i'm getting en error on the existing master in stage 19.

deepsea salt-run state.orch ceph.stage.0

I hope someone can help me resolve this issue.

Setup

openSUSE Leap 15.0 (existing servers) openSUSE Leap 15.1 (new servers) CEPH Nautilus

Steps to Reproduce Issue

Run deepsea salt-run state.orch ceph.stage.0

`[19/87] ceph.processes on cro-nld-ams-am4-ceph-osd3.mgmt.crossyn.local................................... ✓ (2s) | cephprocesses.wait........................................................ ✓ cro-nld-grk-ceph-osd3.mgmt.crossyn.local....................................... ✓ (2s) | cephprocesses.wait........................................................ ✓ cro-nld-ams-am4-ceph-osd2.mgmt.crossyn.local................................... ✓ (2s) | cephprocesses.wait........................................................ ✓ cro-nld-ams-am4-ceph-master1.mgmt.crossyn.local................................ ✓ (2s) | cephprocesses.wait........................................................ ✓ cro-nld-ams-am4-ceph-master2.mgmt.crossyn.local................................ ✓ (2s) | cephprocesses.wait........................................................ ✓ cro-nld-grk-ceph-osd1.mgmt.crossyn.local....................................... ✓ (2s) | cephprocesses.wait........................................................ ✓ cro-nld-grk-ceph-osd2.mgmt.crossyn.local....................................... ✓ (2s) | cephprocesses.wait........................................................ ✓ cro-nld-ams-am4-ceph-osd1.mgmt.crossyn.local................................... ✓ (2s) | cephprocesses.wait........................................................ ✓ cro-nld-grk-ceph-master1.mgmt.crossyn.local.................................... ❌ (2s) |_ cephprocesses.wait........................................................ ❌

Ended stage: ceph.stage.0 succeeded=18/87 failed=1/87 time=177.9s

Failures summary:

ceph.processes (/srv/salt/ceph/processes): cro-nld-grk-ceph-master1.mgmt.crossyn.local: wait for all processes: Module function cephprocesses.wait threw an exception. Exception: 'getpwuid(): uid not found: 377000001'`

Run salt-call cephprocesses.check

[ERROR ] An un-handled exception was caught by salt's global exception handler: KeyError: 'getpwuid(): uid not found: 377000001' Traceback (most recent call last): File "/usr/bin/salt-call", line 11, in salt_call() File "/usr/lib/python3.6/site-packages/salt/scripts.py", line 431, in salt_call client.run() File "/usr/lib/python3.6/site-packages/salt/cli/call.py", line 57, in run caller.run() File "/usr/lib/python3.6/site-packages/salt/cli/caller.py", line 138, in run ret = self.call() File "/usr/lib/python3.6/site-packages/salt/cli/caller.py", line 237, in call ret['return'] = self.minion.executors[fname](self.opts, data, func, args, kwargs) File "/usr/lib/python3.6/site-packages/salt/executors/direct_call.py", line 12, in execute return func(*args, *kwargs) File "/var/cache/salt/minion/extmods/modules/cephprocesses.py", line 353, in check res.add(ProcInfo(running_proc), role) File "/var/cache/salt/minion/extmods/modules/cephprocesses.py", line 137, in init self.uid_name = pwd.getpwuid(self.uid).pw_name KeyError: 'getpwuid(): uid not found: 377000001' Traceback (most recent call last): File "/usr/bin/salt-call", line 11, in salt_call() File "/usr/lib/python3.6/site-packages/salt/scripts.py", line 431, in salt_call client.run() File "/usr/lib/python3.6/site-packages/salt/cli/call.py", line 57, in run caller.run() File "/usr/lib/python3.6/site-packages/salt/cli/caller.py", line 138, in run ret = self.call() File "/usr/lib/python3.6/site-packages/salt/cli/caller.py", line 237, in call ret['return'] = self.minion.executors[fname](self.opts, data, func, args, kwargs) File "/usr/lib/python3.6/site-packages/salt/executors/direct_call.py", line 12, in execute return func(args, **kwargs) File "/var/cache/salt/minion/extmods/modules/cephprocesses.py", line 353, in check res.add(ProcInfo(running_proc), role) File "/var/cache/salt/minion/extmods/modules/cephprocesses.py", line 137, in init self.uid_name = pwd.getpwuid(self.uid).pw_name KeyError: 'getpwuid(): uid not found: 377000001'

Versions Report

(Provided by running: salt-run deepsea.version 0.9.30+git.0.e2dea4531

rpm -qi salt-minion Name : salt-minion Version : 2019.2.0 Release : lp150.3.39.1 Architecture: x86_64 Install Date: Mon Jun 29 12:34:04 2020 Group : System/Management Size : 41019 License : Apache-2.0 Signature : RSA/SHA256, Thu Nov 7 18:12:06 2019, Key ID b88b2fd43dbdc284 Source RPM : salt-2019.2.0-lp150.3.39.1.src.rpm Build Date : Thu Nov 7 18:10:17 2019 Build Host : lamb14 Relocations : (not relocatable) Packager : http://bugs.opensuse.org Vendor : openSUSE URL : http://saltstack.org/ Summary : The client component for Saltstack Description : Salt minion is queried and controlled from the master. Listens to the salt master and execute the commands. Distribution: openSUSE Leap 15.0

rpm -qi salt-master Name : salt-master Version : 2019.2.0 Release : lp150.3.39.1 Architecture: x86_64 Install Date: Mon Jun 29 12:32:19 2020 Group : System/Management Size : 2936818 License : Apache-2.0 Signature : RSA/SHA256, Thu Nov 7 18:12:05 2019, Key ID b88b2fd43dbdc284 Source RPM : salt-2019.2.0-lp150.3.39.1.src.rpm Build Date : Thu Nov 7 18:10:17 2019 Build Host : lamb14 Relocations : (not relocatable) Packager : http://bugs.opensuse.org Vendor : openSUSE URL : http://saltstack.org/ Summary : The management component of Saltstack with zmq protocol supported Description : The Salt master is the central server to which all minions connect. Enabled commands to remote systems to be called in parallel rather than serially. Distribution: openSUSE Leap 15.0

)