RedHatSatellite / satellite-sanity

Red Hat Satellite health check
GNU General Public License v3.0

Sat6: sat6_hammer_ping fails on pulp_auth on build: 6.3.2 #50

Closed WRvdTuin closed 6 years ago

WRvdTuin commented 6 years ago

We use Nagios to monitor the Satellite server. The Nagios check sat6_hammer_ping from the Satellite sanity checks fails. /usr/local/bin/sat6_nagios is the same as satellite-sanity from this repo.

/usr/local/bin/sat6_nagios --rules sat6_hammer_ping --tag Satellite_6
INFO Rule sat6_hammer_ping returned {'failed_services': set(['pulp_auth'])}
[ FAIL ] Ensure that hammer sees all the services as running (sat6_hammer_ping)
You have 1 services hammer can not reach: pulp_auth
See https://access.redhat.com/solutions/1517513

However, the command hammer ping shows:

[root@sscc-satl6-l01p satellite_sanity_lib]# hammer ping
candlepin:
    Status: ok
    Server Response: Duration: 14ms
candlepin_auth:
    Status: ok
    Server Response: Duration: 15ms
pulp:
    Status: ok
    Server Response: Duration: 57ms
pulp_auth:
    Status: ok
    Server Response: Duration: 23ms
foreman_tasks:
    Status: ok
    Server Response: Duration: 731ms

I raised a Red Hat Support case, but they directed me to this site. Can you help debug and fix this issue?

WRvdTuin commented 6 years ago

I looked into the code and changed line 16 of rules/sat6_hammer_ping.py, from:

expected_services = ('candlepin', 'candlepin_auth', 'pulp', 'foreman_tasks')

to:

expected_services = ('candlepin', 'candlepin_auth', 'pulp', 'foreman_tasks', 'pulp_auth')

/usr/local/bin/sat6_nagios --rules sat6_hammer_ping --tag Satellite_6
INFO Rule sat6_hammer_ping returned None
[ PASS ] Ensure that hammer sees all the services as running (sat6_hammer_ping)

It is OK now.
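For anyone following along, here is a minimal sketch of why adding 'pulp_auth' to the tuple makes the check pass. It assumes the rule reports any service name that is not in expected_services as failed, which is consistent with the output above but is not copied from rules/sat6_hammer_ping.py. The helper names parse_hammer_ping and failed_services, and the SAMPLE text, are hypothetical and only for illustration.

```python
# Hypothetical sketch, NOT the actual rules/sat6_hammer_ping.py implementation.
# Assumption: a service is reported as failed if it is unknown to the rule,
# if its status is not "ok", or if it is expected but missing from the output.

OLD_EXPECTED = ('candlepin', 'candlepin_auth', 'pulp', 'foreman_tasks')
NEW_EXPECTED = ('candlepin', 'candlepin_auth', 'pulp', 'foreman_tasks', 'pulp_auth')

# Shortened "hammer ping" output, modelled on the report in this issue.
SAMPLE = """\
candlepin:
    Status: ok
    Server Response: Duration: 14ms
candlepin_auth:
    Status: ok
    Server Response: Duration: 15ms
pulp:
    Status: ok
    Server Response: Duration: 57ms
pulp_auth:
    Status: ok
    Server Response: Duration: 23ms
foreman_tasks:
    Status: ok
    Server Response: Duration: 731ms
"""


def parse_hammer_ping(output):
    """Return a dict mapping each service name to its reported status."""
    statuses = {}
    current = None
    for raw in output.splitlines():
        line = raw.strip()
        if not line:
            continue
        if line.startswith('Status:'):
            # A status line belongs to the most recent service header.
            if current is not None:
                statuses[current] = line.split(':', 1)[1].strip()
        elif line.endswith(':'):
            # A line such as "pulp_auth:" starts a new service section.
            current = line[:-1].strip()
    return statuses


def failed_services(output, expected):
    """Return the set of service names the check would flag."""
    statuses = parse_hammer_ping(output)
    failed = set()
    for name, status in statuses.items():
        if name not in expected or status != 'ok':
            failed.add(name)
    # Expected services that hammer did not report at all also count as failed.
    failed.update(name for name in expected if name not in statuses)
    return failed


before = failed_services(SAMPLE, OLD_EXPECTED)
after = failed_services(SAMPLE, NEW_EXPECTED)
```

Under that assumption, before contains only 'pulp_auth' even though hammer reports it as ok, and after is empty, which matches the FAIL and PASS results shown in this thread.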

jhutar commented 6 years ago

Thank you for your report and fix! Please note that we do not plan any further work on this project. Sanity checks for Foreman and Satellite 6 are now gathered in this tool: https://github.com/theforeman/foreman_maintain (also packaged for Sat6).