sni / mod_gearman

Distribute Naemon Host/Service Checks & Eventhandler with Gearman Queues. Host/Servicegroups affinity included.
http://www.mod-gearman.org
GNU General Public License v3.0

mod_gearman on naemon #55

Closed (dcec closed this issue 10 years ago)

dcec commented 10 years ago

On OMD with Naemon, rrdcached and pnp4nagios, everything works. When I enable Gearman and set pnp4nagios to use Gearman, the performance data stops working.

2014-02-18 16:53:14 - localhost:4755 - v0.33

Queue Name       | Worker Available | Jobs Waiting | Jobs Running
check_results    | 1                | 0            | 0
eventhandler     | 5                | 0            | 0
host             | 5                | 0            | 0
perfdata         | 2                | 0            | 0
service          | 5                | 0            | 0
worker_localhost | 1                | 0            | 0
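
For anyone reproducing this, the same per-queue numbers can be read directly from gearmand's text admin protocol (the `status` command) and checked from a script. The following is a minimal sketch, not part of mod_gearman: the function names are hypothetical, port 4755 is taken from the header line of the output above, and "waiting" is assumed to be total jobs minus running jobs.

```python
import socket


def parse_gearman_status(text):
    """Turn the reply of gearmand's text-protocol `status` command into a dict."""
    queues = {}
    for line in text.splitlines():
        if line.strip() == ".":  # a lone dot terminates the reply
            break
        if not line.strip():
            continue
        # Columns are tab separated: queue name, total jobs, running jobs, workers
        name, total, running, workers = line.split("\t")
        queues[name] = {
            "workers": int(workers),
            # Assumption: jobs waiting = total jobs - running jobs
            "waiting": int(total) - int(running),
            "running": int(running),
        }
    return queues


def fetch_gearman_status(host="localhost", port=4755, timeout=5.0):
    """Send `status` to gearmand and return the raw reply as text."""
    with socket.create_connection((host, port), timeout=timeout) as sock:
        sock.sendall(b"status\n")
        buf = b""
        # Read until the terminating "." line arrives or the peer closes
        while not (buf == b".\n" or buf.endswith(b"\n.\n")):
            chunk = sock.recv(4096)
            if not chunk:
                break
            buf += chunk
    return buf.decode("ascii", errors="replace")


# Offline example with made-up numbers (not taken from the report):
sample = "perfdata\t3\t1\t2\nhost\t0\t0\t5\n.\n"
status = parse_gearman_status(sample)
```

Against a live server, `parse_gearman_status(fetch_gearman_status())` would return the same structure. A `perfdata` queue whose waiting count keeps growing would point at the consumer side rather than at the NEB module.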

I cannot find any error in the NEB log.

[2014-02-18 16:48:53][2448][TRACE] --------------- host Job -> 7, 804
[2014-02-18 16:48:53][2448][DEBUG] received job for queue host: prova
[2014-02-18 16:48:53][2448][DEBUG] host: 'prova', next_check is at 2014-02-18 16:48:53, latency so far: 0
[2014-02-18 16:48:53][2448][TRACE] cmd_line: /omd/sites/provaok/lib/nagios/plugins/check_icmp -H 127.0.0.1 -w 3000.0,80% -c 5000.0,100% -p 5
[2014-02-18 16:48:53][2448][TRACE] add_job_to_queue(host, prova, 2, 1, 1, 1)
[2014-02-18 16:48:53][2448][TRACE] 249 --->type=host result_queue=check_results host_name=prova start_time=1392738533.0 next_check=1392738533.0 timeout=30 core_time=1392738533.2642 command_line=/omd/sites/provaok/lib/nagios/plugins/check_icmp -H 127.0.0.1 -w 3000.0,80% -c 5000.0,100% -p 5 <---
[2014-02-18 16:48:53][2448][TRACE] 344 +++> kR0peh1GWEAlkNvoygET0Uk8Dz3cXuyAqGl4sbArOPr26IdBiSRYWCpCHs6Dt5M6/uwem9NxgHYiiJZah8Za2IOTB31RkRfK3638iSK10wvT7JPvMo+yxPif0gjK+rZPDyGzSItJIb6tWpXDFPYbJmlquFI7JnbIb7w81/5n3iZakGHxQ3RrNCgkXQDP35+o8ycSnBIh+a9qrLkU2qVdgMLX/pYQoVNuAJc+CQZvLAyz2aFit8vPpt8n/H2NhrEXYVSnn1zvqCUObT6iAUCkAhpM9oVgIrpMxMrqAXYJEvfW4RcQ+B1NpClhUNv4NKTAdEooIEqJHce+bfEkXOgTwA== <+++
[2014-02-18 16:48:53][2448][TRACE] add_job_to_queue() finished successfully: 0 0
[2014-02-18 16:48:53][2448][TRACE] handle_host_check() finished successfully -> 206
[2014-02-18 16:48:53][2448][TRACE] got result H:portale.solumonitoring.it:5
[2014-02-18 16:48:53][2448][TRACE] 364 +++> FZVqHLJEd9TbHwmjp7oVXtFudvDn0UJQNDsJUepuzKlWDLecKcl6J9dI0hhx44Uhr90N0ReKrQam6DqUstc69MmfniobFAhX1jm774v2cMw9AgCBogAW1bZHt+jn0Co+I4qnU3H+kn0fxnsb8D4AJ0lAYyrIvVXCFU2enuhQDVl2s5bitfPSu5e7RSqPCFM4A8cRxjPlrzPnw8Wbzkuc9WPLLMgnZoXvtjdTqpgc0mZMpoXTuwxhf3+ENGkfOym6ChE57u/uZ0WuKs8nPZpeRyKI7/hyXhrLtgYDcF1eCRPdmTSBJjuKo+H30Y2D5cepQXnANJwvSN9BKbyhIRqf7jIFaVbCa3WoaC8ar02s/0A= <+++
[2014-02-18 16:48:53][2448][TRACE] 261 ---> host_name=prova core_start_time=1392738533.0 start_time=1392738533.5916 finish_time=1392738533.17299 return_code=0 exited_ok=1 output=OK - 127.0.0.1: rta 0.028ms, lost 0%|rta=0.028ms;3000.000;5000.000;0; pl=0%;80;100;; rtmax=0.067ms;;;; rtmin=0.017ms;;;; \n <---
[2014-02-18 16:48:53][2448][DEBUG] host job completed: prova: 0
[2014-02-18 16:48:54][2448][TRACE] handle_timed_events(1, data)
[2014-02-18 16:48:54][2448][TRACE] handle_host_check(7)
[2014-02-18 16:48:54][2448][TRACE] --------------- host Job -> 7, 801
[2014-02-18 16:48:54][2448][TRACE] handle_perfdata(7)
[2014-02-18 16:48:54][2448][TRACE] add_job_to_queue(perfdata, prova, 2, 1, 1, 1)
[2014-02-18 16:48:54][2448][TRACE] 230 --->DATATYPE::HOSTPERFDATA TIMET::1392738534 HOSTNAME::prova HOSTPERFDATA::rta=0.028ms;3000.000;5000.000;0; pl=0%;80;100;; rtmax=0.067ms;;;; rtmin=0.017ms;;;; \n HOSTCHECKCOMMAND::check-host-alive!(null) HOSTSTATE::0 HOSTSTATETYPE::1 <---
[2014-02-18 16:48:54][2448][TRACE] 320 +++> UdhpucpFM6vtfGcYOXCapjeZ6JPQUGnIKgT+CUcjl1E/KkJZgu0y1j9C8bHslvAUoWg/BhpVzh/6LmmDNjghiptBrz8xchRCLCvMW3RRu1N3iQXX0lYEvOIhoXjDqk92oNR5a/5CwQrnVMIbjELIoU5xuOaa0ckeF55ds9giCzv4Z+JdfUxunzde83URI7hmU3YnqR23tf6eBTuOQ2xcIm+5DHYyCmtPSG6+R65dQQ2eUuFQ4u7nP3K/tO/u1X3DE3KwLDIdmj9hazgoUntGxkVat3R7lxC3qbMzQXIYCf2/v6cRbs3oXxwEY+30qdKJ <+++
[2014-02-18 16:48:54][2448][TRACE] add_job_to_queue() finished successfully: 0 0
[2014-02-18 16:48:54][2448][TRACE] handle_perfdata() finished successfully
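
The perfdata job in the trace above carries a series of `KEY::VALUE` fields (DATATYPE, TIMET, HOSTNAME, and so on). To check what a consumer of the perfdata queue would receive, the payload can be split into its fields. This is a rough sketch that assumes the fields are whitespace separated as they appear in the trace and that keys are upper-case words followed by `::`. The function name `parse_perfdata_payload` is hypothetical and not part of mod_gearman or pnp4nagios.

```python
import re

# A key is an upper-case word immediately followed by "::"
FIELD_KEY = re.compile(r"([A-Z]+)::")


def parse_perfdata_payload(payload):
    """Split a 'KEY::VALUE KEY::VALUE ...' perfdata payload into a dict."""
    parts = FIELD_KEY.split(payload)
    # parts[0] is whatever precedes the first key; after that, keys and
    # values alternate: [prefix, key1, value1, key2, value2, ...]
    fields = {}
    for key, value in zip(parts[1::2], parts[2::2]):
        fields[key] = value.strip()
    return fields


# Payload copied from the TRACE line above (the "\n" is a literal
# backslash-n, as printed in the log)
payload = (
    "DATATYPE::HOSTPERFDATA TIMET::1392738534 HOSTNAME::prova "
    "HOSTPERFDATA::rta=0.028ms;3000.000;5000.000;0; pl=0%;80;100;; "
    "rtmax=0.067ms;;;; rtmin=0.017ms;;;; \\n "
    "HOSTCHECKCOMMAND::check-host-alive!(null) HOSTSTATE::0 HOSTSTATETYPE::1"
)
fields = parse_perfdata_payload(payload)
print(fields["HOSTNAME"])  # prova
```

For this trace the payload parses cleanly and contains the host name, timestamp and perfdata string, which suggests the job was queued correctly and that the missing graph updates originate further downstream.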

dcec commented 10 years ago

I ran some more tests. In this case pnp4nagios works fine:
Starting rrdcached...OK
Starting npcd...OK
Starting naemon...OK
Starting dedicated Apache for site provaok...OK
Initializing Crontab...OK

When I enable MOD_GEARMAN on OMD, pnp4nagios does not show any graph updates. I can see that files in /pnp4nagios/spool are created and deleted, but the .rrd files are not updated. Is the problem in rrdcached? Why does it happen only with mod_gearman on Naemon?

sni commented 10 years ago

This should be fixed in the meantime. Could you try the latest version?