centreon / centreon-engine

Extremely fast monitoring scheduler, forked from Nagios
GNU General Public License v2.0
42 stars 17 forks source link

Centengine can not work #69

Closed kingleoric2010 closed 5 years ago

kingleoric2010 commented 7 years ago

Hello,

Today, i restart the monitoring engine and my poller has stopped. I don't understand why. I checked on the log but there is nothing... So i tried to restart centengine but i got this messsage. [centreon@SSAVL2910 init.d]$ centengine start [1490621288] [5148] Reading main configuration file '/etc/rc.d/init.d/start'. [1490621288] [5148] Error: Parsing of global configuration failed in file '/etc/rc.d/init.d/start' on line 1: Invalid line '123' [root@SSAVL2910 ~]# service centengine start centengine start: failed.[FAILED]

[root@SSAVL2910 ~]# ps -ef|grep centengine root 5279 5186 0 13:31 pts/2 00:00:00 grep centengine

Here is the status of Centengine and centstorage [centreon@SSAVL2910 init.d]$ centengine -v /etc/centreon-engine/centengine.cfg [1490621323] [5150] Reading main configuration file '/etc/centreon-engine/centengine.cfg'. [1490621323] [5150] Processing object config file '/etc/centreon-engine/hostTemplates.cfg' [1490621323] [5150] Processing object config file '/etc/centreon-engine/hosts.cfg' [1490621323] [5150] Processing object config file '/etc/centreon-engine/serviceTemplates.cfg' [1490621323] [5150] Processing object config file '/etc/centreon-engine/services.cfg' [1490621323] [5150] Processing object config file '/etc/centreon-engine/misccommands.cfg' [1490621323] [5150] Processing object config file '/etc/centreon-engine/checkcommands.cfg' [1490621323] [5150] Processing object config file '/etc/centreon-engine/contactgroups.cfg' [1490621323] [5150] Processing object config file '/etc/centreon-engine/contactTemplates.cfg' [1490621323] [5150] Processing object config file '/etc/centreon-engine/contacts.cfg' [1490621323] [5150] Processing object config file '/etc/centreon-engine/hostgroups.cfg' [1490621323] [5150] Processing object config file '/etc/centreon-engine/servicegroups.cfg' [1490621323] [5150] Processing object config file '/etc/centreon-engine/timeperiods.cfg' [1490621323] [5150] Processing object config file '/etc/centreon-engine/escalations.cfg' [1490621323] [5150] Processing object config file '/etc/centreon-engine/dependencies.cfg' [1490621323] [5150] Processing object config file '/etc/centreon-engine/connectors.cfg' [1490621323] [5150] Reading resource file '/etc/centreon-engine/resource.cfg' [1490621323] [5150] Warning: Contact 'lg_helpdesk' has no service notification time period defined! [1490621323] [5150] Warning: Contact 'lg_helpdesk' has no host notification time period defined! [1490621323] [5150] Warning: Contact 'network_screen' has no service notification time period defined! [1490621323] [5150] Warning: Contact 'network_screen' has no host notification time period defined! [1490621323] [5150] Warning: Contact 'oem_screen' has no service notification time period defined! [1490621323] [5150] Warning: Contact 'oem_screen' has no host notification time period defined! [1490621323] [5150] Warning: Contact 'scom_screen' has no service notification time period defined! [1490621323] [5150] Warning: Contact 'scom_screen' has no host notification time period defined! [1490621323] [5150] Warning: Contact 'tivoli_screen' has no service notification time period defined! [1490621323] [5150] Warning: Contact 'tivoli_screen' has no host notification time period defined! [1490621323] [5150] Warning: Host 'SSLIU610' has no services associated with it! [1490621323] [5150] Checking global event handlers... [1490621323] [5150] Checking obsessive compulsive processor commands... [1490621323] [5150] [1490621323] [5150] Checked 96 commands. [1490621323] [5150] Checked 0 connectors. [1490621323] [5150] Checked 14 contacts. [1490621323] [5150] Checked 0 host dependencies. [1490621323] [5150] Checked 0 host escalations. [1490621323] [5150] Checked 4 host groups. [1490621323] [5150] Checked 100 hosts. [1490621323] [5150] Checked 0 service dependencies. [1490621323] [5150] Checked 0 service escalations. [1490621323] [5150] Checked 0 service groups. [1490621323] [5150] Checked 484 services. [1490621323] [5150] Checked 4 time periods. [1490621323] [5150] [1490621323] [5150] Total Warnings: 0 [1490621323] [5150] Total Errors: 0

Do you have any idea how to solve this issue? I have read the issue:https://github.com/centreon/centreon/issues/635.But it is useless for me.

ganoze commented 7 years ago

Could you provide more information on your setup ? I would especially like to know your operating system and how you installed the init script because the latest seems broken as per the following line.

[1490621288] [5148] Reading main configuration file '/etc/rc.d/init.d/start'.

This makes me believe that the init script is trying to load an invalid configuration file instead of /etc/centreon-engine/centengine.cfg.

kingleoric2010 commented 7 years ago

thank you for your answer.I installed the centreon by the ces-standard-3.2-x86_64.rar [root@SSAVL2910 log]# cat /etc/issue Centreon Enterprise Server Kernel \r on an \m [root@SSAVL2910 log]# cat /proc/version Linux version 2.6.32-504.16.2.el6.x86_64 (mockbuild@c6b9.bsys.dev.centos.org) (gcc version 4.4.7 20120313 (Red Hat 4.4.7-11) (GCC) ) #1 SMP Wed Apr 22 06:48:29 UTC 2015 [root@SSAVL2910 log]# lsb_release -a LSB Version: :base-4.0-amd64:base-4.0-noarch:core-4.0-amd64:core-4.0-noarch:graphics-4.0-amd64:graphics-4.0-noarch:printing-4.0-amd64:printing-4.0-noarch Distributor ID: CentOS Description: CentOS release 6.6 (Final) Release: 6.6 Codename: Final

kingleoric2010 commented 7 years ago

thank you for your answer. @ganoze I have another remote poller SSAVL2521 that is almost the same.These two servers are both installed by CES iso.Compared to that server SSAVL2521 , this server SSAVL2910 is only added python2.7.9. Strangely enough, I can not start the centengine service in /etc/init.d. there is the same error. [root@SSAVL2521 init.d]# pwd /etc/init.d [root@SSAVL2521 init.d]# centengine start [1490633264] [26465] Reading main configuration file '/etc/rc.d/init.d/start'. [1490633264] [26465] Error: Parsing of global configuration failed: Can't open file '/etc/rc.d/init.d/start' However, I can start the centengine service through this way [root@SSAVL2521 init.d]# service centengine start centengine start: done[ OK ] [root@SSAVL2521 init.d]# service centengine status centengine status: running[ OK ]

kingleoric2010 commented 7 years ago

[root@SSAVL2910 centreon-engine]# tail -20f centengine.log[1490665013] [9358] INITIAL SERVICE STATE: SSAHN806;windows-memory-usage;OK;HARD;1;OK - Physical Memory: Total: 31.99GB - Used: 18.014GB (56%) - Free: 13.976GB (44%)[1490665013] [9358] INITIAL SERVICE STATE: SSAHN806;windows-diskio_queuelength_c;OK;HARD;1;Overall Status - OK (Sample Period 223 sec) - Logical Drive Name="C:" (OK) - _PercentIdleTime=99%, _PercentBusyTime=1%, _PercentDiskTime=3%, _PercentDiskReadTime=0%, _PercentDiskWriteTime=3%, _DiskReadBytesPersec=0B/sec, _DiskReadsPersec=0/sec, _DiskWriteBytesPersec=12.449KB/sec, _DiskWritesPersec=2/sec, CurrentDiskQueueLength=0, _AvgDiskQueueLength=0.0, _AvgDiskReadQueueLength=0.0, _AvgDiskWriteQueueLength=0.0。。。。。。

kingleoric2010 commented 7 years ago

this is my log in /var/log/messages when I failed to start the centengine service Mar 29 08:05:04 SSAVL2910 kernel: centengine[14389]: segfault at 2acec90 ip 00007f60ce28e7ab sp 00007fff68e3f830 error 6 in libpthread-2.14.so[7f60ce288000+18000] Mar 29 08:11:09 SSAVL2910 kernel: centengine[14580]: segfault at 1508c90 ip 00007f6cab07a7ab sp 00007fff015743c0 error 6 in libpthre I can not find the file libpthread-2.14.so in SSAVL2521 .So I delete the libpthread-2.14.so in SSAVL2910 . The centengine service is OK

SylvestreG commented 5 years ago

Hi, a lot of improvements has been done since this issue.

I will close this issue, feel free to re-open this issue if the problem still happens, we will have a look at it.

Best regards,