romainsi / zabbix-VEEAM_B-R

Get value from agent failed: ZBX_TCP_READ() timed out #28

Closed dmtx97 closed 5 years ago

dmtx97 commented 5 years ago

First of all, I love the template! Occasionally I receive the error above on my backup server. The log associated with the issue is below. After a few minutes the connection is restored and everything works again. I have tested the scripts locally and they run fine; the problem only appears when Zabbix executes them. I am not sure whether this is an issue with my network or whether the script execution itself is causing it. Thank you.

 17415:20190411:085014.467 Zabbix agent item "vbr[ResultBackup,1686a71c-dd85-414b-930e-539a87cd91d5]" on host "pcorpbkp01.adc.dbu.edu" failed: another network error, wait for 15 seconds
 17422:20190411:085032.735 cannot send list of active checks to "172.17.5.3": host [pcorpsqlaa.adc.dbu.edu] not found
 17415:20190411:085039.478 temporarily disabling Zabbix agent checks on host "pcorpbkp01.adc.dbu.edu": host unavailable
 17450:20190411:085115.440 cannot send list of active checks to "172.17.1.160": host [.adc.dbu.edu] not found
 17448:20190411:085136.468 cannot send list of active checks to "172.17.2.18": host [.adc.dbu.edu] not found
 17416:20190411:085141.219 enabling Zabbix agent checks on host "pcorpbkp01.adc.dbu.edu": host became available
 17470:20190411:085231.879 cannot send list of active checks to "172.17.5.3": host [pcorpsqlaa.adc.dbu.edu] not found
 17424:20190411:085315.425 cannot send list of active checks to "172.17.1.160": host [.adc.dbu.edu] not found
 17490:20190411:085336.125 cannot send list of active checks to "172.17.2.18": host [.adc.dbu.edu] not found
 17486:20190411:085432.017 cannot send list of active checks to "172.17.5.3": host [pcorpsqlaa.adc.dbu.edu] not found
 17367:20190411:085438.005 Zabbix agent item "vbr[VmCountResultBackup,Programing Workstation,Failed]" on host "pcorpbkp01.adc.dbu.edu" failed: first network error, wait for 15 seconds
 17415:20190411:085457.817 resuming Zabbix agent checks on host "pcorpbkp01.adc.dbu.edu": connection restored
 17476:20190411:085515.371 cannot send list of active checks to "172.17.1.160": host [.adc.dbu.edu] not found
 17417:20190411:085536.812 cannot send list of active checks to "172.17.2.18": host [.adc.dbu.edu] not found
 17486:20190411:085632.156 cannot send list of active checks to "172.17.5.3": host [pcorpsqlaa.adc.dbu.edu] not found
 17435:20190411:085715.285 cannot send list of active checks to "172.17.1.160": host [.adc.dbu.edu] not found
 17489:20190411:085736.454 cannot send list of active checks to "172.17.2.18": host [.adc.dbu.edu] not found
 17448:20190411:085832.298 cannot send list of active checks to "172.17.5.3": host [pcorpsqlaa.adc.dbu.edu] not found
 17500:20190411:085915.281 cannot send list of active checks to "172.17.1.160": host [.adc.dbu.edu] not found
 17427:20190411:085936.090 cannot send list of active checks to "172.17.2.18": host [.adc.dbu.edu] not found
 17416:20190411:085949.936 Zabbix agent item "vbr[ResultBackup,e3cfcccb-62e2-4a6c-a547-003c52657432]" on host "pcorpbkp01.adc.dbu.edu" failed: first network error, wait for 15 seconds
 17415:20190411:090014.981 Zabbix agent item "vbr[ResultBackup,1686a71c-dd85-414b-930e-539a87cd91d5]" on host "pcorpbkp01.adc.dbu.edu" failed: another network error, wait for 15 seconds
 17504:20190411:090032.442 cannot send list of active checks to "172.17.5.3": host [pcorpsqlaa.adc.dbu.edu] not found
 17416:20190411:090039.016 temporarily disabling Zabbix agent checks on host "pcorpbkp01.adc.dbu.edu": host unavailable

romainsi commented 5 years ago

Hello,

I suspect this is more likely a problem of insufficient resources on the Zabbix server or the agent. On the Zabbix server, check whether the "Zabbix busy poller processes" and "Zabbix busy unreachable poller processes" items are capping at 100%.

If that is the case, you will need to add poller instances in zabbix_server.conf.
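
As a rough sketch of that change, the fragment below raises the two poller counts on the server. `StartPollers` and `StartPollersUnreachable` are the standard zabbix_server.conf parameters for these process types; the values are illustrative assumptions, not recommendations from this thread, and should be sized against the busy percentages you actually observe.

```ini
# zabbix_server.conf (illustrative values, adjust to your load)

# Number of pre-forked poller processes used for passive agent checks.
StartPollers=10

# Number of pre-forked pollers dedicated to hosts currently marked unreachable.
StartPollersUnreachable=5
```

The Zabbix server has to be restarted for the new process counts to take effect.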

The number of pre-forked zabbix_agentd instances on the agent can also be increased (e.g. StartAgents=10).
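
On the agent side, a minimal sketch of the same idea looks like this. `StartAgents` is set in the agent's zabbix_agentd.conf; the value 10 is the example given above, and whether it is enough depends on how many `vbr[...]` items are polled at the same time, which is an assumption you would need to check on your own host.

```ini
# zabbix_agentd.conf on the backup server (value taken from the example above)

# Number of pre-forked agent processes that answer passive checks.
# More instances let several slow script-based items run in parallel.
StartAgents=10
```

Restart the Zabbix agent service afterwards so the additional instances are started.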

Also check that the pcorpsqlaa.adc.dbu.edu server remains reachable. This can be tested with the Zabbix Ping template (ICMP).