HariSekhon / Nagios-Plugins

450+ AWS, Hadoop, Cloud, Kafka, Docker, Elasticsearch, RabbitMQ, Redis, HBase, Solr, Cassandra, ZooKeeper, HDFS, Yarn, Hive, Presto, Drill, Impala, Consul, Spark, Jenkins, Travis CI, Git, MySQL, Linux, DNS, Whois, SSL Certs, Yum Security Updates, Kubernetes, Cloudera etc...
https://www.linkedin.com/in/HariSekhon
Other
1.13k stars 507 forks source link

Add extra metrics to check_kafka script(s) #201

Open jazzl0ver opened 6 years ago

jazzl0ver commented 6 years ago

Hi,

According to Gwen Shapira's presentation (https://youtu.be/ppDxaqpw8Bw), there're some Kafka metrics that should be monitored in any case (besides the one you've already implemented):

Any chances you would add these to the script(s)? It would be really awesome!

HariSekhon commented 6 years ago

These have been on the backburner for a while actually as I'm coding so many different tools on github.

Right now the standard broker write test can test each partition and enforce the ack from a given number of brokers (waiting to ack from a broker that is offline would cause the check to fail with a write self timeout raising a failure condition).

Also the zookeeper checks can check pretty much anything in zookeeper including child znodes and ephemeral znodes used for cluster memberships, znode contents etc.

I think it would be nice to make additional obvious Kafka checks along the lines you have mentioned above and will leave this ticket open until I do.

chandanbalaji121986 commented 4 years ago

Do we have any progress made on this request? it will help my requirement as well.

HariSekhon commented 4 years ago

I would have done this at my last gig was a hedge fund that was pretty terrible and didn't let me do any reusable development or anything for github... I'm not working on Kafka at my current gig, but when I next am I will try to get back to this. I don't have an ETA on this and it may be a while...

chandanbalaji121986 commented 4 years ago

Thank you Hari.

On Tue, Dec 10, 2019 at 7:18 PM Hari Sekhon notifications@github.com wrote:

I forgot about this ticket, I would have done this at my last gig but they were pretty terrible which stopped me doing any reusable development out here... I am not working on Kafka at my current gig, but when I next am I will try to get back to this.

— You are receiving this because you commented. Reply to this email directly, view it on GitHub https://github.com/HariSekhon/Nagios-Plugins/issues/201?email_source=notifications&email_token=AMA6TBPCZOI77UYX7TLEDEDQX6MZJA5CNFSM4FQA7DTKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEGPJA2A#issuecomment-564039784, or unsubscribe https://github.com/notifications/unsubscribe-auth/AMA6TBJ7TL3PNGK3TY7THW3QX6MZJANCNFSM4FQA7DTA .

-- Thanks and Regards, Chandanbalaji AS 7259264264 aschandanbalaji1@gmail.com