IBMStreams / streamsx.monitoring

The com.ibm.streamsx.monitoring toolkit provides capabilities to create applications that monitor IBM Streams and its applications.
https://ibmstreams.github.io/streamsx.monitoring/
Other
5 stars 5 forks source link

Create failed PE detector microservice. #112

Closed ddebrunner closed 5 years ago

ddebrunner commented 6 years ago

A microservice that detects if a PE has been stopped for a period of time (e.g. a minute). Publishes a tuple if it detects such a PE. Ideally it would differentiate between PEs that were manually stopped and those that failed and could not restart after 10 attempts.

Would use the job status microservice from #111.

Use cases (in separate microservices):

I have initial code, needs more work to fully develop to detect manually stopped PEs etc.

ddebrunner commented 6 years ago

Also an attempt to use the monitoring toolkit to build "smart" monitoring and recovery solutions for Streams rather than force everything into the core product.

ddebrunner commented 6 years ago

Example of output sent to Slack using JobStatusService, PEFailedservice and SlackMessageService microservices combined together with no coding.

image

markheger commented 5 years ago

Initial version is part of 2.0 release