astronomer / ask-astro

An end-to-end LLM reference implementation providing a Q&A interface for Airflow and Astronomer
https://ask.astronomer.io/
Apache License 2.0
192 stars 47 forks source link

Add basic monitoring #60

Closed pankajastro closed 10 months ago

pankajastro commented 11 months ago

closes: https://github.com/astronomer/ask-astro/issues/39

an example post success and fail service status

Screenshot 2023-11-01 at 11 27 58 AM
cloudflare-workers-and-pages[bot] commented 11 months ago

Deploying with  Cloudflare Pages  Cloudflare Pages

Latest commit: 6762d20
Status: ✅  Deploy successful!
Preview URL: https://9cdff5ab.ask-astro.pages.dev
Branch Preview URL: https://monitoring-dag.ask-astro.pages.dev

View logs

pankajastro commented 11 months ago
  • On a regular basis we can post the status on slack.
  • We should run the DAGs like very 10 minutes or so and check if anything is down if it is we should post a status on Slack (and maybe email folks).

As per the current state of PR, it would post on Slack and we can add an env variable for schedule intervals for example */10 * * * *

sunank200 commented 11 months ago
  • On a regular basis we can post the status on slack.
  • We should run the DAGs like very 10 minutes or so and check if anything is down if it is we should post a status on Slack (and maybe email folks).

As per the current state of PR, it would post on Slack and we can add an env variable for schedule intervals for example */10 * * * *

As discussed in call:

pankajastro commented 11 months ago

@sunank200 I have tested this you can check the sample message in the PR description

pankajastro commented 11 months ago

@jedcunningham requesting your feedback on this!