NASA-PDS / nucleus

Nucleus is a software platform used to create workflows for the Planetary Data System (PDS).
https://nasa-pds.github.io/nucleus
Apache License 2.0

Nucleus: Track the DAG processing status of each PDS product for monitoring and troubleshooting #88

Closed ramesh-maddegoda closed 3 months ago

ramesh-maddegoda commented 9 months ago

💡 Description

At the moment, multiple product labels are processed in Nucleus DAGs as batches. The Airflow UI shows the completion status of each batch, and individual product failures in a given batch can be identified by checking the task logs in the Airflow UI.

However, when we process thousands of products within a short duration, it is more efficient to keep track of the processing status of each PDS product in a central place such as a database. This will also allow us to easily query for failures and troubleshoot them.

This task is to design and implement an approach to keep track of the processing status of individual PDS products.
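
To make the idea concrete, here is a minimal sketch of per-product status tracking in a database table. Everything in it is an assumption for illustration: the table name `product_processing_status`, its columns, the status values, and the helper functions are hypothetical and not taken from the Nucleus code base. It uses an in-memory SQLite database so it runs standalone, whereas a real deployment would use the project's own database.

```python
import sqlite3
from datetime import datetime, timezone

# Hypothetical schema: table and column names are illustrative only.
SCHEMA = """
CREATE TABLE IF NOT EXISTS product_processing_status (
    product_label TEXT NOT NULL,
    dag_id        TEXT NOT NULL,
    task_id       TEXT NOT NULL,
    status        TEXT NOT NULL CHECK (status IN ('RUNNING', 'SUCCESS', 'FAILED')),
    updated_at    TEXT NOT NULL,
    PRIMARY KEY (product_label, dag_id, task_id)
)
"""


def record_status(conn, product_label, dag_id, task_id, status):
    """Insert or update the status of one product at one DAG step."""
    conn.execute(
        """
        INSERT INTO product_processing_status
            (product_label, dag_id, task_id, status, updated_at)
        VALUES (?, ?, ?, ?, ?)
        ON CONFLICT (product_label, dag_id, task_id)
        DO UPDATE SET status = excluded.status,
                      updated_at = excluded.updated_at
        """,
        (product_label, dag_id, task_id, status,
         datetime.now(timezone.utc).isoformat()),
    )
    conn.commit()


def failed_products(conn, dag_id):
    """Return (product_label, task_id) pairs that failed in the given DAG."""
    rows = conn.execute(
        "SELECT product_label, task_id FROM product_processing_status "
        "WHERE dag_id = ? AND status = 'FAILED' "
        "ORDER BY product_label, task_id",
        (dag_id,),
    )
    return rows.fetchall()


# Example usage with made-up product identifiers and DAG/task names.
conn = sqlite3.connect(":memory:")
conn.execute(SCHEMA)
record_status(conn, "urn:nasa:pds:example:data:product_a", "example_dag", "validate", "SUCCESS")
record_status(conn, "urn:nasa:pds:example:data:product_b", "example_dag", "validate", "RUNNING")
record_status(conn, "urn:nasa:pds:example:data:product_b", "example_dag", "validate", "FAILED")
print(failed_products(conn, "example_dag"))
```

Keying each row on product, DAG, and task means a later status update overwrites the earlier one for the same step, so a single query lists every failed product without reading the task logs.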

⚔️ Parent Epic / Related Tickets

No response

tloubrieu-jpl commented 5 months ago

Ramesh is adding status monitoring to the Nucleus RDS database, as a new table to avoid conflicts with the product completion tracking.

ramesh-maddegoda commented 4 months ago

Created pull request: https://github.com/NASA-PDS/nucleus/pull/117

tloubrieu-jpl commented 4 months ago

The status of each DAG step is now stored. This information is not yet available to users.

ramesh-maddegoda commented 3 months ago

The pull request https://github.com/NASA-PDS/nucleus/pull/117 has been merged.