Improve error handling of index job management

Azure-Samples / graphrag-accelerator

One-click deploy of a Knowledge Graph powered RAG (GraphRAG) in Azure

https://github.com/microsoft/graphrag

MIT License

1.89k stars 311 forks source link

Improve error handling of index job management #166

Closed jgbradley1 closed 2 months ago

jgbradley1 commented 2 months ago

This PR improves the error handling related to when/if indexing jobs error out (for example, due to OOM errors) before having a chance to update the job status in cosmosdb.

To resolve this edge case, the index job manager (a cronjob) has been updated to check both the index job status in cosmosdb and checks whether or not there is a k8s job running.