Earth-Information-System / fireatlas

Playbook on How to Debug Failing DPS Jobs #87

Closed ranchodeluxe closed 1 month ago

ranchodeluxe commented 1 month ago

Problem

Some of us are better at debugging failing DPS jobs than others. Let's write a playbook doc that lays out the steps for debugging them.

mccabete commented 1 month ago

Case study in an API outage. The problem was related to #93 and was traced back to an outage on the LAADS DAAC side. Now, however, the largefire NRT API is still not updating. My debugging steps were:

- check that the input data is there
- check that feds ran successfully and that serialization files through today exist

I am now re-running the conus nrt lf v2 workflow to see whether it fails somewhere along the way.
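
The "files through today exist" check above could be scripted roughly as follows. This is only a sketch under assumptions: it supposes that each daily input or serialization file carries a `YYYYMMDD` stamp somewhere in its name and sits in a single local directory. The function names, the directory, and the file names are hypothetical and are not taken from the fireatlas code base.

```python
import re
from datetime import date, timedelta
from pathlib import Path

# Assumption (hypothetical convention): every daily file has a YYYYMMDD stamp in its name.
DATE_STAMP = re.compile(r"(\d{8})")


def dates_present(directory: Path) -> set[date]:
    """Collect the dates stamped into the file names found in `directory`."""
    found: set[date] = set()
    if not directory.is_dir():
        # A missing directory simply means no dates are covered.
        return found
    for path in directory.iterdir():
        match = DATE_STAMP.search(path.name)
        if match is None:
            continue
        stamp = match.group(1)
        try:
            found.add(date(int(stamp[:4]), int(stamp[4:6]), int(stamp[6:8])))
        except ValueError:
            # Eight digits that do not form a valid calendar date; skip them.
            continue
    return found


def missing_dates(directory: Path, start: date, end: date) -> list[date]:
    """Return every day in [start, end] that has no stamped file in `directory`."""
    present = dates_present(directory)
    n_days = (end - start).days + 1
    wanted = (start + timedelta(days=offset) for offset in range(n_days))
    return [day for day in wanted if day not in present]
```

Calling `missing_dates(Path("path/to/serialization/dir"), start, date.today())` with a real directory would list the days that have no output yet, which narrows down the day on which the workflow stopped producing files.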

ranchodeluxe commented 1 month ago

First draft is up via #108.