artefactual-sdps / enduro

A tool to support ingest and automation in digital preservation workflows
https://enduro.readthedocs.io/
Apache License 2.0

Problem: Difficult to tell when tasks fail in successful workflows #987

Open fiver-watson opened 4 months ago

fiver-watson commented 4 months ago

Description

Right now, a Preservation action workflow can complete successfully even though some Archivematica tasks have failed - for example, parsing a premis.xml file included in a PIP and writing it to the AIP METS file.

Because Archivematica outputs so many tasks, a single error can easily be overlooked in the long list (generally over 100) of tasks in an AIP creation workflow. There is also no indication at the Preservation action header level that any of the tasks below have issues or failures, so archivists can easily miss critical information when evaluating the success of an AIP creation workflow.

This is compounded by the fact that the Package Statuses legend, shown as an expandable section on the Packages browse page, currently defines the "Done" status as:

The current workflow or task has completed without errors.

(emphasis on "without errors" added).

Combined, this means that business-critical preservation tasks might fail without the user ever noticing, and the user may continue to preserve content with undesirable errors in the AIP creation process.

We should clarify the wording of the Done status, and better highlight when there are non-critical errors in the tasks of a given workflow, so archivists can decide for themselves whether this invalidates the AIP or can be safely ignored.
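As a rough illustration of the behavior being asked for, here is a minimal sketch of how a header-level warning could be derived from the tasks of an otherwise successful preservation action. This is not Enduro's actual data model or code: the `Status`, `Task`, and `PreservationAction` types, the status values, and the label wording are all hypothetical and exist only to make the idea concrete.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List


class Status(Enum):
    # Hypothetical status values, not Enduro's real ones.
    DONE = "done"
    ERROR = "error"
    IN_PROGRESS = "in progress"


@dataclass
class Task:
    name: str
    status: Status


@dataclass
class PreservationAction:
    name: str
    status: Status
    tasks: List[Task]


def failed_tasks(action: PreservationAction) -> List[Task]:
    """Return the tasks of this action whose status is ERROR."""
    return [t for t in action.tasks if t.status is Status.ERROR]


def header_label(action: PreservationAction) -> str:
    """Build the header label, adding a warning when a DONE action has failed tasks."""
    failed = failed_tasks(action)
    if action.status is Status.DONE and failed:
        return f"{action.status.value} (warning: {len(failed)} task(s) failed)"
    return action.status.value
```

With something along these lines, a "Create AIP" action that finished as done but contains a failed "Load PREMIS events from metadata/premis.xml" task would be flagged at the header, and the archivist could decide whether the failure matters.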

To reproduce

  1. Use enduro main branch no later than commit 5763d35ebf1df28a5a827411509e39c6549aa5b1
  2. Run a Vecteur SIP or AIP through
  3. Inspect the results - check that the status is "Done"
  4. Expand the preservation actions and scroll down to around task 33, where the "Load PREMIS events from metadata/premis.xml" task has failed
  5. Return to the Packages browse page, expand the Statuses legend, and read the definition of the "Done" status

Resulting error

Expected behavior

Additional context and proposed resolution

I propose the following changes:

Here are the proposed changes to the Status legend and the definitions. You can also see an example of the new :warning: status being included on an otherwise successful status in the table below the legend:

enduro-pkg-statuses-legend-w-warning

Here is a package details page, showing a Create AIP preservation action that includes an error:

enduro-pres-actions-w-warning

Here is the same page, when hovering over the :warning: icon:

enduro-pres-actions-w-warning-hover