As a researcher, I want to be able to access all provenance data (all parameters, software versions of tools used and locations/checksums of source data) for future reference when publishing the results of the analysis, deciding whether I need to regenerate derivatives with a new parameter set, or attempting to reproduce the results of another researcher on my dataset.
Acceptance Criteria
When a pipeline, is run, full provenance data is stored alongside the derivatives such that a pipeline can be rerun and any changes to upstream derivatives can be detected.
Description
As a researcher, I want to be able to access all provenance data (all parameters, software versions of tools used and locations/checksums of source data) for future reference when publishing the results of the analysis, deciding whether I need to regenerate derivatives with a new parameter set, or attempting to reproduce the results of another researcher on my dataset.
Acceptance Criteria
When a pipeline, is run, full provenance data is stored alongside the derivatives such that a pipeline can be rerun and any changes to upstream derivatives can be detected.