fulcrumgenomics / dagr

A Scala-based DSL and framework for writing and executing bioinformatics pipelines as Directed Acyclic GRaphs
MIT License
69 stars, 14 forks

I want to know all the inputs and outputs produced by the pipeline, and who made them #317

Open nh13 opened 6 years ago

nh13 commented 6 years ago

At the end of the execution, I want a report that has the following columns:

I am not sure how to implement it, but perhaps with some annotations (@input, @output, @delete)? When a task is scheduled, its inputs are added. When a task executes successfully, its outputs are added (and checked). When a task deletes a file, the file is marked as deleted. For the tasks that delete files, we could instead use @input(deleted=true).
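
To make the bookkeeping concrete, here is a rough sketch of the three events described above (inputs recorded at scheduling, outputs recorded on success, deletions marked). It is plain Scala and does not use any dagr API: `FileLedger`, `FileRecord`, `onScheduled`, and `onSucceeded` are hypothetical names, explicit method calls stand in for the annotation idea, the check that outputs actually exist is left out, and the path/task/role report layout is only a placeholder for whatever columns the report should have.

```scala
import java.nio.file.{Path, Paths}
import scala.collection.mutable

// Hypothetical sketch: none of these types exist in dagr.
sealed trait FileRole
case object Input extends FileRole
case object Output extends FileRole
case object Deleted extends FileRole

/** One row of the ledger: which task touched which file, and how. */
final case class FileRecord(path: Path, task: String, role: FileRole)

final class FileLedger {
  private val records = mutable.ArrayBuffer.empty[FileRecord]

  /** Called when a task is scheduled: record its declared inputs. */
  def onScheduled(task: String, inputs: Seq[Path]): Unit =
    inputs.foreach(p => records += FileRecord(p, task, Input))

  /** Called when a task succeeds: record its outputs and any files it deleted. */
  def onSucceeded(task: String, outputs: Seq[Path], deleted: Seq[Path] = Seq.empty): Unit = {
    outputs.foreach(p => records += FileRecord(p, task, Output))
    deleted.foreach(p => records += FileRecord(p, task, Deleted))
  }

  /** Tab-separated lines in the order the events were recorded (placeholder columns). */
  def report: Seq[String] =
    records.toSeq.map(r => s"${r.path}\t${r.task}\t${r.role}")
}
```

With something like this, a task manager could call `onScheduled` and `onSucceeded` at the matching points in a task's lifecycle and print `report` at the end of the run. Annotations such as @input(deleted=true) would then only be a way of discovering the `inputs`, `outputs`, and `deleted` arguments automatically.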