DataBiosphere / toil

A scalable, efficient, cross-platform (Linux/macOS) and easy-to-use workflow engine in pure Python.
http://toil.ucsc-cgl.org/.
Apache License 2.0

planning for CWLProv in toil-cwl-runner #2390

Open mr-c opened 6 years ago

mr-c commented 6 years ago

Most of the progress so far is on the wip-prov branch: https://github.com/DataBiosphere/toil/tree/wip-prov

┆Issue is synchronized with this Jira Story ┆Issue Number: TOIL-280

mr-c commented 5 years ago

@psafont Can you update the first comment above with your status and any additional work you think is needed?

psafont commented 5 years ago

There's quite a bit of friction in making the changes because CWLProv is part of the cwltool package. I don't know how beneficial it would be to separate it into a different module.

Some functions have little separation of concerns: they call provenance.py's functions directly. I think this is linked to some of the tight coupling we've already solved. The question is how far we want to go. (I've only spent about an hour looking into @inutano's provenance work.)