reanahub / reana-workflow-engine-snakemake

REANA Workflow Engine Snakemake
MIT License
0 stars 21 forks source link

caching: investigate supporting between-workflows caching #68

Open tiborsimko opened 8 months ago

tiborsimko commented 8 months ago

Snakemake offers automated caching capabilities when a user restarts the same workflow on the same workspace. Snakemake automatically reuses outputs of past rules if their inputs did not change, and re-executes only those rules that really need it. This works well and is already fully supported in REANA.

Snakemake offers another experimental caching feature for between-workflows caching. Here the cache is external to the workspaces, so it can be used when the user needs e.g. to store input files or big computations that will be reused in several independent workflows. The user can then govern Snakemake's behaviour by means of a cache: True clause in rules instructing the workflow engine to use it or not. This feature is not currently supported by REANA.

The goal of this issue is: