nextflow-io / nf-co2footprint

[WIP] A Nextflow plugin to estimate the CO2 footprint of pipeline runs.
https://nextflow-io.github.io/nf-co2footprint/
Apache License 2.0
10 stars 3 forks source link

Inclusion of the carbon footprint of cached processes #62

Open Llannelongue opened 8 months ago

Llannelongue commented 8 months ago

This is to discuss how to present the carbon footprint of cached processes.

Simple example: 3 processes [P1] [P2] [P3]

When presenting the carbon footprint of run (2), we can either:

Option 1 gives a better estimate of the total carbon footprint of the pipeline if we were to run it again from start to finish on new data let's say. But option 2 gives a more accurate estimate of the true carbon footprint of running step (2). And if adding run (1) + run (2), option 2 should be used (otherwise the footprint of [P1] would get double counted even though it was only run once).

It seems to depend a lot on what users want to do with this information, so perhaps best to give both information in the report so that users can decide what to do?

skrakau commented 8 months ago

Yes, I also think both would be useful in the future