Open jluethi opened 1 year ago
As of #803, we introduced a server-side history. This does not necessarily need to fully correspond to the OME-Zarr-side history, since:
With this first version of the history, we can answer part of the questions above:
run_fractal_task function: NA (not yet)
I'm starting a new overview issue for the discussions around storing history/status of what workflows were run on a given dataset. We had prior discussions on this topic in https://github.com/fractal-analytics-platform/fractal-server/issues/236, https://github.com/fractal-analytics-platform/fractal-server/issues/108, https://github.com/fractal-analytics-platform/fractal-server/issues/14, https://github.com/fractal-analytics-platform/fractal-tasks-core/issues/177#issuecomment-1432643699 and in some other places.
This issue serves as the overview of the decisions made and links to all relevant sub-issues.
Motivation
For reproducibility, easier debugging, and potentially smarter submission scenarios (rerun parts, rerun what failed, etc.), we want to have a history of what processing has been applied to which part of the dataset.
Main questions
1. What is written as history? => Issue https://github.com/fractal-analytics-platform/fractal-server/issues/507
My default would be to record, for each image, which task was run with which parameters.
2. Where is history written to? => Issue https://github.com/fractal-analytics-platform/fractal-server/issues/508
I strongly favor writing it with the OME-Zarr file. We can discuss whether the database also keeps track of it, but the ground truth should be in the actual OME-Zarr file.
3. Who is in charge of writing the history? => Issue https://github.com/fractal-analytics-platform/fractal-server/issues/509
The run_fractal_task function? I'm unsure about this one.
4. When is history written? => Issue https://github.com/fractal-analytics-platform/fractal-server/issues/511
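To make questions 1 and 2 more concrete, here is a minimal sketch of what a per-image history entry stored with the OME-Zarr file could look like. It assumes a Zarr v2 layout, where group attributes live in a `.zattrs` JSON file inside the image directory. The `history` key, the entry fields (`task`, `parameters`, `status`, `timestamp`) and the `append_history` helper are all hypothetical names used for illustration; none of them is part of the OME-NGFF specification or of the existing Fractal code.

```python
import json
from datetime import datetime, timezone
from pathlib import Path


def append_history(image_dir, task_name, parameters, status="done"):
    """Append one history entry to the .zattrs file of an image group.

    Hypothetical helper: "history" is an illustrative attribute key,
    not something defined by OME-NGFF or by Fractal.
    """
    zattrs_path = Path(image_dir) / ".zattrs"
    # Load the existing attributes so that other metadata is preserved
    if zattrs_path.exists():
        attrs = json.loads(zattrs_path.read_text())
    else:
        attrs = {}
    entry = {
        "task": task_name,
        "parameters": parameters,
        "status": status,
        "timestamp": datetime.now(timezone.utc).isoformat(),
    }
    # Entries are appended in order, so the list reads as a processing log
    attrs.setdefault("history", []).append(entry)
    zattrs_path.write_text(json.dumps(attrs, indent=4))
    return entry
```

With this layout, the history travels with the image if the OME-Zarr file is copied or moved, and a failed run can be found by checking the `status` of the last entry. Whether the server, the task, or a wrapper calls such a helper is exactly question 3 and is left open here.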
Are there more general questions we need to address on this topic?