update README flowchart

robertoostenveld commented 7 months ago

Yesterday @marcelzwiers made some additions to the general README following discussions in the wp15 online meeting. He came up with this draft flowchart:

data owner -> sends data to product owner
data user -> requests anonymous data from the product owner
data user -> sends analysis pipeline to the product owner and requests pseudomized or anonymized output data. This output data is produced by the data user's analysis pipeline running on the source data that is pseudomized (to a necessary degree) by the product owner (using software developed in this WP)
product owner -> requests review and permission from the data owner to send the pseudonymized or anonymized output data to the data user
data user -> repeats step 3-4 until pipeline is finished
??

I suggest to refine this to

data owner -> sends input data to the platform
data user -> requests the product owner for access to the platform
data user -> installs software and dependencies
data user -> requests the product owner for pseudonomized data to be disclosed (using tools developed in this WP)
product owner -> requests the data owner for a review and permission to disclose the pseudonomized data to the data user
data owner -> grants permission
data user -> interactively implements and tests analysis pipeline on pseudonomized data
data user -> requests the product owner for the pipeline to be executed on the source data, output data is not yet disclosed
product owner -> requests the data owner for a review and permission to disclose the output data to the data user
data owner -> grants permission
data user -> uses output data to answer research question and publishes research outcomes

vincent-legoll commented 6 months ago

@robertoostenveld Would it be useful to make the input data scrambling step explicitely in the flow chart ? probably between 4 & 5, or as being part of step 5.

BTW, I think the "scrambled" -> "pseudonymized" is a good move

robertoostenveld commented 6 months ago

The process of scrambling is indeed implicit in-between 4 and 5. I have been a bit sloppy in describing the various actions. I think that they all fall in either: requesting something, doing something that involves data+compute, reviewing and (dis)approving something.

I don't want to renumber the list right now, but will add it to step 5.

robertoostenveld commented 3 months ago

the list that describes the flowchart is still overall correct, but does not include the separation between single subject and group analyses, nor the resampling and noise calibration. However, it is interesting to document the procedure not only from the technical side (i.e., all steps that are done and seen by the SIESTA developers), but also from the researchers side (all steps that the researcher takes) and separately from the data rights holder perspective.

SIESTA-eu / wp15

update README flowchart #8