RenskeW / runcrate-analysis

2 stars 1 forks source link

T2: Data #2

Open RenskeW opened 1 year ago

RenskeW commented 1 year ago

Input and (intermediate) output data.

RenskeW commented 1 year ago

What is represented in CWLProv RO Bundle in RDF:

D2 File characteristics:

D4 Mapping:

In addition, the following can be found in primary-job.json:

RenskeW commented 1 year ago

Results of analysis of RO-Crates converted by runcrate from CWLProv RO Bundles:

Given that primary-job.json and primary-output.json are not included in the RO-Crates generated by runcrate, all information they contain that is not carried over to ro-crate-metadata.json is lost.

RenskeW commented 1 year ago

Suggested enhancement 1:

Add the creation datetime for (at least) the (intermediate) output files to ro-crate-metadata.json.

RenskeW commented 1 year ago

Suggested enhancement 2:

Add the filename of input and (intermediate) output files and directories to ro-crate-metadata.json.

RenskeW commented 1 year ago

Suggested enhancement 3:

Add the size of input and output files, contained in primary-job.json/primary-output.json, to ro-crate-metadata.json.

RenskeW commented 1 year ago

Suggested enhancement 4:

Add the format of input and output files, contained in primary-job.json/primary-output.json, to ro-crate-metadata.json.

RenskeW commented 1 year ago

UPDATE: Checksum and basename now included in ro-crate-metadata.json.

simleo commented 10 months ago

https://github.com/ResearchObject/runcrate/pull/69 and https://github.com/ResearchObject/runcrate/pull/70 also added creation time, size and format.