neuropoly / data-management

Repo that deals with datalad aspects for internal use

Add custom metadata info about provenance of user data #171

Open jcohenadad opened 2 years ago

jcohenadad commented 2 years ago

Currently, for data shared by users, we add the following info in the JSON. Example:

julien-macbook:~/data.neuro/sct-testing-large/sub-user0017/anat $ cat sub-user0017_T2star.json 
{   "Metadata":
    {
      "added_by": "Alexandru Foias",
      "added_on": "2021-05-27",
      "contact": "XXX",
      "URL": "email from 2021-05-26"
    }
}

This might not be BIDS-compliant, so we need to check whether it is a limitation and, if so, find a solution.
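
For reference, here is a minimal sketch of how such a block could be written into a sidecar without touching the fields already there. The function name `add_provenance` and its parameters are hypothetical, chosen for illustration only; the keys mirror the example above, and this is not necessarily how the entries are added in practice.

```python
import json
from datetime import date
from pathlib import Path


def add_provenance(sidecar: Path, added_by: str, contact: str, source: str,
                   key: str = "Metadata") -> dict:
    """Insert a provenance block under `key`, keeping all existing fields.

    Hypothetical helper: the key names follow the example in this issue.
    """
    # Start from the existing sidecar content, or an empty object if absent.
    data = json.loads(sidecar.read_text()) if sidecar.exists() else {}
    data[key] = {
        "added_by": added_by,
        "added_on": date.today().isoformat(),  # e.g. "2021-05-27"
        "contact": contact,
        "URL": source,
    }
    sidecar.write_text(json.dumps(data, indent=4) + "\n")
    return data
```

Passing a different `key` keeps the helper usable if the field name changes later.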

alexfoias commented 2 years ago

From my understanding, we are free to use this custom field. Most fields in the JSON sidecar files are recommended and very few are required. I think that the bids-validator doesn't check custom fields.

https://bids-specification.readthedocs.io/en/stable/04-modality-specific-files/01-magnetic-resonance-imaging-data.html

mariehbourget commented 2 years ago

Thanks @alexfoias for the clarification.

I think you are right. I was under the impression that additional "non-standardized" metadata fields were prohibited but I did not find anything preventing it in the spec. Moreover, the BIDS-validator does not raise errors (nor warnings) for the unstandardized fields at the moment.

As a reference, I found this about how to deal with "unstandardized" data and metadata in the spec. It does not name JSON metadata specifically but I think it applies:

Knowing all that, here are some thoughts, feel free to disregard if not appropriate:

jcohenadad commented 2 years ago

The name Metadata is pretty generic, and I would recommend using a more specific name, e.g. NeuropolyMetadata, so it is clear that it is not standardized.

+1
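
If we go with the rename, something like the following could migrate existing sidecars. This is only a sketch: `rename_metadata_key` is a hypothetical helper, it assumes the block sits at the top level of the JSON as in the example above, and it would need testing on a copy of the dataset first.

```python
import json
from pathlib import Path


def rename_metadata_key(sidecar: Path, old: str = "Metadata",
                        new: str = "NeuropolyMetadata") -> bool:
    """Rename the top-level key `old` to `new`; return True if the file changed."""
    data = json.loads(sidecar.read_text())
    # Nothing to do if the old key is absent; never overwrite an existing new key.
    if old not in data or new in data:
        return False
    # Rebuild the dict so the renamed key keeps its original position.
    renamed = {(new if k == old else k): v for k, v in data.items()}
    sidecar.write_text(json.dumps(renamed, indent=4) + "\n")
    return True
```

Looping over `Path(dataset_root).rglob("*.json")` (with `dataset_root` pointing at the dataset) would apply it to every sidecar; files without the old key are left untouched.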