DataONEorg / object-formats

DataONE Object Formats controlled vocabulary
Apache License 2.0

Create new format-id for science-on-schema.org Dataset in JSON-LD #21

Closed datadavev closed 3 years ago

datadavev commented 3 years ago

Format Metadata

Format description

Dataset resources may be described by schema.org/Dataset markup serialized as JSON-LD in dataset landing pages. For DataONE to collate and index such metadata, it needs a suitable formatId.
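To illustrate what "schema.org/Dataset markup serialized as JSON-LD in dataset landing pages" looks like in practice, here is a minimal Python sketch that pulls `application/ld+json` script blocks out of a landing page and keeps those typed as `Dataset`. The names `JsonLdExtractor` and `extract_datasets` are hypothetical and not part of any DataONE tooling. The sketch only looks at top-level objects, so it does not follow `@graph` or handle other RDF serializations.

```python
import json
from html.parser import HTMLParser


class JsonLdExtractor(HTMLParser):
    """Collect the text of every <script type="application/ld+json"> block."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.blocks = []

    def handle_starttag(self, tag, attrs):
        script_type = (dict(attrs).get("type") or "").lower()
        if tag == "script" and script_type == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self.blocks.append("".join(self._buffer))
            self._in_jsonld = False


def extract_datasets(html):
    """Return parsed JSON-LD objects whose @type is, or includes, Dataset."""
    parser = JsonLdExtractor()
    parser.feed(html)
    parser.close()
    datasets = []
    for raw in parser.blocks:
        try:
            doc = json.loads(raw)
        except json.JSONDecodeError:
            continue  # skip malformed blocks rather than failing the page
        candidates = doc if isinstance(doc, list) else [doc]
        for item in candidates:
            if not isinstance(item, dict):
                continue
            types = item.get("@type", [])
            if isinstance(types, str):
                types = [types]
            if "Dataset" in types:
                datasets.append(item)
    return datasets
```

Calling `extract_datasets(page_html)` on a landing page's HTML returns a list of dictionaries, one per embedded Dataset description.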

Specification / Namespace documentation

Checklist

Considerations

application/ld+json is likely the most common serialization format, though technically any of the RDF formats may be used.

The formal definition of Dataset is at https://schema.org/Dataset, though science-on-schema.org provides numerous recommendations that are necessary for effective use with science data.

Versioning is necessary as recommendations change over time.

mbjones commented 3 years ago

This looks good to me @datadavev . Are you happy with the formatId as proposed? Does anyone else want to weigh in on that before we create a PR?

mbjones commented 3 years ago

My main comment is the formatId is complicated and would likely invite typos.

csjx commented 3 years ago

@mbjones @datadavev - While I think it's a fairly complex formatId string, we have others that are not terribly memorable and are prone to typos as well. That said, it's important that we capture the type and version in the formatId and the formatName, and I think this formatId does this pretty concisely. Perhaps the formatName value could be Science on Schema.org Dataset JSON-LD Metadata Version 1.2 or something a bit more verbose. Those are the user-facing descriptions, so I think they need to convey the details.

amoeba commented 3 years ago

I'm not sure I see why the ;ld+json is in the formatId. Can we just have it as science-on-schema.org/Dataset/1.2? Otherwise I'm fine with this.

datadavev commented 3 years ago

Because the content can be serialized in multiple formats. This formatId is specific to the JSON-LD serialization.
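As a rough illustration of how the parts of such an identifier relate, the Python sketch below splits a formatId of the shape `namespace/Type[/version];serialization` into its components. The `FormatId` tuple and `parse_format_id` helper are hypothetical names invented for this example. Nothing in this thread says DataONE or Metacat parse formatIds this way, so treat it as a reading aid only.

```python
from typing import NamedTuple, Optional


class FormatId(NamedTuple):
    namespace: str            # e.g. "science-on-schema.org"
    type_name: str            # e.g. "Dataset"
    version: Optional[str]    # e.g. "1.2", or None when no version is given
    serialization: str        # e.g. "ld+json"


def parse_format_id(format_id: str) -> FormatId:
    """Split 'namespace/Type[/version];serialization' into named parts."""
    path, sep, serialization = format_id.partition(";")
    if not sep or not serialization:
        raise ValueError(f"missing ';serialization' suffix: {format_id!r}")
    parts = path.split("/")
    if not all(parts) or len(parts) not in (2, 3):
        raise ValueError(f"expected namespace/Type[/version]: {format_id!r}")
    version = parts[2] if len(parts) == 3 else None
    return FormatId(parts[0], parts[1], version, serialization)


fid = parse_format_id("science-on-schema.org/Dataset/1.2;ld+json")
print(fid.type_name, fid.version, fid.serialization)
```

The suffix after the semicolon is what distinguishes the JSON-LD serialization from any other RDF serialization of the same Dataset description.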

amoeba commented 3 years ago

Ah, right. Thanks @datadavev.

taojing2002 commented 3 years ago

I am using the format id science-on-schema.org/Dataset/1.2;ld+json in the Metacat implementation. If we are going to change it, it is better to do so now to avoid extra effort modifying code later.

taojing2002 commented 3 years ago

Removed the version from the format id: science-on-schema.org/Dataset;ld+json