DataONEorg / object-formats

DataONE Object Formats controlled vocabulary
Apache License 2.0
d1-cn operations
Release: v1.27

object-formats

The DataONE Object Formats controlled vocabulary is a simple vocabulary listing key metadata for the file and object formats used within the DataONE network (https://dataone.org). The goal of the list is to provide a unique identifier, the formatId, for each file format. The formatId is typically more specific than the associated media type, although the two are sometimes identical. For example, the formatId for PNG images is image/png, which matches the media type image/png because that media type is specific to one file format. In contrast, the formatId for METS is http://www.loc.gov/METS/, which is more specific than its media type, text/xml, a media type shared across many formats in the XML family.
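The relationship between a formatId and its media type can be illustrated with a minimal Python sketch. It contains only the two examples given above. The dictionary layout and the function names (`media_type_for`, `matches_media_type`, `format_ids_for`) are hypothetical and chosen for illustration; they are not part of the DataONE vocabulary or of any DataONE API.

```python
from typing import Dict, List

# Hypothetical in-memory excerpt: formatId -> media type.
# Only the two examples from the text above are included.
FORMATS: Dict[str, str] = {
    "image/png": "image/png",
    "http://www.loc.gov/METS/": "text/xml",
}


def media_type_for(format_id: str) -> str:
    """Return the media type recorded for a formatId."""
    return FORMATS[format_id]


def matches_media_type(format_id: str) -> bool:
    """True when the formatId is the same string as its media type."""
    return FORMATS[format_id] == format_id


def format_ids_for(media_type: str) -> List[str]:
    """Return every formatId in the excerpt that shares a media type."""
    return sorted(fid for fid, mt in FORMATS.items() if mt == media_type)


if __name__ == "__main__":
    for fid in FORMATS:
        print(fid, "->", media_type_for(fid), "| same:", matches_media_type(fid))
```

Because each formatId is unique, the lookup from formatId to media type is unambiguous. The reverse lookup returns a list, since a generic media type such as text/xml may be shared by several formats.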

The current set of defined formats in use in DataONE is always accessible from the DataONE Object Formats service:

Related work

Many format vocabularies have been created (and many abandoned), including UDFR, GDFR, PRONOM, and others. The DataONE vocabulary is simpler, more highly structured, and maintained by the repositories that use it.

Contributing

We welcome the addition of new formats as needed for object types within DataONE and related repositories. To propose a new format identifier:

Release process

Periodically, once new formats have been approved, we merge the submitted PRs into the develop branch and test that all changes work together. When the file is ready for release, we merge the develop branch into master and tag it with a release tag of the form v1.22, which represents the current format service data version. The tagged release is then used to update the DataONE formats service.