gbv / format.gbv.de

Formatdatenbank
https://format.gbv.de/
GNU General Public License v3.0
7 stars 4 forks source link

Add section on (atomic) data types #71

Open nichtich opened 7 months ago

nichtich commented 7 months ago

The term data types ("Datentypen") is used for data formats being part of a programming language or another data format. The term is not helpful to differentiate types of data formats but the stricter atomic data types or primitive data types are missing from the format registry: atomic data types ("Atomare Datentypen") are data elements that are not divided into multiple sub-elements. They still can be derived from another atomic data type. Examples:

String-based atomic data types are often restricted by a regular expression.

Atomic data types already included in the format registry:

Atomic data types are collected in the Data Type Registry (DTR) and ePIC Data Type Registry, among other data types, so we could reference and/or import from there. Wikidata contains around 8500 identifier types with regular expression, not all of them used as data types though. Wikidata further contains some primitive data types.