Fraunhofer-FIT-DSAI / drk-information-model

Datenraum Kultur (DRK) Information Model
3 stars 0 forks source link

DRK Information Model

Datenraum Kultur Information Model (DRK IM): The DRK IM is a Semantic Information Model that aims to define formal specifications of concepts, relationships, and constraints (application profiles) within the culture dataspace. This formalization provides a structured framework for understanding and organizing information within the DRK project. In addition, this forms a basis for representing information in DRK in the form of a Knowledge Graph (DRK KG) with more advanced querying, inference, and reasoning capabilities.

DRK IM Contents

Naming Conventions

How-To (DRK IM Development Process)

Goals

Prerequisite

Example use case scenario:

An example workflow that illustrates the data source, the potential data consumers and data flow.

Steps

DRK-IM-Development-Process

  1. Identification of concepts, properties, and relationships
    • Actors: UC partners (domain experts), with support from FIT DRK IM Group (Semantic Web experts).
    • For one or more data sources, a new GitHub issue should be created (preferably by a UC partner). This issue should specify the new concepts, properties, and/or application profiles (constraints) that need to be added to the DRK Information Model (IM) based on the data model of the data source(s).
    • Preferable formats: A graph, a JSON Schema as a part of OpenAPI spec, a UML Class Diagrams or an Entity Relationship (ER) Diagram that specifies the concepts, properties, and their interrelationships. A link to Miro, Mural, draw.io, or lucidchart board is also welcome. Where possible, the UC partner suggests existing ontologies or vocabularies where these concepts are defined. Even semantic descriptions in TTL/TRIG/JSON-LD format can be entered.

Creation of meaningful identifiers for culture domain specific entities: While instantiating the culture domain specific entities such as Theaters, Events, Plays, we would need to assign them with identifiers. It would be desirable to use identifiers that are machine-readable as well as human-readable and understandable. Such a naming scheme can be developed in this issue: https://github.com/Fraunhofer-FIT-DSAI/drk-information-model/issues/2

  1. Start of the semantic metadata modelling process

    • Actors: FIT DRK IM Group.
    • The DRK IM group identifies ontologies and vocabularies for semantically representing these concepts and creates (visual) semantic metadata representations, preferably based on the ontologies identified in the DRK IM layers diagram (see below).
  2. Iterative review, feedback, and revision

    • Actors: UC partners and FIT DRK IM Group.
    • The semantic metadata representations and application profiles (constraints) are discussed and updated in an iterative manner.
    • Once final, the DRK IM group creates formal semantic descriptions (RDF) and constraints (SHACL shapes) and merges them as a pull request with the DRK IM repository’s main branch.
  3. Release

    • Actors: FIT DRK IM Group.
    • The DRK IM release snapshots will follow the SemVer versioning specification.
  4. Knowledge graph creation: Registration of data sources as assets

    • Actors: UC partners, with support from FIT DRK IM Group, if required.
    • As a part of registration of data sources as assets, the semantic metadata schemas and application profiles from the DRK IM are used to create metadata instances.
    • This registration of data sources happens through EDC’s user interface or API.
    • This forms the DRK Knowledge Graph (DRK KG).

More Information

DRK IM in the Context of the European Interoperability Framework (EIF)

DRK IM focuses on the interoperability aspects within the Technical and Semantic Levels of the EIF, particularly the use of generic and domain-specific information models, together with tooling support to enable interoperability in the data space for culture.

EIF iSHARE (Image sources: iSHARE Cookbook for Data Spaces, EIF)

DRK IM Layers (Work in progress)

To facilitate both the modular representation and FAIRness (Findability, Accessibility, Interoperability, and Reusability) of information, the DRK IM is structured into layers, as illustrated in the following figure. These layers are realized using the existing ontologies and vocabularies.

DRK IM Layers (Graphic: Rohit Deshmukh, Daham Mohammed Mustafa/Fraunhofer FIT)

Glossary

Digital Asset (subset: "Data Asset")

Data Asset

Metadata

Content data

Data model