Materials-Data-Science-and-Informatics / MDMC-NEP-top-level-ontology


The PRIMA Ontology

This repository collects the ongoing work towards the development of the top-level ontology based on common terms defined for the Joint Lab "Integrated Model and Data Driven Materials Characterization" (MDMC) and for the "Nanoscience Foundries and Fine Analysis Europe Pilot" (NEP). The top-level glossary defining the terms is available (as a living document which can be constantly updated) on the NEP website: https://www.nffa.eu/apply/data-policy/glossary

The aim of this joint activity is to develop the PRovenance Information in MAterials science (PRIMA) ontology, which captures provenance information in the materials science domain and can initially be adopted by MDMC and NEP. In the future, it might also be adopted by other materials science projects. A shared ontology gives these projects a common description of the concepts and relationships in the domain and, with it, a common set of metadata that increases the interoperability and reuse of data.

Table of contents

  1. PRIMA Documentation
  2. Use Cases
  3. Usage
  4. Future Extensions
  5. Contact
  6. License
  7. Acknowledgements

PRIMA Documentation

PRIMA is a modular ontology consisting of four modules:

  1. PRIMA-Core: This module consists of top-level classes and properties that can be reused in the other modules. It is developed to provide general provenance information in the materials science domain, especially for the experimental workflow.

  2. PRIMA-Data Analysis Lifecycle: This module describes the classes and properties related to a data flow.

  3. PRIMA-Dataset: This module describes the classes and properties related to the structure of a dataset in the context of research.

  4. PRIMA-Experiment: This module describes the classes and properties related to an experiment.

The ontology combining all four modules is PRIMA-complete.

Use Cases

So far, we have demonstrated the applicability of PRIMA with two different use cases: (i) the mapping of the FAIRification workflow applied to Scanning Tunneling Microscopy (STM) images, from data acquisition to data analysis, and (ii) the alignment to PRIMA of the fabrication process ontologies applied to metallic biomaterials recorded in the Herbie Electronic Laboratory Notebook (ELN).

Use case 1: Scanning Tunneling Microscopy (STM) Images

In this use case, we extend the work of Rodani et al. (2023) by mapping its provenance data model to PRIMA. The provenance data model of the STM images follows the PROV-DM standard and is serialized as PROV-JSON, i.e., the metadata is in JSON format. The mapping connects the JSON objects to PRIMA so that each JSON object becomes an instance of a PRIMA class.
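
The idea of typing each PROV-JSON object as an instance of an ontology class can be illustrated with a minimal sketch. It is not the mapping used in this repository: the namespace `https://example.org/prima#`, the class names `Dataset`, `Measurement`, and `Researcher`, and the example document are all hypothetical placeholders. Only the top-level PROV-JSON sections (`prefix`, `entity`, `activity`, `agent`) follow the PROV-JSON layout.

```python
import json

# Hypothetical namespace and class names, used only for illustration;
# the real PRIMA IRIs must be taken from the ontology files.
PRIMA = "https://example.org/prima#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"

# Assumed mapping from PROV-JSON top-level sections to PRIMA classes.
SECTION_TO_CLASS = {
    "entity": PRIMA + "Dataset",
    "activity": PRIMA + "Measurement",
    "agent": PRIMA + "Researcher",
}


def expand(qname, prefixes):
    """Expand a prefixed name such as 'ex:scan_001' to a full IRI."""
    prefix, _, local = qname.partition(":")
    return prefixes[prefix] + local


def prov_json_to_triples(doc):
    """Return one rdf:type triple per object found in the mapped sections."""
    prefixes = doc.get("prefix", {})
    triples = []
    for section, prima_class in SECTION_TO_CLASS.items():
        for qname in doc.get(section, {}):
            triples.append((expand(qname, prefixes), RDF_TYPE, prima_class))
    return triples


def to_ntriples(triples):
    """Serialize (subject, predicate, object) IRI triples as N-Triples."""
    return "\n".join(f"<{s}> <{p}> <{o}> ." for s, p, o in triples)


# Made-up PROV-JSON document standing in for real STM metadata.
EXAMPLE = json.loads("""
{
  "prefix": {"ex": "https://example.org/stm/"},
  "entity": {"ex:image_001": {"prov:label": "STM image"}},
  "activity": {"ex:scan_001": {"prov:label": "STM scan"}},
  "agent": {"ex:operator_1": {"prov:type": "prov:Person"}}
}
""")

if __name__ == "__main__":
    print(to_ntriples(prov_json_to_triples(EXAMPLE)))
```

Running the script prints one N-Triples line per JSON object. A real mapping would also translate the PROV relations (for example `wasGeneratedBy`) into object properties, which this sketch leaves out.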

The use case including the mapped ontology and the RDF data can be accessed here.

Use case 2: Metallic Biomaterials Fabrication in the Herbie ELN

Herbie is a hybrid system between an ELN and a research database developed at the Helmholtz-Zentrum Hereon. Herbie is tailored to cover and interlink the heterogeneous process chain of metallic biomaterials research, including materials development, biological characterization, and synchrotron imaging; nevertheless, due to its modular structure, it can be adapted to other fields.

In this use case, the Herbie ontology, the ontology used in Herbie, is extended to align it with PRIMA. A successful ontology alignment identifies relationships between entities in the source and target ontologies in order to establish links and similarities between them. The analysis focuses on concepts that overlap but may have different names (synonyms) or types in the two ontologies. This alignment supports the generation of linked data and increases the interoperability of Herbie with other materials science data.
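
As a rough illustration of how overlapping concepts with different names might be detected, the following sketch compares normalized class labels and consults a hand-curated synonym list. It is an assumption-laden toy, not the procedure used for the Herbie alignment: all class identifiers, labels, and synonym pairs are hypothetical, and the choice of `skos:exactMatch` and `skos:closeMatch` as output relations is ours.

```python
import re

# Hypothetical class labels; not taken from the Herbie ontology or PRIMA.
HERBIE_LABELS = {
    "herbie:Specimen": "Specimen",
    "herbie:HeatTreatment": "Heat treatment",
    "herbie:Operator": "Operator",
}
PRIMA_LABELS = {
    "prima:Sample": "Sample",
    "prima:HeatTreatment": "heat_treatment",
    "prima:User": "User",
}

# Assumed, hand-curated synonym pairs, given as normalized labels.
SYNONYMS = {("specimen", "sample"), ("operator", "user")}


def normalize(label):
    """Lower-case a label and drop everything that is not a letter or digit."""
    return re.sub(r"[^a-z0-9]", "", label.lower())


def align(source, target, synonyms):
    """Return candidate (source_class, relation, target_class) mappings."""
    matches = []
    for s_iri, s_label in source.items():
        for t_iri, t_label in target.items():
            s_norm, t_norm = normalize(s_label), normalize(t_label)
            if s_norm == t_norm:
                # Same name up to case and separators.
                matches.append((s_iri, "skos:exactMatch", t_iri))
            elif (s_norm, t_norm) in synonyms or (t_norm, s_norm) in synonyms:
                # Different names listed as synonyms.
                matches.append((s_iri, "skos:closeMatch", t_iri))
    return matches


if __name__ == "__main__":
    for mapping in align(HERBIE_LABELS, PRIMA_LABELS, SYNONYMS):
        print(*mapping)
```

Label matching only produces candidates. Whether two classes really denote the same concept, or differ in type, still has to be checked by a domain expert before the mapping is added to the ontology.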

The use case including the extended Herbie ontology and RDF data generated from it can be accessed here.

Usage

Future Extensions

Contact

You may contact one of the authors of PRIMA via a.ihsan@fz-juelich.de

License

The code is licensed under the MIT license. Copyright © 2023.

Acknowledgements

This work has been supported by the Joint Lab “Integrated Model and Data Driven Materials Characterization” (MDMC); the Helmholtz Metadata Collaboration (HMC) within the Hub Information at the Forschungszentrum Jülich; the German Research Foundation (Deutsche Forschungsgemeinschaft, DFG) in the framework of the project FAIRmat (Project ID: 460197019); the DFG under the National Research Data Infrastructure – NFDI 38/1 – project number 460247524; the research programs “Engineering Digital Futures” and “Materials System Engineering” of the Helmholtz Association of German Research Centers; the NFFA-Europe Pilot (NEP) Joint Activities; and Use case 1 of the EOSC-Pillar project.