mlcommons / croissant

Croissant is a high-level format for machine learning datasets that brings together four rich layers.
https://mlcommons.org/croissant
Apache License 2.0
452 stars 41 forks source link

Semantic annotations / triplification #739

Open benjelloun opened 2 months ago

benjelloun commented 2 months ago

Define a complete mechanism to add semantic annotations to Croissant data, that builds on the existing dataType/equivalentProperty approach to annotate Croissant data with classes and properties from existing vocabularies.

This mechanism should make it possible to generate a triple representation of annotated Croissant data, given the user sufficient control over the generated triples.