frictionlessdata / datapackage

Data Package is a standard consisting of a set of simple yet extensible specifications to describe datasets, data files and tabular data. It is a data definition language (DDL) and data API that facilitates findability, accessibility, interoperability, and reusability (FAIR) of data.
https://datapackage.org
The Unlicense
481 stars 107 forks source link

Promote "Metadata in Table Schema" recipe to the specs #899

Open roll opened 3 months ago

roll commented 3 months ago

Overview

The recipe is published here - https://datapackage.org/recipes/metadata-in-table-schema/

It's heavily used in one of the most prominent Data Package adoption cases - http://schema.data.gouv.fr/

I'm ccing @johanricher for more details here

amelie-rondot commented 3 months ago

Context

In 2019, we introduced a pattern for schema metadata properties, to be able to describe their name, description and other caracteristics. This helps users to understand schemas and increase their sharing and reuse, for example as part of a cataloging use case.

These metadata properties have since been used by a significative number of schemas, most of which have been created in France and cataloged on schema.data.gouv.fr.

Examples of adoption:

Some of the properties have also been implemented in frictionless-py:

Proposition

In order to solidify the growing adoption of the metadata properties and increase the coherence between the spec and implementation, we propose to add a subset of those properties, those most frequently used, to the Table Schema specification and documentation, as part of the v2 Frictionless Data specs.

We will also propose via an issue on the frictionless-py repository to implement those properties in the library and document them.

All those properties would stay optional to ensure the retrocompatibility of the spec and implementations with existing schemas.

Adding other custom properties, would still be allowed and tolerated by implementations such as frictionless-py

Next

We propose to contribute to all or part of this work.

roll commented 2 months ago

Thanks a lot, @amelie-rondot!

Would you be interested in creating a PR for this change (please take a look at the v2 Contribution Guideline), or would you like me to work on it?

amelie-rondot commented 2 months ago

Hello @roll, From now, I will not have enough time to continue to work on frictionless and Validata projects. But my french colleague Pierre Camilleri takes the lead on it and is interested to create this PR to adopt this change.

roll commented 2 months ago

Hi @pierrecamilleri,

Amazing! Please let me know if you need any help

roll commented 2 months ago

Hi @pierrecamilleri,

Please take into account that the changes that will consist Data Package (v2) needs to be accepted by the Working Group by the end of May so we need to make a proposal on this one in next few weeks otherwise it will be in later versions

pierrecamilleri commented 2 months ago

Thanks for the reminder ! I am currently working on it so I should propose a PR in the coming days if not today.