mobilityDCAT-AP / mobilityDCAT-AP

Repository of the metadata specification mobilityDCAT-AP
https://w3id.org/mobilitydcat-ap
Creative Commons Attribution 4.0 International
12 stars 5 forks source link

The "Publication" term seems to by synonymum for "Dataset" #52

Open vlcinsky opened 1 month ago

vlcinsky commented 1 month ago

See the definitions in the document https://mobilitydcat-ap.github.io/mobilityDCAT-AP/releases/1.0.1/index.html#terminology

A Dataset is a collection of data published or curated by a single source and available for access or download in one or more formats. In the mobility context, this might be, e.g., a data collection about static parking information for truck drivers published by a road authority.

and then

A Publication is an abstract construct that covers a dataset published by a data publisher and which has one or multiple distributions.

While Dataset is widely used, I did not find any specific use of Publication term.

I did not find any definition for Publication in DCAT-AP v2 and DCAT-AP v3.

Consider removing this term from mobilityDCAT-AP.

marioscrock commented 4 weeks ago

Thank you @vlcinsky for opening the issue and reporting the malicious comment above (I deleted it). We will discuss this topic during the next mobilityDCAT-AP meeting and get back to you.

EdNDW commented 2 weeks ago

In the Netherlands, we distinguish between publications and datasets. A publication is a description of one or more datasets on the same subject. A dataset is a specific version in a particular format. For example, in the Dutch NAP, there is a description of one real-time publication related to travel times. This publication contains various datasets in formats such as XML and JSON.

vlcinsky commented 2 weeks ago

Thanks @EdNDW for feedback. Terminology is hard but important topic.

We have created study DATEX@NAPs and struggled with terminology. Almost every institution or publisher has their own terminology, e.g.:

For that reason we searched for some common ground and found that mobilityDCAT-AP is the one - it is maintained, updated and relates to mobility data.

Our terminology based on mobilityDCAT-AP is describe here: DATEX@NAPs/Key Concepts

I have to say, it is not easy to stick with any agreed terminology - the habit of "our own best terms" is hard to overcome.

Regarding terms "dataset" and "publication" you have described:

It would propose to organize a workshop to discuss and clarify the moblityDCAT-AP terms. Ideally reviewing some text which we really use for talking about the data we provide.