nfdi4cat / voc4cat

A SKOS vocabulary for catalysis maintained by NFDI4Cat & friends
https://nfdi4cat.github.io/voc4cat/
Creative Commons Zero v1.0 Universal
10 stars 8 forks source link

💡 Top level concepts / structuring voc4cat #48

Open dalito opened 9 months ago

dalito commented 9 months ago

Guideline for Hierarchies in Voc4Cat

first draft 2023-11-23

Voc4Cat is a taxonomy that organizes concepts by subject. The type of relations between concepts to create hierarchies are strictly Is-A relationships which are expressed by skos:broader and skos:narrower. Thus, voc4cat focuses on categorizing things by what they are. Such so called subject hierarchies with Is-A based hierarchies correspond well with ontological modelling and reasoning as well as semantic search or AI applications.

In contrast, topic hierarchies organize concepts for search, e.g. they might be organized based on what people are reminded on when hearing a term or what they associate with it. Topic hierarchies conflict with ontological modelling.

However, for use cases like e.g. selection lists in UIs grouping concepts by topic is valuable. In voc4cat, skos:collection may be used to create such lists or even list-of-lists. In this topic-wise form of organization part-of-relations or even looser relations (e.g. skos:related) dominate.

It is important to be aware of the differences between subject and topic hierarchies. Here are two examples to make the difference clear. The first organizes the hierarchy by topic (here part-of-relations):

However, in voc4cat we want to create hierarchies based on "Is-A" relations as shown in the 2nd example:

Here are the top level concepts that we propose to add soon:

Related: #33

dalito commented 9 months ago

@nmoust @markdoerr @HendrikBorgelt @schumannj @AleSteB - Feedback welcome!

We tested this proposal with the existing terms and it is unambiguous and straight-forward to apply. The top concepts are similar to those in top-level ontologies (BFO, SIO, DOLCE), to what is proposed in ANSI/NISO Z39.19-2005 or what was proposed as minimal set of terms for agrovoc (which has too many top-level concept). So It should be a safe bet.

markdoerr commented 9 months ago

Hi @dalito, thanks for the hierarchy proposal. We should clarify some terms here:

  1. I would define physical object as an object in space-time with a mass (general matter / material)
  2. temporal entities, like events, actions and processes are still abstract entities, they have no perceivable representation in space-time (=they are immaterial), therefore they should be narrower concept of an abstract entity
markdoerr commented 9 months ago

.. and the same holds true for attributes they are also abstract / immaterial

nmoust commented 9 months ago

Dear @markdoerr ,

thank you for the feedback. So if I understood correctly you suggest that we change the structure to:

  1. Entities
  2. Physical objects
  3. Abstract entities 3.1. Temporal entities 3.1.1 Events 3.1.2 Actions 3.1.3 Processes 3.2. Attributes

And change the definition of Physical object from "Entity with a concrete and physical nature." to "Object in space-time with a mass".

After discussing with @dalito , we can think of using the following structure:

  1. Physical entity
  2. Non temporal abstract entity
  3. Temporal abstract entity 3.1 Events 3.2 Actions 3.3 Processes
  4. Attributes

Maybe this offers a simple structure easy (or easier) for the user to understand.

markdoerr commented 9 months ago

@nmoust and @dalito, yes, I had exactly the first structure in mind, because it generates a logic hierarchy of "Things or Entities", based on their "abstractness". In your proposal it is unclear, why you put Attributes next to physical entities and (non-)temporal entities (so the ordering principle is not obvious).

dalito commented 9 months ago

Def. of "Attributes": An attribute is a characteristic of an entity that is intrinsic to and cannot exist without the entity. (added also to starting message which had no def before)

Informed by: http://www.w3.org/ns/ssn/Property, http://semanticscience.org/resource/SIO_000614

schumannj commented 9 months ago

I am not sure I understand the difference between Attribute and Non temporal abstract entity correctly. E.g Selectivity, is it an attribute of a product in a chemical conversion (under specific conditions with a certain catalyst)? Is "reaction mechanism" or "chemical reaction network" a Non temporal abstract entity? However a "chemical reaction" or "chemical conversion" would be a temporal abstract entity (process)? And the conversion of a reactant would again be an attribute? Is this right?

schumannj commented 9 months ago

I believe in NOMAD the temporal abstract entities are called activity, and furthermore distiguished into process, measurement and analysis. I am not sure what would be the distinction between 3.1(Event), 3.2 (Action) and 3.3(Process) above or if they can be directly mapped.

dalito commented 9 months ago

@schumannj Thanks for feedback!

However a "chemical reaction" or "chemical conversion" would be a temporal abstract entity (process)? And the conversion of a reactant would again be an attribute? Is this right?

Yes.

Regarding activity/process: It comes down to exact definitions but both terms can work in my opinion (we can add a mapping to the nomad-def if it can be referenced by IRI).

We can also add more structure later. But in SKOS we don't want to (and can't) define a full ontological model but just structure terms that are then re-used in data models.

richardlenz commented 9 months ago

"The type of relations between concepts to create hierarchies are strictly Is-A relationships which are expressed by skos:broader and skos:narrower" - skos:broader / skos:narrower does not refer to strict IS-A hierarchies - Instead these relations subsume a variety of hierarchical relationships including IS-A (which is the class-sublass relationship) A strict IS-A hierarchy could be expressed by rdfs:subClassOf

dalito commented 9 months ago

@richardlenz thanks for the comment! "skos:broader / skos:narrower does not refer to strict IS-A hierarchies" - Yes, exactly. SKOS does not restrict the use to IS-A. In principle, one can create any kind of "hierarchy mess" via broader/narrower without violating any rule.

So the above should be seen as a guidance principle for our vocabulary. If this is followed well, re-using voc4cat in ontologies should be easier than if we have no guidance principle.

richardlenz commented 9 months ago

@dalito > .. "So the above should be seen as a guidance principle for our vocabulary. If this is followed well, re-using voc4cat in ontologies should be easier than if we have no guidance principle."

Thats true, but the purpose of a SKOS Vocabulary is to increase findability of terms by including vague relations of terms rather than limiting the hierarchy to strict class-subclass relations. (Otherwise these vague relations would be lost.). It is true that this does not allow for set based inference, but this is not the purpose of SKOS, and an inference engine would not interpret skos:narrower as a subclass relationship anyway. So therefore I would recommend to be aware of this distinction and simply add rdfs:subClassOf relationships where this actually holds. Alternatively, proper subclass relationships could actually be modelled as such an then the skos:narrower relationship can be inferred from that.

RoteKekse commented 6 months ago

hey all, i am contemplating continuing contributing voc4cat but i saw that thephotocatalysis part is not obying the srict is a relation right now. so i am a bit unsure on how to evaluate what is there already and what not. is it planned to correct the hierachy?

dalito commented 6 months ago

Yes, as you observed some photo catalysis term hierarchies are not yet fully aligned with the approach suggested here. We still have the plan to fix this. Probably as part of a PR that adds the top level concepts.