Open cmungall opened 1 year ago
I notice that ENVO has a forest ecosystem
term. Should deciduous forest
be a subclass of that?
On a more general note, I agree with you that we need design patterns that avoid defining things as immaterial entity
.
@wdduncan thanks for your comments!
notice that ENVO has a forest ecosystem term. Should deciduous forest be a subclass of that?
The essence of my proposal is that we would like to see a class hierarchy that organized something like this
We are neutral as to whether these are conceived of as ecosystems, biomes, whatever. This would also work for us
Note that for brevity I am only showing one path here but this could potentially be a lattice.
The main point of my proposal is that the concepts annotators need should be in one hierarchy, rather than the concept of "decidous forest" being in one hierarchy, "temperate deciduous needleaf forest ecosystem" being in another, etc.
On a more general note, I agree with you that we need design patterns that avoid defining things as immaterial entity.
To be clear, I am not proposing that we avoid adding immaterial entities. I would prefer to see the use cases for these clearly articulated, but I trust these use cases exist.
We just need to capture the base concepts in a single hierarchy.
Right now, an annotator who is looking to annotate a "deciduous forest" concept is likely to pick "area of deciduous forest" (this has indeed happened for NEON annotation). This means that our annotations are a pick and mix of different branches, and roll-up queries don't work as expected - e.g. annotations to "deciduous forest" do not roll up to "forest ecosystem", and annotations to "temperate deciduous needleaf forest ecosystem" do not roll up to "deciduous forest".
@cmungall I agree with your reasoning about the advantages of a single hierarchy.
To be clear, I am not proposing that we avoid adding immaterial entities. I would prefer to see the use cases for these clearly articulated, but I trust these use cases exist.
I didn't think you were. This is my opining ;)
Perhaps my use of 'avoid' sounded to strong. Here is all I meant.
It seems to me that immaterial entities (and ICEs) generate a number of these shadow issues. Consider (for example) how we would represent the northern hemisphere or Eastern Europe. Such entities invite the use immaterial entity (especially amongst the more philosophically/BFO minded developers .... I think I may still be included in this group). This, in turn, increases the chance of creating shadow hierarchies.
It would be better (again my opinion) if the default stance was to classify things as material entities, and use immaterial entity only when a clear and motivating case can be made.
After a long querying and manual filtering exercise of terms from the environmental system
, environmental zone
, layer
, and astronomical body part
hierarchies, filterning it into a around 1164 terms in ~100 categories. I found the following. Shown below are the 28 most problematic concepts which are only or mostly found in some combination of biome, layer, environmental zone, ecosystem, (or others), but not in the astronomical body part
hierarchy, which is the closest current hierarchy to the single branch with the bulk of terms. I think cleaning up all terms associated with these concepts would help with the vision of a single hierarchy for curators.
<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:x="urn:schemas-microsoft-com:office:excel" xmlns="http://www.w3.org/TR/REC-html40">
ENVO Parent | Category -- | -- ecosystem | bog layer | canopy geographic feature and ABP | cut ecosystem | ecotone Various | estuarine ecosystem | farm layer | floor Various | forest layer | front Various | grassland Various | marine ecosystem | marsh ecosystem | meadow ecosystem | mire various geographic feature, biome and layer | neritic environmental zone | oasis various ecosystem and ABP | palsa Various: biome and layer | pelagic ecosystem | plantation environmental zone | plate environmental zone | scrubland biome | shrubland Various | swamp Various | tidal/tide Various | tundra Various: layer, environmental zone | vegetation Various | wetland Various | woodland
As part of the NEON/NMDC collaboration we have been mapping NEON terms to ENVO. The team needed a concept "deciduous forest", and the closest they could find was "area of deciduous forest", so this was used.
However, this leaves us in an odd position. If we look at the placement of "area of deciduous forest" in comparison to other more specific "deciduous forest" terms:
We see that they are in completely separate branches. We would like all terms we use to be in a single is-a/part-of hierarchy, such that we can do standard ontology operations such as provide faceted browsing, roll up terms for analyses, etc.
The two branches here also surfaces a number of other unusual decisions.
In tropical, we have "tropical {...} broadleaf forest IS_A tropical broadleaf forest":
But this is inconsistent with "temperate {...} broadleaf forest PART_OF temperate broadleaf forest":
It would be useful if we could somehow tease apart the core concepts in ENVO independent of whether something is considered an area, an ecosystem, a biome, or an astronomical body part. For annotating sample sources and many other applications of users would appreciate something that looks more like anatomy ontologies, GO, etc, with less duplication of concepts, where the core concepts are in one hierarchy and follow standard classification patterns.
In this classification we would have a general concept like "forest", and then we would have a fairly consistent lattice of sub-types (no part-ofs) for the various ways a forest can be classified: tropical vs temperate, needle vs broad leaf, deciduous vs non-deciduous...
We would reserve parthood for things that are clearly composition - a forest is made of trees, a canopy is part of a forest, the forest is part of the terrestrial composition of the earth, etc
Then for groups that need to distinguish area vs ecosystem vs biome vs feature, these could be added as separate branches; however, the simple core could still be extracted and it would form a coherent consistent core.