semanticarts / gist

Semantic Arts gist upper enterprise ontology
Creative Commons Attribution 4.0 International
163 stars 18 forks source link

Rename gist instances to use our naming convention #556

Open rjyounes opened 3 years ago

rjyounes commented 3 years ago

Naming convention from style guide:


A "rigid" class is one that the instance inherently belongs; it is part of the essence of the object, which would not be the same object if it did not belong to this class. A non-rigid class may be temporary and/or express a role or relationship; for example, Child, Patient, Employee. The notion of rigid classes originates in OntoClean.

The "most specific rigid" class is the rigid class that the instance most directly belongs to.

For example, given the class hierarchy Living Thing > Person > Student, where the first two classes are rigid and the third is not, the name for Sir Tim Berners-Lee is _Person_Sir_Tim_Berners_Lee.


None of the gist individuals for units and durations follow this convention.

We should also use the namespacing convention we use for client projects - I.e., we should use https://taxonomies.semanticarts.com/gist/ (gistx). As a result we would have, for example, gistx:_DurationUnit_minute. See issue #305.

Original terms can be deprecated to make this a minor change.

uscholdm commented 3 years ago

This is a good idea, though it will make all the IRIs long and clunky.

rjyounes commented 3 years ago

Submit one PR for #370, #526, and #556.

rjyounes commented 3 years ago

Existing terms need to be added to a gistDeprecated.ttl file - which doesn't currently exist because we just removed it for the major release.

rjyounes commented 3 years ago

See discussion thread on PR #575 (now closed because the issue needs further review).

rjyounes commented 3 years ago

I just noticed that in my current project I've been using the _UnitOfMeasure_ infix, so perhaps that's the way to go after all. In fact, you could argue that BaseUnit is not a rigid class: these units are base units because we've defined them as such, but in another units of measure model they might not be - e.g., minute could be the base duration unit rather than second, or euro could be the base CurrencyUnit. A second would still be a second even if it weren't a base unit.

This would still be an exception to our general 'most specific rigid class' rule, however: a second would not be a second if it were not a DurationUnit. We have to think of a way to express the exception to our convention in a non-arbitrary way. @uscholdm Thoughts?

uscholdm commented 3 years ago

I did not follolw this fully, probably best to have a chat to make progress.

rjyounes commented 2 years ago

Defer until #305 is implemented so we don't change the names more than once.

rjyounes commented 2 years ago

@dylan-sa Why does this need to be deferred? We can deprecate the existing terms, as is our usual practice.

rjyounes commented 2 years ago

Deprecate existing URIs.

rjyounes commented 2 years ago

Re rigid class issue above: Michael proposes we could use the most specific rigid class in gist, and that clients could handle it themselves - e.g., by defining their own instances with "UnitOfMeasure" infix and owl:sameAs to the gist instances. Current instances to be deprecated.

Open question: are these individuals categories or data? I.e., gistxvs gistd namespace?

uscholdm commented 2 years ago

Open question: are these individuals categories or data? I.e., gistxvs gistd namespace?

A question I ask is whether the information is part of the subject matter of the client business. If it is, it goes in the taxo namespace, otherwise it is data. Let's look at some client examples.

In all three cases, the units and new subclasses of gist:Magnitude (that go hand in hand) are describing the subject matter relating to the core business of the client. They are used to create client data. So to me, the units are a much better fit to be in a gistx namespace than a gistd one.

It would be different if we worked for the international standards organization that tracked all sorts of units and measures. For them, it would probably be as good or better fit to regard the units as data.

rjyounes commented 2 years ago

A question I ask is whether the information is part of the subject matter of the client business

Since gist is an upper ontology, with no defined subject matter, we have to decide whether the instances are taxonomic or data. Once we take one stance with gist, client models should follow suit. It would be odd if a unit defined in gist is data while it's taxonomic in a client model, or vice versa.

Given that UnitOfMeasure and Magnitude are not subclasses of Category, I don't see how these can be taxonomic terms - on the assumption that only categories are taxonomic terms, but I'm not sure this is a valid assumption. In addition, they serve as subjects and objects of predicates such as hasBaseUnit, hasStandardUnit`, and so on, suggesting they are data - I.e., we have things to say about them other than where they fit in a hierarchy. It doesn't make sense to say that a category has a base unit.

One might say that if it's a controlled vocabulary it's taxonomic. At the same client, we have a curated set of suppliers and manufacturers (organizations). Again, Organization is not a subclass of Category, and in addition these instances do things, such as selling and manufacturing products.

uscholdm commented 2 years ago

This is a not a clear-cut decision - your points are generally valid. It's a matter of considering tradeoffs and personal preferences. On the other hand:

It occurs to me that if we go with the International Standards Body, a unit of measure is just a Magnitude - we went through this once before and decided against it. But if we did go with this approach, the data namespace seems to make more sense.

rjyounes commented 2 years ago

That is still on the table as part of the units and measures work: #61.

rjyounes commented 2 years ago

This will require more discussion due to the issue of most specific not always being appropriate.

rjyounes commented 1 year ago

Note: although all the gist instances are currently units of measure, I don't think the topic here is units and measures per se, so am removing it from that discussion.

rjyounes commented 1 year ago

I vote to close this issue. There are only a handful of terms at stake, all units of measure, and there is considerable disagreement about whether uoms are taxonomy data or not. IMO it's not worth hashing out.

uscholdm commented 1 year ago

Agree to not address this for now. Is there a 'dormant' status, as opposed to dead?

rjyounes commented 1 year ago

Closing and labelling as deferred.

rjyounes commented 1 year ago

Deferred to discussion of unit and measures: issue #759 and #697

rjyounes commented 11 months ago

@uscholdm @dylan-sa @coltonglasgow What are your thoughts about implementing the naming convention in sub-gists? A couple of them have small taxonomies; should we put them in a gistx: namespace even though that does not currently exist? It seems a good opportunity to follow our best practices.

uscholdm commented 11 months ago

As of 12.0.1 there only a dozen or so instances and every one is related to units and magnitudes. It can be addresses in #697.

image

rjyounes commented 11 months ago

@uschold I was referring to sub-gists.

dylan-sa commented 11 months ago

I like the idea of using gistx: for the taxonomy instances, similar to how we do with client ontologies. One thing to consider: While many of the sub-gist instances would definitely go into gistx:, we have some units of measure, too. Did we reach an agreement about whether UoMs should go into gistx: or gistd:? Maybe we could leave UoMs in gist: for now but move the obvious taxo instances into gistx:?

uscholdm commented 11 months ago

I think we shoud do whatever we normally do with clients, to the extent that we all do the same thing. If there are differences, we should look into them and make a choice.