Remote rdfs:subClassOf triples

edmcouncil / fibo

The Financial Industry Business Ontology (FIBO) defines the sets of things that are of interest in financial business applications and the ways that those things can relate to one another. In this way, FIBO can give meaning to any data (e.g., spreadsheets, relational databases, XML documents) that describe the business of finance.

https://spec.edmcouncil.org/fibo/

MIT License

315 stars 67 forks source link

Remote rdfs:subClassOf triples #874

Closed mereolog closed 3 years ago

mereolog commented 4 years ago

Fibo contains a number of remote rdfs:subClassOf triple statements, that i.e., cases where all three statements are present: class1 rdfs:subClassOf class2. class2 rdfs:subClassOf class3. class1 rdfs:subClassOf class3. For example: NationalSecurityIdentificationScheme rdfs:subClassOf SecurityIdentificationScheme SecurityIdentificationScheme rdfs:subClassOf RegistrationScheme NationalSecurityIdentificationScheme rdfs:subClassOf RegistrationScheme

In such cases the last statement can be inferred from the others and as such is not needed.

Although there is nothing formally wrong with such statements, they unnecessarily clutter the ontology by making it more expensive to maintain. They also violate the DRY principle. Finally they affect visualisation as the viewer displays more parents than needed from the formal point of view.

Here is the list of all such cases in the current master: remote_subclassofs_20200212.xlsx They were collected by the following SPARQL query: SELECT ?sub ?super WHERE { ?sub rdfs:subClassOf ?super. ?sub rdfs:subClassOf/rdfs:subClassOf+ ?super.} over all the aggregated RDF graphs over all rdf files in the fibo repo.

Perhaps it would make sense to add this to the hygiene tests - not as a check that throws errors but maybe as a warning check. After all they might be an informal reason for keeping a remote.

dallemang commented 4 years ago

We have a number of other DRY principles we enforce in FIBO using hygiene tests (e.g., don't refer to owl:Thing as a domain or range). We do intentionally violate DRY from time to time, but only if there is some expository reason to do so (I can't actually remember such a case, I just remember that there has been one)

This makes sense to me, and like you say, it is easy to fix.

Do keep in mind that there could be modularity issues; e.g., A subClassOf B could be in one ontology, B subClassOf C in another, but we cannot necessarily expect that second ontology to be imported, so we assert the apparently redundant A subClassOf C to reduce the dependency of the first ontology on the second. I don't know if this really happens, but it could.

rivettp commented 4 years ago

This has exposed a worse problem - a cycle - with https://spec.edmcouncil.org/fibo/ontology/DER/ExchangeTradedDerivatives/ExchangeTradedOptions/TradedOptionPrincipal which is :

Where the latter is:

rivettp commented 4 years ago

Some of these reveal some deeper structural flaws which require issues in their own right . e.g. Organization is a subclass of IndependentAgent (which is rather non-usefully defined as "any person or organization") and the latter is a subclass of AutonomousAgent whose definition seems to exclude Organizations "An agent is an autonomous individual that can adapt to and interact with its environment."