Closed: amoeba closed this issue 3 years ago
@mpsaloha sent me a v1 draft but indicated that we wanted to materialize inferred axioms in the OWL file. He was having trouble getting the Protégé "Export inferred axioms as ontology..." feature to run at all.
I tried it on my end and it ran fine and materialized the kinds of inferred axioms we were looking for. But then we noticed the result is a bit funny: it seems to copy some annotations from the imported ontologies. See this gist of the differences. Note that I had to remove the provone import for robot to run the diff, so you won't see a line item in the diff showing the missing import. 95+% of what's in the diff looks good, so I'm thinking we can just do some minor touch-up here.
On another note, it looks like SSN and SOSA follow a pattern that looks really nice. They offer up the following annotations:
Would it make sense for us to add these? We have some of this info embedded in rdfs:comments but I think more specialized annotations would be better.
I took Mark's last version, exported an inferred copy, re-added PROVONE, SSN, and SOSA imports, and took an initial attempt at some of the annotations in the list above (dcterms:title, etc). I'll touch base with @mpsaloha and the rest of the team tomorrow to try wrapping things up.
Oh, and I threw up a PyLODE page for the latest copy of the draft at https://60dd1c46a436a937e61a6874--reverent-austin-ec584b.netlify.app if anyone wants to check it out. You can see some quick things that we might tweak, like creators, a better description, and an overview image.
We're close but not quite there yet on this. @mpsaloha did some more work and sent me an update. I made two commits:
A few open questions remain:
- http://purl.dataone.org/odo/MOSAiC/ but a natural prefix URI for our terms would be http://purl.dataone.org/odo/MOSAiC_. Should we tackle this?
- Trying to follow new proposed contributing guidelines, I renamed the branch from MOSAiC to feature-88-mosaic and I would like to delete the original branch, but let's discuss.
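To make the prefix question a bit more concrete, here's a rough sketch of how the two URI strings behave as prefixes. The local ID `00000001` is made up for illustration (the real term IDs may look different), and the helper names are hypothetical.

```python
# The two URIs from the open question above.
SLASH_URI = "http://purl.dataone.org/odo/MOSAiC/"
UNDERSCORE_PREFIX = "http://purl.dataone.org/odo/MOSAiC_"


def term_uri(prefix: str, local_id: str) -> str:
    """Join a namespace prefix and a local term identifier."""
    return prefix + local_id


def in_namespace(uri: str, prefix: str) -> bool:
    """Check whether a URI falls under a given prefix."""
    return uri.startswith(prefix)


# "00000001" is a hypothetical local ID, not a real MOSAiC term.
example = term_uri(UNDERSCORE_PREFIX, "00000001")
print(example)
print(in_namespace(example, UNDERSCORE_PREFIX))
print(in_namespace(example, SLASH_URI))
```

The point is just that a term minted with the underscore prefix does not fall under the slash-terminated URI, so tools that match terms by prefix would treat the two as different namespaces.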
Before I make the above changes, I wanted to check and see if there were usages of MOSAiC in the wild and I see four: https://search.dataone.org/cn/v2/query/solr/?q=sem_annotation:*MOSAiC*&fl=id,sem_annotation. I'm going to coordinate with @laijasmine to make sure we can update those annotations.
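For anyone who wants to repeat that check from a script, here's a minimal sketch that builds the same Solr query URL and pulls IDs out of a parsed JSON response. The `wt=json` parameter is my addition (it isn't in the link above), the function names are hypothetical, and the response shape assumed is the standard Solr one (`response.docs`).

```python
from urllib.parse import urlencode

SOLR_ENDPOINT = "https://search.dataone.org/cn/v2/query/solr/"


def build_annotation_query(term: str) -> str:
    """Build a query URL for content whose sem_annotation mentions `term`."""
    params = {
        "q": f"sem_annotation:*{term}*",
        "fl": "id,sem_annotation",
        "wt": "json",  # assumption: ask Solr for JSON; not part of the original link
    }
    return SOLR_ENDPOINT + "?" + urlencode(params)


def annotated_ids(solr_response: dict) -> list:
    """Collect document ids from a parsed Solr JSON response."""
    docs = solr_response.get("response", {}).get("docs", [])
    return [doc["id"] for doc in docs]


print(build_annotation_query("MOSAiC"))
```

Fetching the URL and passing the decoded JSON to `annotated_ids` would give the list of objects whose annotations need updating.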
Edit: We also decided today to try locally versioning the ontology into two versions that stay in sync: an uninferred/raw version and the fully-inferred one. The latter would be the official copy and the one we'd distribute, and the former would only be kept in git. Updates to the ontology would go in two places.
This is pretty tricky but we're hoping it'll be more beneficial than it is painful. @mpsaloha thinks the inferred triples are critical to the serialized ontology. So far our experience with using Protégé to export inferred axioms has been that it doesn't work if you've already exported, so we need to keep a pre-inferred copy around to (1) edit and (2) re-infer + export.
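One possible way to catch the two copies drifting apart is to record a checksum of the raw file whenever the inferred copy is exported and compare it later. This is only a sketch of that idea, not something we've set up; the function names and the stamp file are hypothetical, and the re-infer + export step itself still happens by hand in Protégé.

```python
import hashlib
from pathlib import Path


def file_digest(path: Path) -> str:
    """SHA-256 of a file's bytes, as a hex string."""
    return hashlib.sha256(path.read_bytes()).hexdigest()


def record_export(raw: Path, stamp: Path) -> None:
    """Call right after exporting the inferred copy: remember the raw file's digest."""
    stamp.write_text(file_digest(raw) + "\n")


def inferred_is_stale(raw: Path, stamp: Path) -> bool:
    """True if the raw copy changed (or no export was recorded) since the last export."""
    if not stamp.exists():
        return True
    return stamp.read_text().strip() != file_digest(raw)
```

If `inferred_is_stale` returns True, the raw copy has been edited since the last recorded export and the inferred copy needs to be regenerated.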
@mpsaloha sent me the pre-inferred copy today and I re-inferred the full copy and made all the URI tweaks above. I wrote up a quick readme in the folder https://github.com/DataONEorg/sem-prov-ontologies/tree/feature-88-mosaic/MOSAiC.
I'm going to touch base with @mpsaloha so he can look at the final copy but I think we can move forward from here.
The MOSAiC ontology is ready to be merged onto main in https://github.com/DataONEorg/sem-prov-ontologies/pull/94. GitHub is telling me a PR onto main needs a review. @mbjones could we remove that restriction? At any given time, we'll likely only have one person on staff (me) who has their head deep enough in this stuff to do a real review.
Restriction removed, and I submitted an approving review. I think this issue can be closed now.
This issue still has a few remaining items to be done but I realize that others might prefer we break them out into their own issues. I'll do that now and close this when done.
The remaining tasks here have been broken out into their own issues. See updated OP.
@mpsaloha and @laijasmine are nearly ready with the first version of the MOSAiC ontology. This'll go in a branch and we'll want to merge to main and cut a GH release.
- Decide on place to host the ontology. Moved to https://github.com/DataONEorg/sem-prov-ontologies/issues/96.
- Put in purl.dataone.org redirects. Moved to https://github.com/DataONEorg/sem-prov-ontologies/issues/97.
- Reindex affected content

Re: Decide on place to host the ontology...
We've been hosting ontologies like ECSO on BioPortal and dereferencing our URIs there. Unfortunately, MOSAiC uses lots of instances, and it looks like BioPortal doesn't yet have great support for them. We might be better off with something like a PyLODE page or hosting our own OLS or triplestore. I think the easiest thing would be to start with a PyLODE page and see how far that gets us.
Where we "host" the ontology could be multiple places. One could be for de-referencing (PyLODE) and another, for example, could be for building web interfaces upon (like we do now with BioPortal).