tdwg / dwc

Darwin Core standard for sharing of information about biological diversity.
https://dwc.tdwg.org
Creative Commons Attribution 4.0 International
201 stars 70 forks source link

New Term - objectType #517

Open ben-norton opened 1 week ago

ben-norton commented 1 week ago

New term

New term

Term as documented below is 'borrowed' from Latimer Core. Term Name: ltc:objectType https://ltc.tdwg.org/terms/index.html#ObjectGroup_objectType

Proposed attributes of the new term:

tucotuco commented 1 week ago

Material Sample Task Group investigated, but abandoned as out of scope, the consideration of a MaterialEntity type term (see https://github.com/tdwg/material-sample/issues/33 and https://github.com/tdwg/material-sample/issues/14) and a controlled vocabulary for it (https://github.com/tdwg/material-sample/issues/24). The term as discussed would have ended up being materialEntityType following the pattern of type properties for classes in Darwin Core. This term was purposefully avoided by the Task Group because of myriad implications with respect to basisOfRecord, object type hierarchies, etc., but the discussion was extremely valuable and should be considered by anyone with a vested interest in this term proposal.

ben-norton commented 1 week ago

Is there an alternate path forward for the term? Currently, most values are being cobbled together in preparations, which can be confusing and misunderstood. I'm somewhat familiar with the Material Entity discussions. My hope was to keep objectType simple so that we could solve the problem without getting bogged down by that debate. I may have been overly optimistic about the prospect of doing so. I added it to MaterialEntity mainly because that's where preparations resides. The idea is to narrow that term to a more appropriate level of granularity and avoid the complex material entity discussion.

cboelling commented 1 week ago

The issues that came up when trying to establish a dwc:MaterialEntityType property are due to dependencies that arise from the practical usage of other DwC terms rather than the generality of dwc:MaterialEntity. As such, implementing dwc:ObjectType, if it is understood to indicate the type of a particular object and as such indicates asubclass of dwc:materialEntity, would be subject to exactly the same concerns (spelled out in the links above). What the earlier discussion indicates, I think, is that there are limitations of DwC expressivity when it comes to describing object histories and curatorial activities within that object history.

Setting this aside for a moment (i.e. adopting a bag of terms perspective, nothing else), I agree that data providers and data consumers must have a way to indicate the type of an object - using categories that matter to either party based on the use cases for the data.

If this is handled in an completely open fashion, i.e. allow any value that matters to a stakeholder, this is easy.

If the vocabulary is to be structured and controlled in some way it gets complicated quickly because objects can be categorized alongside many different facets and more often than not categories in one facet are (expected to be) ordered hierarchically or related in other ways.

atomizing the information into a more logical set of properties

I understand that the proposal is to use a dwc:objectType property alongside dwc:preparations (possibly updating its definition) and possibly further properties as a "more logical set of properties" that describe different facets of the object.

If this is correct then what are these facets and which logic is applied? The example list of terms in the OP seems to combine vastly different facets. Also, some of the examples do not in fact generalize values of dwc:preparations (e.g. macrofossil vs. fossil) as the proposed usage comments suggest.

For the record: I do think that the definition of dwc:preparations carries ambiguities that should be resolved (Preparations and preservation methods, which are both mentioned in the definition are different categories). Based on the examples for dwc:preparations I would say it is intended to capture the type of an object along the lines proposed in this NTR.

ben-norton commented 1 week ago

Here's the logic.A fossil is not a preparation. A fossil is a type of object. From my purview, it's not logical to call it a preparation. If you were to wrap it in a jacket, then that would be a preparation. I've "prepared" the specimen. Latimer Core kept this simple to avoid the problems of material entity subtype.

Preservation and Preservation Mode are in ABCD.

matdillen commented 4 days ago

@ben-norton In the current situation, whether a Darwin Core record is a fossil can be deduced from basisOfRecord. What would be any further objectType kind of data that would not fit well in dwc:preparations?

(Regardless of the current example values in the DwC term list, that is a separate discussion.)

From the MIDS perspective, we currently have a MIDS element called ObjectType that maps to dwc:preparations (see this issue and this line in the mapping). This mapping was mainly made with preserved specimens in mind, not fossils. It could be omitted from mapping sets for paleontology, or moved to another MIDS level, if it is not a key piece of information to digitise and publish in that discipline.

wouteraddink commented 4 days ago

@matdillen the current SSSOM mapping for mids:ObjectType is skos:exactMatch to dwc:preparations. But MIDS was created with all specimen types in mind, including fossils, geological objects, preserved DNA and living specimens. So if you say the mapping was created with preserved specimens in mind, that should maybe changed? Should it not be broad match?

as Ben already wrote, in LtC there is also an objectType defined as "A more generic classification of items in the collection than described in preparationType" and the intention of mids:ObjectType was to be compatible with that. Since this term is about items in a collection, would it not be nice if there is an equivalent for a single collection item in DwC as Ben suggests?

The definition as Stephen Richard once proposed seems to make sense: ObjectType is a description of countable things. I think that is much clearer than preparation type which is about how something is made ready for a particular purpose and would include things like dried, preserved in alcohol but these could still be captured in ObjectType as dried material or jar with alcohol. So changing dwc:preparationType to dwc:objectType would also be an option.

matdillen commented 4 days ago

The mappings are going to be discipline-specific, so we have at least distinct mapping sets for (preserved) biology, geology and paleontology. DNA samples and living organisms have not been discussed (yet).

So the mapping in the SSSOM tsv file I linked means that "the MIDS element ObjectType, in the scope of this mapping set, exactly matches dwc:preparations". This may be different in other mapping sets, and may also (need to) change if Darwin Core changes.

I'm not against adding a property like objectType, building on the definitions in Latimer Core. But for now, I'll already be very happy if people use dwc:preparations [for preserved specimens], as it can already go a long way in making specimen data more interoperable. Hence I'd love to hear from Ben what additional needs he sees in paleontology that could be addressed by ObjectType.

Since this term is about items in a collection, would it not be nice if there is an equivalent for a single collection item in DwC as Ben suggests?

I'm still not very familiar with Latimer Core, but this question did prompt me to wonder whether data at the individual specimen level could be fully modeled using the Latimer Core terms and structure.

ben-norton commented 4 days ago

@matdillen 95% of specimens in earth sciences (geology, mineralogy) collections would fall into this category. All specimens have an object type, but not all specimens have a preparation. For biological collections, most specimens are prepared in some manner for preservation purposes. So little is lost in this regard. In geology, this is not the case. I have a 400 million-year-old limestone on my bookshelf. I don't need to prepare it any further for preservation purposes. Preparation is done for specific purposes such as powdering for a variety of analyses or taking a slice for a thin section. However, most specimens aren't and, therefore, wouldn't use the preparation field. I refer back to what @wouteraddink and Stephen Richard say. For paleontology, the landscape is more complicated. Preparation has a specific meaning, although usage is very broad. This seems contradictory, but it's not and indicative of the underlying issue. In general, preparation can be synonymous with preservation. Here, there are two types: natural and anthropogenic. Think of a bug in amber as natural and a specimen in a jacket as anthro. Then there's the object type, which gets awkwardly lumped in, but should not, which includes things like cast and trace fossil. Object type term would make this distinction clearer and help resolve the ambiguous term use.