How to model images and how to integrate images URLs?

chin-rcip / collections-model

Linked Open Data Development at the Canadian Heritage Information Network - Développement en données ouvertes et liées au Réseau canadien d'information sur le patrimoine

Creative Commons Zero v1.0 Universal

12 stars 1 forks source link

How to model images and how to integrate images URLs? #17

Closed chin-rcip closed 3 years ago

chin-rcip commented 4 years ago

For the moment we haven't found any solution to model an pattern that depicts images, and makes a difference between the URL of the image, the digital image, the original physical image if it exists, and the link to the actor.

Habennin commented 4 years ago

I am currently working on the Pharos project, a consortium of photo archives. The key question is modelling images. Thus far, we are pretty satisfied with our modelling. In brief the idea is that you model the basic entity as E36 Visual Item and then from there you indicate its materializations which are either connected through p128 to an E22 or p165 to a D1 digital object. A URI is just an instance of E42 with a type 'URI'.

stephenhart8 commented 4 years ago

If I follow what you have done with the Pharos project, we would have the following model: CiC_Issue17-example1

My questions:

What is the difference between the property P128 carries and P65 shows visual item, that seems here more appropriate?
The actual URL would be a literal linked to the E41 Appellation with the property P190 has symbolic content?

Habennin commented 4 years ago

p65 is just a specialization of p128 to say that what is carried is an image.
Yes. there is a dodgy issue here in that a URL and a URI are very close in form so that in fact your URI for your appellation node might be www.something.org/1 and the value put in the RDF literal will also be www.something.org/1. This is redundant, but unavoiable if you want to be able to do things with the appellation node like type it.

stephenhart8 commented 4 years ago

To document the source of the image (where the image comes from), could we add dct:source to the D1 Digital Object?

illip commented 4 years ago

I don't see any reason why we couldn't use dct:source except if we decide to use a CRM pattern like P67i -> E73 -> P2 -> E55 (Source).

That said, I would like to keep this issue open to get a better understanding of the issue. Three items in particular:

Are we happy with the CRM pattern for the image URL or we would prefer something like schema:contentURL?
Do we all agree that our digital images cannot be deleted because they are conceptual objects? I'm still not convince of this semantic choice?
Where do we draw the line between one Visual Item and another? Since the Visual Item is the only link between our E24 and D1, it's important to be sure that they will never be divided according to a visual issue like: "Oh I don't think it's the same visual item since I can't see the textures on the digital object."

stephenhart8 commented 4 years ago

The pattern I proposed on the 12th of December has been added to the TM 2.0 and will be tested in the following months. It has been decided on the 20th of December to leave this issue open in order to continue the discussion, and see if any other pattern seems better.

illip commented 4 years ago

To help us understanding the E38_Visual_Item, this Linked Art discussion might be useful: https://github.com/linked-art/linked.art/issues/289

Habennin commented 4 years ago

what is the semantic meaning of source? Does it mean someone who owns or provides the digital image?

That said, I would like to keep this issue open to get a better understanding of the issue. Three items in particular:

Are we happy with the CRM pattern for the image URL or we would prefer something like schema:contentURL?

would you put this directly off the image? By creating an instance of D1 Digital Object, you recognize that the digital image has a separate identity, which is actually true.

Do we all agree that our digital images cannot be deleted because they are conceptual objects? I'm still not convince of this semantic choice?

A copy of the digital object could be deleted, but not the pure idea of the digital object until all copies of it are destroyed. Would you track that anyway?

Where do we draw the line between one Visual Item and another? Since the Visual Item is the only link between our E24 and D1, it's important to be sure that they will never be divided according to a visual issue like: "Oh I don't think it's the same visual item since I can't see the textures on the digital object."

If you don't have the means to distinguish them because of lack of documentation, assume they are the same, to make life feasible?

illip commented 4 years ago

Hi @Habennin,

what is the semantic meaning of source? Does it mean someone who owns or provides the digital image?

Good question, I think we would like to capture both information. It will be important to distinguish both in order to manage properly the rights. @stephenhart8 what do you think?

Are we happy with the CRM pattern for the image URL or we would prefer something like schema:contentURL?

would you put this directly off the image? By creating an instance of D1 Digital Object, you recognize that the digital image has a separate identity, which is actually true.

I totally agree that we need a specific node to represent the digital object. :)

Do we all agree that our digital images cannot be deleted because they are conceptual objects? I'm still not convince of this semantic choice?

A copy of the digital object could be deleted, but not the pure idea of the digital object until all copies of it are destroyed. Would you track that anyway?

So in this case, do we need another node to represent a specific digital object? In other words, the URL is a way to identify a location (no worries, I don't want to open a E53_Place discussion here) of a specific digital representation. I know that we can attach several appellations to an entity but in this case, I'm not sure the URL apply to the purely conceptual object, no? I guess I could colorize a specific digital object with its own URL that will perhaps refer to the purely conceptual one and even if I colorized my version, it still incorporates the same visual item. Perhaps, I'm completely misunderstanding here, but for me, it's not clear which instances should be under "D1_Digital_Object".

Where do we draw the line between one Visual Item and another? Since the Visual Item is the only link between our E24 and D1, it's important to be sure that they will never be divided according to a visual issue like: "Oh I don't think it's the same visual item since I can't see the textures on the digital object."

If you don't have the means to distinguish them because of lack of documentation, assume they are the same, to make life feasible?

I still think this is something we should state clearly because if someone thinks that he should create a distinct E36 according to whatever reason we won't be able to connect E24 and D1. So I guess we should explain clearly that when you have a digital version of something physical we should always use the same E36_Visual_Item for both and this is how our mapping process will proceed. That said, I don't think we will have this kind of issue now because the E36 won't appear in our checklist/Reference.

stephenhart8 commented 3 years ago

In order to facilitate the discussions on this issue, I have summarized the different questions that needs to be answered:

À partir de quand existe-il une distinction entre deux E36 Visual Item? Si un artiste retouche une oeuvre, crée-il un nouveau E36 Visual Item? Si oui, ce nouveau E36 Visual Item contient-il le E36 Visual Item de l'euvre originelle?

What is the difference between two E36 Visual Item? If an artist transform an already existing image, does he create a new E36 Visual Item? If so, does this new E36 Visual Item contains the E36 Visual Item of the original image?
D'une même manière, quelle est la différence entre deux D1 Digital Object? Est-ce que une résolution d'image différente implique la nécessité de créer un nouveau D1 Digital Object?

Similarily, what is the difference between two D1 Digital Object? Does a different image resolution implies a new D1 Digital Object?
Est-ce que P138 represents entre le E36 Visual Item et le E39 Actor est suffisante? Ou faut-il avoir plus de précision dans la description de l'image?

Is the property P138 represents linking the E36 Visual Item to the E39 Actor is enough for describing actors, or should there be a greater precisious in the description of images?
Devrait-on utiliser une autre ontologie pour documenter l'URL d'une image, comme schema:contentURL?

How to model the URL of a digital image? Should we use a simple non-CIDOC ontology, like schema:contentURL?
Si l'on utilise l'ontologie CIDOC CRM, comment représenter les URL des images numériques? Faut-il utiliser la classe E41 Appellation (car la classe E45 Address a été dépréciée) ou E42 Identifier?

If we decide to use CIDOC CRM, how to model URLs? Should we model them as E41 Appellation (as E45 Address has been deprecated) or as an E42 Identifier?
Selon CIDOC CRM, l'URL de l'image devrait elle être le contenu de la classe E41/E42 ou être l'URI de cette classe?

Following CIDOC CRM, where should the URL of the image be? Should it be the content of the E41/E42 Class or the URI of that class?
Est-ce que l'image numérisé doit être considérée comme un objet physique ou plutôt un objet conceptuel? En effet, la classe D1 Digital Object est rdfs:subClassOf de E28 Conceptual Object. Cela signifie que conceptuellement, l'image numérique ne peut pas être détruit. Est-ce problématique?

Should the digital object be considered a physical object or a conceptual one? The class D1 Digital Object is rdfs:subClassOf of E28 Conceptual Object. This means that a digital object cannot be destroyed. Could this be problematic?
Dans le modèle cible 2.0, il n'y a pas de sources associées à l'image. Quelle type de sources devrait-on considérer incorporer? Faut-il mentionner l'acteur qui a fournie l'image, ou l'acteur qui détient les droits? Les deux sont-ils nécessaires? Il me semble que le fournisseur de l'image est déjà documenté dans le Named Graph

In the Target Model 2.0, there is no source associated with the image. What type of source do we want to have: the actor who provided the image or the actor who holds the rights of the image? It seems that the provider is already mentionned in the Named Graph.
Comment documenter la source d'une image? Est-il possible de recourir à l'ontologie DublinCore, notamment la propriété dct:source? Ou faut-il rester au sein de l'ontologie CIDOC CRM?

How to document the source of an image? Should we use the ontology CublinCore, with the property dct:source? Or should we stay within CIDOC CRM?

stephenhart8 commented 3 years ago

From the discussions of the Semantic Committee on the 4th of February 2021, it has been decided that:

The distinction between different instances of E36_Visual_Item is made by domain experts. The model must allow the creation of links between instances of E36_Visual_Item and this should be added to the Target Model. One option is to link the Visual Item that is incorporating another Visual Item with the property P165_incorporates, as follows:
The description of Actors depicted in E36_Visual_Item is insufficient in the Target Model 2.0. The property P138_represents does not allow to document how the actor is represented. It is, therefore, necessary to change the pattern. Few options have been proposed: Using the P138.1_mode_of_representation to document how the actor is represented, or by documenting the event the E39_Visual_Item depicts. In order to choose the appropriate pattern, it is first necessary to list the different fields used to document the mode of representation of an actor in an image. This will be done for the next Semantic Committee meeting on the 4th of March.

marielmat commented 3 years ago

As requested on February meeting, here are some examples of items that represent people and the description of these items.

The information about the "subject" can be found in several fields of our database. For the purpose of this specific issue, I chose examples where the subject represented is described under "Title" and / or "Description".

"Description" correspond to Info-Muse definition. In our database, we "share" this field with "Scope and content" (from the RDDA) for the archives description. The quality of the descriptions is uneven.

2006-981, 2008-1716, 1993.15019, PH2018-198

stephenhart8 commented 3 years ago

Thank you @marielmat for the uses cases

From what I have collected, the information pertaining to the descriptions of actors within images is as follows:

2006-981

Portrait en buste représentant Nicolas Vincent de face vêtu du costume traditionnel des Hurons-Wendats orné de médailles, d'épaulettes ainsi que de brassards.

2008-1716

Peinture polychrome à l'huile sur toile rectangulaire. Portrait en pied de soeur Marie-Bertille de l'Eucharistie en costume de religieuse regardant de face et tenant un crucifix dans ses mains. Paysage à l'arrière-plan et bouquet de lys à l'avant plan. Cadre de bois peint avec inscriptions et armoiries.

1993.15019

Tirage 3/4.

PH2018-198

Photographie de groupe d'élèves et de prêtres prise à l'extérieur lors de la fête de la Saint-Louis. Monseigneur Paul-Eugène Roy est assis au centre de la première rangée.

Those descriptions are full text, and does not describe more semantically the representation of actors. But as highlited, some common descriptors can be gathered and a few fields could be created:

The "shot" of the representation (term coming from filmmaking). It is to say if the actor represented is tatally visible ("portrait en pied"), or partly ("en bust", etc.)
The orientation of the actor. It is to say if an actor is viewed in front ("de face", on the side ("3/4"), etc.
The localisation of the actor within a Visual Item. This can be distinguished between the localisation in regard of the visual item (on the left of the image, etc.) or in regard of other elements within the visual item (on the left of person X, etc.)
The interaction with other entities (costume weared, object held, etc.). The description of this interaction will vary depending on whether the other entity is also documented in a museum. This is the case if an object depicted is also in the museum collection. If it is the case, the description of the interaction can be made in the pattern of the object depiction.

There is two ways to of describing images, both that can be used at the same time:

With a text field, as it is done at the Musée de la Civilisation
With some vocabulary terms in a structured maner

The text field can be done with the use of an E33_Linguistic_Object associated to the E36_Visual_Item.

The more structured description of the representation of an actor through vocabulary terms is more difficult. One way is to use the property P138.1_mode_of_representation, linking the class PC138_Represents with an E55_Type. The scope note of P138.1_mode_of_representation is ambiguous:

“P138.1 mode of representation allows the nature of the representation to be refined” p. 123

It is unclear if this property can be used for the description of actors within visual items, or if it should be used the medium (for example that an actor is represented painted, or photographed, etc.).

My proposition for the description of actors in visual items is as follows:

Image description-Page-1