chin-rcip / collections-model

Linked Open Data Development at the Canadian Heritage Information Network - Développement en données ouvertes et liées au Réseau canadien d'information sur le patrimoine
Creative Commons Zero v1.0 Universal
12 stars 1 forks source link

How to model images and how to integrate images URLs? #17

Closed chin-rcip closed 3 years ago

chin-rcip commented 4 years ago

For the moment we haven't found any solution to model an pattern that depicts images, and makes a difference between the URL of the image, the digital image, the original physical image if it exists, and the link to the actor.

Habennin commented 4 years ago

I am currently working on the Pharos project, a consortium of photo archives. The key question is modelling images. Thus far, we are pretty satisfied with our modelling. In brief the idea is that you model the basic entity as E36 Visual Item and then from there you indicate its materializations which are either connected through p128 to an E22 or p165 to a D1 digital object. A URI is just an instance of E42 with a type 'URI'.

stephenhart8 commented 4 years ago

If I follow what you have done with the Pharos project, we would have the following model: CiC_Issue17-example1

My questions:

  1. What is the difference between the property P128 carries and P65 shows visual item, that seems here more appropriate?
  2. The actual URL would be a literal linked to the E41 Appellation with the property P190 has symbolic content?
Habennin commented 4 years ago
  1. p65 is just a specialization of p128 to say that what is carried is an image.

  2. Yes. there is a dodgy issue here in that a URL and a URI are very close in form so that in fact your URI for your appellation node might be www.something.org/1 and the value put in the RDF literal will also be www.something.org/1. This is redundant, but unavoiable if you want to be able to do things with the appellation node like type it.

stephenhart8 commented 4 years ago

To document the source of the image (where the image comes from), could we add dct:source to the D1 Digital Object?

illip commented 4 years ago

I don't see any reason why we couldn't use dct:source except if we decide to use a CRM pattern like P67i -> E73 -> P2 -> E55 (Source).

That said, I would like to keep this issue open to get a better understanding of the issue. Three items in particular:

  1. Are we happy with the CRM pattern for the image URL or we would prefer something like schema:contentURL?
  2. Do we all agree that our digital images cannot be deleted because they are conceptual objects? I'm still not convince of this semantic choice?
  3. Where do we draw the line between one Visual Item and another? Since the Visual Item is the only link between our E24 and D1, it's important to be sure that they will never be divided according to a visual issue like: "Oh I don't think it's the same visual item since I can't see the textures on the digital object."
stephenhart8 commented 4 years ago

The pattern I proposed on the 12th of December has been added to the TM 2.0 and will be tested in the following months. It has been decided on the 20th of December to leave this issue open in order to continue the discussion, and see if any other pattern seems better.

illip commented 4 years ago

To help us understanding the E38_Visual_Item, this Linked Art discussion might be useful: https://github.com/linked-art/linked.art/issues/289

Habennin commented 4 years ago

what is the semantic meaning of source? Does it mean someone who owns or provides the digital image?

That said, I would like to keep this issue open to get a better understanding of the issue. Three items in particular:

Are we happy with the CRM pattern for the image URL or we would prefer something like schema:contentURL?

would you put this directly off the image? By creating an instance of D1 Digital Object, you recognize that the digital image has a separate identity, which is actually true.

Do we all agree that our digital images cannot be deleted because they are conceptual objects? I'm still not convince of this semantic choice?

A copy of the digital object could be deleted, but not the pure idea of the digital object until all copies of it are destroyed. Would you track that anyway?

Where do we draw the line between one Visual Item and another? Since the Visual Item is the only link between our E24 and D1, it's important to be sure that they will never be divided according to a visual issue like: "Oh I don't think it's the same visual item since I can't see the textures on the digital object."

If you don't have the means to distinguish them because of lack of documentation, assume they are the same, to make life feasible?

illip commented 4 years ago

Hi @Habennin,

what is the semantic meaning of source? Does it mean someone who owns or provides the digital image?

Good question, I think we would like to capture both information. It will be important to distinguish both in order to manage properly the rights. @stephenhart8 what do you think?

Are we happy with the CRM pattern for the image URL or we would prefer something like schema:contentURL?

would you put this directly off the image? By creating an instance of D1 Digital Object, you recognize that the digital image has a separate identity, which is actually true.

I totally agree that we need a specific node to represent the digital object. :)

Do we all agree that our digital images cannot be deleted because they are conceptual objects? I'm still not convince of this semantic choice?

A copy of the digital object could be deleted, but not the pure idea of the digital object until all copies of it are destroyed. Would you track that anyway?

So in this case, do we need another node to represent a specific digital object? In other words, the URL is a way to identify a location (no worries, I don't want to open a E53_Place discussion here) of a specific digital representation. I know that we can attach several appellations to an entity but in this case, I'm not sure the URL apply to the purely conceptual object, no? I guess I could colorize a specific digital object with its own URL that will perhaps refer to the purely conceptual one and even if I colorized my version, it still incorporates the same visual item. Perhaps, I'm completely misunderstanding here, but for me, it's not clear which instances should be under "D1_Digital_Object".

Where do we draw the line between one Visual Item and another? Since the Visual Item is the only link between our E24 and D1, it's important to be sure that they will never be divided according to a visual issue like: "Oh I don't think it's the same visual item since I can't see the textures on the digital object."

If you don't have the means to distinguish them because of lack of documentation, assume they are the same, to make life feasible?

I still think this is something we should state clearly because if someone thinks that he should create a distinct E36 according to whatever reason we won't be able to connect E24 and D1. So I guess we should explain clearly that when you have a digital version of something physical we should always use the same E36_Visual_Item for both and this is how our mapping process will proceed. That said, I don't think we will have this kind of issue now because the E36 won't appear in our checklist/Reference.

stephenhart8 commented 3 years ago

In order to facilitate the discussions on this issue, I have summarized the different questions that needs to be answered:

  1. À partir de quand existe-il une distinction entre deux E36 Visual Item? Si un artiste retouche une oeuvre, crée-il un nouveau E36 Visual Item? Si oui, ce nouveau E36 Visual Item contient-il le E36 Visual Item de l'euvre originelle?

    What is the difference between two E36 Visual Item? If an artist transform an already existing image, does he create a new E36 Visual Item? If so, does this new E36 Visual Item contains the E36 Visual Item of the original image?

  2. D'une même manière, quelle est la différence entre deux D1 Digital Object? Est-ce que une résolution d'image différente implique la nécessité de créer un nouveau D1 Digital Object?

    Similarily, what is the difference between two D1 Digital Object? Does a different image resolution implies a new D1 Digital Object?

  3. Est-ce que P138 represents entre le E36 Visual Item et le E39 Actor est suffisante? Ou faut-il avoir plus de précision dans la description de l'image?

    Is the property P138 represents linking the E36 Visual Item to the E39 Actor is enough for describing actors, or should there be a greater precisious in the description of images?

  4. Devrait-on utiliser une autre ontologie pour documenter l'URL d'une image, comme schema:contentURL?

    How to model the URL of a digital image? Should we use a simple non-CIDOC ontology, like schema:contentURL?

  5. Si l'on utilise l'ontologie CIDOC CRM, comment représenter les URL des images numériques? Faut-il utiliser la classe E41 Appellation (car la classe E45 Address a été dépréciée) ou E42 Identifier?

    If we decide to use CIDOC CRM, how to model URLs? Should we model them as E41 Appellation (as E45 Address has been deprecated) or as an E42 Identifier?

  6. Selon CIDOC CRM, l'URL de l'image devrait elle être le contenu de la classe E41/E42 ou être l'URI de cette classe?

    Following CIDOC CRM, where should the URL of the image be? Should it be the content of the E41/E42 Class or the URI of that class?

  7. Est-ce que l'image numérisé doit être considérée comme un objet physique ou plutôt un objet conceptuel? En effet, la classe D1 Digital Object est rdfs:subClassOf de E28 Conceptual Object. Cela signifie que conceptuellement, l'image numérique ne peut pas être détruit. Est-ce problématique?

    Should the digital object be considered a physical object or a conceptual one? The class D1 Digital Object is rdfs:subClassOf of E28 Conceptual Object. This means that a digital object cannot be destroyed. Could this be problematic?

  8. Dans le modèle cible 2.0, il n'y a pas de sources associées à l'image. Quelle type de sources devrait-on considérer incorporer? Faut-il mentionner l'acteur qui a fournie l'image, ou l'acteur qui détient les droits? Les deux sont-ils nécessaires? Il me semble que le fournisseur de l'image est déjà documenté dans le Named Graph

    In the Target Model 2.0, there is no source associated with the image. What type of source do we want to have: the actor who provided the image or the actor who holds the rights of the image? It seems that the provider is already mentionned in the Named Graph.

  9. Comment documenter la source d'une image? Est-il possible de recourir à l'ontologie DublinCore, notamment la propriété dct:source? Ou faut-il rester au sein de l'ontologie CIDOC CRM?

    How to document the source of an image? Should we use the ontology CublinCore, with the property dct:source? Or should we stay within CIDOC CRM?

stephenhart8 commented 3 years ago

From the discussions of the Semantic Committee on the 4th of February 2021, it has been decided that:

marielmat commented 3 years ago

As requested on February meeting, here are some examples of items that represent people and the description of these items.

The information about the "subject" can be found in several fields of our database. For the purpose of this specific issue, I chose examples where the subject represented is described under "Title" and / or "Description".

"Description" correspond to Info-Muse definition. In our database, we "share" this field with "Scope and content" (from the RDDA) for the archives description. The quality of the descriptions is uneven.

2006-981, 2008-1716, 1993.15019, PH2018-198

stephenhart8 commented 3 years ago

Thank you @marielmat for the uses cases

From what I have collected, the information pertaining to the descriptions of actors within images is as follows:

2006-981

Portrait en buste représentant Nicolas Vincent de face vêtu du costume traditionnel des Hurons-Wendats orné de médailles, d'épaulettes ainsi que de brassards.

2008-1716

Peinture polychrome à l'huile sur toile rectangulaire. Portrait en pied de soeur Marie-Bertille de l'Eucharistie en costume de religieuse regardant de face et tenant un crucifix dans ses mains. Paysage à l'arrière-plan et bouquet de lys à l'avant plan. Cadre de bois peint avec inscriptions et armoiries.

1993.15019

Tirage 3/4.

PH2018-198

Photographie de groupe d'élèves et de prêtres prise à l'extérieur lors de la fête de la Saint-Louis. Monseigneur Paul-Eugène Roy est assis au centre de la première rangée.

Those descriptions are full text, and does not describe more semantically the representation of actors. But as highlited, some common descriptors can be gathered and a few fields could be created:

There is two ways to of describing images, both that can be used at the same time:

The text field can be done with the use of an E33_Linguistic_Object associated to the E36_Visual_Item.

The more structured description of the representation of an actor through vocabulary terms is more difficult. One way is to use the property P138.1_mode_of_representation, linking the class PC138_Represents with an E55_Type. The scope note of P138.1_mode_of_representation is ambiguous:

“P138.1 mode of representation allows the nature of the representation to be refined” p. 123

It is unclear if this property can be used for the description of actors within visual items, or if it should be used the medium (for example that an actor is represented painted, or photographed, etc.).

My proposition for the description of actors in visual items is as follows:

Image description-Page-1

Questions

Christian-McCord commented 3 years ago

Les descripteurs utilisés par le Musée McCord afin de décrire une personne dans une œuvre ont été développés à une époque où les bases de données n’intégraient pas d’images. Le musée adhère au RCIP en 1988 et entreprend le catalogage de ses collections avec le logiciel Advanced Revelation, en version DOS. Les champs Title (différents types), Subject et Description qui servent à identifier et à décrire les personnes compensent l’absence d’une image en étant le plus descriptif possible. Nous utilisons encore ce modèle aujourd’hui avec The Museum System malgré la possibilité de lier une image à l’enregistrement. D’autres champs textes peuvent apporter des informations supplémentaires : Notes, _Curatorial’s Remark_s, Label Text (cartel d’une exposition), Historical Attributions (registres) et Texts Entries (extraits de textes publiés).

Champ Subject – mots-clés et descripteurs iconographiques : Les œuvres iconographiques sont décrites avec des descripteurs de différents niveaux selon le système proposé par Jane E. Falk dans son ouvrage Subject Authority List (San Francisco Museum of Modern Art, 1983). D’un niveau à l’autre, on passe d’une catégorie générique (ex : Portrait) à une sous-catégorie plus spécifique (ex : male), puis la personne est identifiée avec son nom au 3e ou 4e niveau, et avec divers attributs pour les niveaux subséquents. Afin d’établir des liens entre nos différentes collections, des attributs spécifiques réfèrent parfois à celles-ci (ex : Ethnology, aboriginal; fashion, costume, historical, victorian; art, painting; etc.). Chaque catégorie se termine avec un point-virgule pour en introduire une autre. Le Subject Authority List de Falk est ensuite adapté à la spécificité photographique de la collection Notman lorsque celle-ci commence à être cataloguée à partir de 1996. Ex : la catégorie Genre Scene est remplacée par la catégorie Occupation, et une catégorie d’attributs est ajoutée au Thésaurus : Portrait Format (full-length, half-length, head-and-shoulders, three-quarter-length, standing, sitting, etc.). Enfin, pour des représentations de personnes sur des objets de culture matérielle nous utilisons le thésaurus iconographique de Garnier.

Exemples :

  1. Le portrait de Nicolas Vincent (proposé également par @marielmat, 2006-981) image
Christian-McCord commented 3 years ago
  1. Une photo par Jules Ernest Benoît dit Livernois: image
Christian-McCord commented 3 years ago
  1. Un portrait de groupe par Alexander Henderson: image
Christian-McCord commented 3 years ago
  1. Un tableau par Cornelius Krieghoff: image
Christian-McCord commented 3 years ago
  1. Un éventail de fabrication autochtone comprenant des ferrotypes image
Christian-McCord commented 3 years ago
  1. Une caricature par R. Pier publiée dans le Journal de Montréal en 1977 image
stephenhart8 commented 3 years ago

Merci @Christian-McCord pour ces nombreux exemples!

I have quickly looked at them, and it helps to define what we need to describe actors within images.

There are three ways of documenting the subjects of Propositional Object (that contain texts, images, etc.). By a more general property P67_refers_to and to sub-properties P129_is_about and P138_represents.

The property P67_refers_to is very general as it can link any propositional object to another entity it is mentioning, in any way possible. P129_is_about is a more precise property, is it can only be used to describe the main topic(s) of a propositional object, even if there can be multiple ones. Finally P138_represents is a property that directly links a visual item to something it refers to. It does not need to be the main subject.

In the pattern for image description, the only two properties needed are, in my opinion, P138 to identify any entity that is depicted, and P129 when the subject(s) of the visual item has(have) to be documented. What belongs to the subject of is merely an entity depicted is decided by the cataloguer.

In the case of example 6 (M2009.80.15), one option would be to have:

Example_M2009 80 15-2

The distinction between the mode of representation of an actor and the subject or representation of concepts can vary depending on the cataloger. In the case of the example "moustache" of image M2017.46.2.4533 or "smoking" of M2000.95.1, it could either be possible to document the mode of representation of an actor (the person is represented with a moustache or drinking) but it could also be possible to document that those visual items represent the occupation of smoking (M2000.95.1) or has as one of its subjects "moustache" (M2017.46.2.4533).

We will be able to discuss more on the next meeting on the 4th of March

stephenhart8 commented 3 years ago

From the discussions at the semantic committee on the 4th of March, a common agreement on the pattern for Images has been found.

064_Pattern_VisualItemImageUrl_p

The modifications of the pattern have been made in the TM 2.1, except for the description of images and sources, as it will be discussed and decided in other issues. Therefore, this issue is closed.