polifonia-project / sonar2021_demo

This repository is created for the documentation of the Polifonia demo that is going to be presented to SONAR2021
https://polifonia-project.github.io/sonar2021_demo/
2 stars 0 forks source link

OntologyDesign sprint: iteration 1 (places) #14

Open jonnybluesman opened 3 years ago

jonnybluesman commented 3 years ago

As @enridaga suggested, at the current iteration we are focusing on basic concepts + those related to places. Together with @andreamust and @valecarriero, we have filtered the place-related "fields" (from our list [1]) and tried to contextualise them with the current (and the potential) design of the ontology modules.

In sum, these are the data types related to places, with the corresponding entries in [1]:

  1. Place mentioned / subject of the composition, e.g. "Penny Lane, Liverpool, UK" in "Penny Lane" by The Beatles (10);
  2. Facts / curiosities related to the latter, e.g. "In Penny Lane, there is a barber showing photographs..." (11);
  3. Country of each of the artists of a composition, e.g. "GB" (45);
  4. Geographical area where each artist (of a composition) has started their career (46);
  5. Recording place (studio, live venue) of the recording, e.g. "Abbey Road Studios: Studio 2" (65);
  6. Address of the recording place (related to the latter), e.g. "3 Abbey Road, St John's Wood, London" (66);
  7. Places where the release (album) was "made official", e.g. US, EU (75).

Please note that this is more complex from a technical point of view, as MusicBrainz is very granular and provides information at different levels. Indeed, a musical work (the composition) can have N recordings, each of which can be present in M releases. For data collection, we tried to simplify this step, so we can assume a 1:1:1 relationship for the moment.

Anyways, and more importantly, we were wondering if all these data types should be modelled in our ontology modules.

[1] https://liveunibo.sharepoint.com/:x:/r/sites/polifonia/Shared%20Documents/WP6/AI%26Music@Sonar2021/datasets/overview.xlsx?d=w352ea4d8aefe40cbb080d1c291c52998&csf=1&web=1&e=V2BAIm

enridaga commented 3 years ago

Anyways, and more importantly, we were wondering if all these data types should be modelled in our ontology modules. I think they all should be modelled, not sure if the modelling should be done with typing, probably not. But this is a question for the ontology engineering team.

An important question is which ones of these can be added to the datasets for the demo ASAP. (See also #10)

jonnybluesman commented 3 years ago

The datasets where all these fields come from are already in the datasets branch, but we can re-organise the structure of the repository to make it more readable and easy to navigate. From an analysis of the current data, we can conclude that:

In sum, we have all the data but we can rely mostly on 3, 4, 5, 6. As for the ontology modules, we are constantly in contact with @valecarriero, so our concern in this issue was actually shared.

jonnybluesman commented 3 years ago

Some modelling progress from the OD team curating the musical performance module (@valecarriero, @jonnybluesman, @andreamust). This partial model generalises some key parts of MusicBrainz (where we get most of the data), is still aligned with our competency questions, and addresses Places 5, and 6 (see list above).

recording-v2

valecarriero commented 3 years ago

Draft addressing Place 4 (see list above)

image

valecarriero commented 3 years ago

Draft addressing Place 3 (see list above)

image

valecarriero commented 3 years ago

Please note: all object properties in these diagrams will have their inverse.

valecarriero commented 3 years ago

Draft addressing Place 4 (see list above)

image

[update after 23/09/2021 WP2 meeting] Considering that the field we are modelling (artist_start) is ambiguous and not very specific (e.g. London for Queen as the place where the band was formed, Pinner for Elton John as the place where he was born and started to play/its musical activity), we will model this as a binary relation, and then we will have a more complex model if we will have more data e.g. about the timeline (start, end, steps, etc.)

image

valecarriero commented 3 years ago

Some modelling progress from the OD team curating the musical performance module (@valecarriero, @jonnybluesman, @andreamust). This partial model generalises some key parts of MusicBrainz (where we get most of the data), is still aligned with our competency questions, and addresses Places 5, and 6 (see list above).

recording-v2

[update] image

valecarriero commented 3 years ago

model for the coordinates (basically, based on https://w3id.org/italia/onto/CLV/)

image

ccolonna commented 3 years ago

As suggested by @vpresutti I'm gonna add to the KG production rules some binary relations for places from the song to the place, see this related issue:

We need to simplify query to select important examples for the demo! The model/ontology shall be reviewed then