polifonia-project / sonar2021_demo

This repository is created for the documentation of the Polifonia demo that is going to be presented to SONAR2021
https://polifonia-project.github.io/sonar2021_demo/
2 stars 0 forks source link

Add Place information to datasets #10

Closed enridaga closed 2 years ago

enridaga commented 3 years ago

Place information can be derived from songfacts.

jonnybluesman commented 3 years ago

This is already there, although it is not available for all tracks! As you can see from this example, when information about places is available, you should find: (i) the name of the place; and (ii) a list of facts on that place. In addition, in SongFacts, places and images are also very related. This is all documented in our list here.

enridaga commented 3 years ago

I see, I guess what is missing is coordinate information, to display the place in a map. Maybe something that can be derived from Geonames?

enridaga commented 3 years ago

Another way is using some NER and get the DBpedia/Wikidata entity to get the coordinates from. For example DBpedia Spotlight https://www.dbpedia-spotlight.org/api with your example "Aurora, Illinois":

curl -X GET "https://api.dbpedia-spotlight.org/en/annotate?text=Aurora%2C%20Illinois" -H "accept: application/json"

{
  "@text": "Aurora, Illinois",
  "@confidence": "0.5",
  "@support": "0",
  "@types": "",
  "@sparql": "",
  "@policy": "whitelist",
  "Resources": [
    {
      "@URI": "http://dbpedia.org/resource/Aurora,_Illinois",
      "@support": "1280",
      "@types": "Wikidata:Q515,Wikidata:Q486972,Schema:Place,Schema:City,DBpedia:Settlement,DBpedia:PopulatedPlace,DBpedia:Place,DBpedia:Location,DBpedia:City",
      "@surfaceForm": "Aurora, Illinois",
      "@offset": "0",
      "@similarityScore": "0.9999999999997016",
      "@percentageOfSecondRank": "0.0"
    }
  ]
}

which will allow to get the coords from DBpedia http://dbpedia.org/resource/Aurora,_Illinois

jonnybluesman commented 3 years ago

Got it, then we will enrich all those annotations for which place information is available, ideally after a first step of aggregation to avoid spreading data all over. Thanks!

jonnybluesman commented 3 years ago

Another way is using some NER and get the DBpedia/Wikidata entity to get the coordinates from. For example DBpedia Spotlight https://www.dbpedia-spotlight.org/api with your example "Aurora, Illinois":

curl -X GET "https://api.dbpedia-spotlight.org/en/annotate?text=Aurora%2C%20Illinois" -H "accept: application/json"

{
  "@text": "Aurora, Illinois",
  "@confidence": "0.5",
  "@support": "0",
  "@types": "",
  "@sparql": "",
  "@policy": "whitelist",
  "Resources": [
    {
      "@URI": "http://dbpedia.org/resource/Aurora,_Illinois",
      "@support": "1280",
      "@types": "Wikidata:Q515,Wikidata:Q486972,Schema:Place,Schema:City,DBpedia:Settlement,DBpedia:PopulatedPlace,DBpedia:Place,DBpedia:Location,DBpedia:City",
      "@surfaceForm": "Aurora, Illinois",
      "@offset": "0",
      "@similarityScore": "0.9999999999997016",
      "@percentageOfSecondRank": "0.0"
    }
  ]
}

which will allow to get the coords from DBpedia http://dbpedia.org/resource/Aurora,_Illinois

Looks promising, thanks for the suggestion!