mintproject / MINT-DataCatalog-Public

Public-facing aspects of data catalog, such as documentation, demos, tracking issues, and feature requests
Apache License 2.0
1 stars 1 forks source link

SVO names for "UNHCR's populations of concern" #7

Open dnfeldman opened 5 years ago

dnfeldman commented 5 years ago

Dataset Name: UNHCR's populations of concern originating from Ethiopia Dataset Source: https://data.humdata.org/dataset/refugees-originating-eth Sample File URL: http://proxy.hxlstandard.org/data.csv?url=http%3A//popstats.unhcr.org/en/persons_of_concern.hxl&filter01=select&select-query01-01=%23country%2Borigin=Ethiopia

Variables

For variables that need to be linked to SVO, please provide variable's name (as it appears in the data file), its description, and its type (categorical or numerical).

  1. date+year:
    • description: Year
    • type: numerical.discrete
  2. country+residence:
    • description: Country / territory of asylum/residence
    • type: categorical.nominal
  3. country+origin:
    • description: Origin
    • type: categorical.nominal
  4. affected+refugees:
    • description: Refugees (incl. refugee-like situations)
    • type: numerical.discrete
  5. affected+asylum:
    • description: Asylum-seekers (pending cases)
    • type: numerical.discrete
  6. affected+returned_refugees:
    • description: Returned refugees
    • type: numerical.discrete
  7. affected+idps:
    • description: Internally displaced persons (IDPs)
    • type: numerical.discrete
  8. affected+returned_idps:
    • description: Returned IDPs
    • type: numerical.discrete
  9. affected+stateless:
    • description: Stateless persons
    • type: numerical.discrete
  10. affected+others:
    • description: Others of concern
    • type: numerical.discrete
  11. affected+total:
    • description: Total Population
    • type: numerical.discrete
mariutzica commented 5 years ago

@dnfeldman Below are the suggested SVO labels. Please let me know if you detect anything amiss.

1 date+year: ◦ description: Year ◦ type: numerical.discrete ◦ ontology label: event__observation_year 2 country+residence: ◦ description: Country / territory of asylum/residence ◦ type: categorical.nominal ◦ ontology label: persons~of-concern~departed_region~country-or-territory~asylum-or-residence__name 3 country+origin: ◦ description: Origin ◦ type: categorical.nominal ◦ ontology label: persons~of-concern_region~country~origin__name 4 affected+refugees: ◦ description: Refugees (incl. refugee-like situations) ◦ type: numerical.discrete ◦ ontology label: region~country~origin_region~country-or-territory~asylum-or-residence_persons~refugees-and-persons~refugee-like__count 5 affected+asylum: ◦ description: Asylum-seekers (pending cases) ◦ type: numerical.discrete ◦ ontology label: region~country~origin_region~country-or-territory~asylum-or-residence_persons~asylum-seekers__count 6 affected+returned_refugees: ◦ description: Returned refugees ◦ type: numerical.discrete ◦ ontology label: region~country~origin_region~country-or-territory~asylum-or-residence_persons~refugees~returned__count 7 affected+idps: ◦ description: Internally displaced persons (IDPs) ◦ type: numerical.discrete ◦ ontology label: region~country~origin_persons~internally-displaced__count 8 affected+returned_idps: ◦ description: Returned IDPs ◦ type: numerical.discrete ◦ ontology label: region~country~origin_persons~internally-displaced~returned__count 9 affected+stateless: ◦ description: Stateless persons ◦ type: numerical.discrete ◦ ontology label: region~country~origin_persons~stateless__count 10 affected+others: ◦ description: Others of concern ◦ type: numerical.discrete ◦ ontology label: region~country~origin_persons~others-of-concern__count 11 affected+total: ◦ description: Total Population ◦ type: numerical.discrete ◦ ontology label: region~country~origin_persons~of-concern__count

dnfeldman commented 5 years ago

Thanks!

These look good to me. Quick question, how come for some (e.g. #6), you have ...~country-or-territory~... but for others (e.g., #10), it's only ~country~ (no -or-territory)?

mariutzica commented 5 years ago

Hi Dan, for this particular dataset, the country of origin is always Ethiopia, so we can be specific and say region~country~origin. On the other hand, the destination could be a country or a territory, so this identifier is less specific region~country-or-territory~... if the dataset had contained information about persons from origins that were territories as well, then we would have used region~country-or-territory~origin to denote the origin location. We attempt to always provide the most specific label for a grouping. I hope that makes sense.

On Jun 28, 2019, at 3:09 PM, dnfeldman notifications@github.com wrote:

Thanks!

These look good to me. Quick question, how come for some (e.g. #6), you have ...~country-or-territory~... but for others (e.g., #10), it's only ~country~ (no -or-territory)?

— You are receiving this because you were assigned. Reply to this email directly, view it on GitHub, or mute the thread.