OxfordDemSci / ICS_Analysis

Mixed methods approach and interactive dashboard to analyse research impact through Impact Case Studies submitted to the UK's Research Excellence Framework (REF) 2021.
https://shape-impact.co.uk
GNU General Public License v3.0

Some islands not included when "global" beneficiaries #61

Closed: doug-leasure closed this issue 7 months ago

doug-leasure commented 7 months ago

All countries, territories, islands, etc. that are selectable on the "Impact Beneficiaries" map in the dashboard should be included when an ICS indicates "Global" as the beneficiary.

In other words, the minimum value in the legend should equal the number of impact case studies that listed "global" as a beneficiary (n=238?). Currently the minimum value is 1 (e.g. Puerto Rico).
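A minimal sketch of that check, not the dashboard's actual code: it counts case studies per ISO3 code and reports any selectable territory whose count falls below the number of "global" case studies. The function names, the toy data, and the assumption that each case study is available as a list of ISO3 codes are all hypothetical.

```python
from collections import Counter


def beneficiary_counts(case_studies, selectable):
    """Count case studies per ISO3 code, restricted to selectable territories."""
    # Start every selectable territory at zero so that omissions stay visible.
    counts = Counter({code: 0 for code in selectable})
    for iso_codes in case_studies:
        # set() guards against a code being listed twice in one case study.
        for code in set(iso_codes) & selectable:
            counts[code] += 1
    return counts


def below_global_floor(counts, n_global):
    """Return territories counted fewer times than there are global case studies."""
    return sorted(code for code, n in counts.items() if n < n_global)


# Toy data (hypothetical): two "global" case studies whose expansion omits PRI.
selectable = {"GBR", "USA", "PRI"}
case_studies = [
    ["GBR", "USA"],  # global, but PRI is missing from the expansion
    ["GBR", "USA"],  # global, same omission
    ["PRI"],         # names Puerto Rico explicitly
]

counts = beneficiary_counts(case_studies, selectable)
print(min(counts.values()))                    # legend minimum
print(below_global_floor(counts, n_global=2))  # territories below the floor
```

If the global expansion were complete, the second call would return an empty list and the legend minimum would be at least the number of global case studies.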

doug-leasure commented 7 months ago

The problem is that the countries_iso column that we added to the REF database does not include PRI (Puerto Rico), and possibly other territories, when the countries column includes [Global], [worldwide], [international], etc.

doug-leasure commented 7 months ago

The following countries are not included when countries=[Global]: Puerto Rico (PR; PRI), British Virgin Islands (VG; VGB), Anguilla (AI; AIA), Turks and Caicos Islands (TC; TCA), Cayman Islands (KY; CYM), Pitcairn (PN; PCN), French Polynesia (PF; PYF), Niue (NU; NIU), etc.

Note: This list is not comprehensive. I need the full list of tags that we interpreted as indicating global impact (e.g. "Global", "worldwide", "international", etc.). @yifan-lu-300, do you have this list?

yifan-lu-300 commented 7 months ago

Hi @doug-leasure, this is the list of tags indicating global impact:

Notes:

  1. Case Insensitivity: These tags are not sensitive to case: both [worldwide] and [Worldwide] exist in the dataset.

  2. Tags with Additional Descriptors: Occasionally, these tags may include trailing descriptors within brackets, e.g. [Worldwide (encompassing 112 countries)], [Global (by informing Food and Agriculture Organization policy that is applied globally)].

  3. Redundant Spaces: Sometimes there are redundant spaces before or after the tag, e.g. [ global].

  4. Combination Tags: Sometimes, multiple tags appear combined within a single bracket, e.g. [UK; European; USA; China; Global], [UK and Worldwide].
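The four notes above could be handled roughly as follows. This is a sketch under stated assumptions, not the code used to build countries_iso: the keyword set contains only the three terms quoted in this thread (the full list is not reproduced here), and split_tag and is_global are hypothetical helper names.

```python
import re

# Assumed, partial keyword list: only the three terms quoted in this thread.
GLOBAL_KEYWORDS = {"global", "worldwide", "international"}


def split_tag(tag):
    """Split one bracketed tag into lower-cased, whitespace-trimmed parts."""
    # Note 3: drop redundant spaces, then the enclosing brackets.
    inner = tag.strip().strip("[]")
    # Note 2: remove trailing descriptors such as "(encompassing 112 countries)".
    # This runs before splitting so that an "and" inside a descriptor is ignored.
    inner = re.sub(r"\([^)]*\)", "", inner)
    # Note 4: combination tags are split on ";", "," and the word "and".
    parts = re.split(r";|,|\band\b", inner, flags=re.IGNORECASE)
    # Note 1: lower-case every part so matching is case-insensitive.
    return [p.strip().lower() for p in parts if p.strip()]


def is_global(tag):
    """Return True if any part of the tag is a global-impact keyword."""
    return any(part in GLOBAL_KEYWORDS for part in split_tag(tag))


examples = [
    "[Worldwide (encompassing 112 countries)]",
    "[ global]",
    "[UK; European; USA; China; Global]",
    "[UK and Worldwide]",
    "[France]",
]
for tag in examples:
    print(tag, is_global(tag))
```

One caveat of this design: splitting on the word "and" also breaks names such as "Trinidad and Tobago" into two parts. That is harmless for detecting global tags, but the same splitter should not be reused for mapping country names without an exception list.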

doug-leasure commented 7 months ago

Thanks, @yifan-lu-300! This is really helpful, and thanks for keeping such detailed notes.

Do you have similar notes for the regional tags like "Europe", "European Union", "Middle East", "West Africa", etc., and the country list used for each? Is this captured in a machine-readable format (e.g. JSON) anywhere?

Thank you!

doug-leasure commented 7 months ago

The country lists by region and by union are found in ./data/manual/funder_countries/funder_countries_lookup.xlsx