sampling

GretaTimaite / UNBigDataHackathon2022

https://gretatimaite.github.io/campr/

Open GretaTimaite opened 1 year ago

GretaTimaite commented 1 year ago

A few ideas for sampling:

- random sample from all the countries
- 1 country per continent
- based on GDP
- based on where we're from :)
- clustering algorithm, such as k-means, hierarchical...
  - what would you cluster based on?
- random sample from each GDP per capita category, as in the World Bank data: https://data.worldbank.org/indicator/NY.GDP.PCAP.CD?view=map
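
To make the last idea concrete, here is a minimal Python sketch of a random sample per GDP per capita category. It is not the code from this repo. The country names, the category labels (`low`, `middle`, `high`) and the helper `stratified_sample` are all hypothetical placeholders; real categories would have to come from the World Bank data linked above.

```python
import random
from collections import defaultdict

# Hypothetical placeholder data: country -> GDP-per-capita category.
# Names and categories are illustrative only, not World Bank values.
countries = {
    "Country A": "low",
    "Country B": "low",
    "Country C": "middle",
    "Country D": "middle",
    "Country E": "high",
    "Country F": "high",
}


def stratified_sample(category_by_country, n_per_category=1, seed=0):
    """Draw up to n_per_category countries at random from each category."""
    rng = random.Random(seed)  # fixed seed so the draw is reproducible
    groups = defaultdict(list)
    for country, category in category_by_country.items():
        groups[category].append(country)
    sample = {}
    for category, members in sorted(groups.items()):
        k = min(n_per_category, len(members))  # never ask for more than exist
        sample[category] = rng.sample(sorted(members), k)
    return sample


sample = stratified_sample(countries)
print(sample)
```

Sampling within each category (rather than from the pooled list) guarantees that every category shows up in the sample, which a plain random sample from all countries would not.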

GretaTimaite commented 1 year ago

@Kika0 is on fire!!!

GretaTimaite commented 1 year ago

Done some k-means sampling, here's a result:

But then we thought that maybe it makes sense to predict support for environmental protection for all countries...

EDIT: code can be found in sampling.R
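
For readers without sampling.R to hand, the clustering idea from earlier in the thread (k-means, then pick countries from the clusters) could look roughly like the Python sketch below. This is not the code in sampling.R. It assumes a single numeric feature per country, and the feature values, the country names and the helper `kmeans_1d` are hypothetical. What to actually cluster on is still an open question in this thread.

```python
# Hypothetical placeholder data: country -> one numeric feature.
# Values are illustrative only and not taken from any real dataset.
features = {
    "Country A": 1.0,
    "Country B": 1.5,
    "Country C": 5.0,
    "Country D": 5.5,
    "Country E": 9.0,
    "Country F": 9.5,
}


def kmeans_1d(values, k, n_iter=20):
    """Plain Lloyd's algorithm on a list of numbers; returns the centroids."""
    ordered = sorted(values)
    # Deterministic start: spread the initial centroids across the sorted values.
    centroids = [ordered[(2 * i + 1) * len(ordered) // (2 * k)] for i in range(k)]
    for _ in range(n_iter):
        clusters = [[] for _ in range(k)]
        for v in values:
            nearest = min(range(k), key=lambda j: abs(v - centroids[j]))
            clusters[nearest].append(v)
        # Recompute each centroid as its cluster mean; keep the old one if empty.
        centroids = [
            sum(c) / len(c) if c else centroids[j] for j, c in enumerate(clusters)
        ]
    return centroids


centroids = kmeans_1d(list(features.values()), k=3)

# One representative per cluster: the country closest to each centroid.
representatives = [
    min(features, key=lambda name: abs(features[name] - centre))
    for centre in centroids
]
print(centroids)
print(representatives)
```

Picking the country nearest each centroid is just one way to turn clusters into a sample; drawing a random country from each cluster would work too.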

GretaTimaite commented 1 year ago

Also, I've realised that the World Values Survey might not have data on EACH country, so I'll subset the data to the countries it covers...
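
A minimal Python sketch of that subsetting step, assuming we have a list of candidate countries and a set of countries that appear in the survey data. Both collections are hypothetical placeholders, not the actual World Values Survey coverage.

```python
# Hypothetical placeholders: candidate countries for the sample, and the
# countries assumed to be present in the survey data.
candidates = ["Country A", "Country B", "Country C", "Country D"]
wvs_countries = {"Country A", "Country C", "Country E"}

# Keep only candidates that have survey data; track the rest for reporting.
available = [c for c in candidates if c in wvs_countries]
missing = [c for c in candidates if c not in wvs_countries]

print(available)  # ['Country A', 'Country C']
print(missing)    # ['Country B', 'Country D']
```

Doing this filter before sampling, rather than after, avoids drawing countries that then have to be thrown away.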