rOpenGov / psData

An R package to download regularly maintained political science data sets and make commonly used, but infrequently updated variables based on this data.
https://ropengov.github.io/psData/
45 stars 10 forks source link

psData

rOG-badge CRAN
Version R-CMD-check CRAN Monthly
Downloads CRAN
Total Downloads

Started by Christopher Gandrud


This R package includes functions for gathering commonly used and regularly maintained political science data sets. It also includes functions for combining components from these data sets into variables that have been suggested in the political science literature, but are not regularly updated.

psData includes two primary function types: Getters and Variable Builders. Getter functions automate the gathering and cleaning of particular data sets so that they can easily be merged with other data. They do not transform the underlying data. Variable Builders use Getters to gather data and then transform it into new variables suggested by the political science literature. The functions currently part of psData include:

Getters

Variable Builders

Others

Other functions included that might be useful to people working with political science data:


Updates

Most of the Getter functions currently included in psData download data from a specific URL that links to a data file. Hopefully, the data sets’ authors will keep their data up-to-date. When they make updates, they will likely link to the updated file with a new URL. All of the functions in psData that gather data from a file at a specific URL allow the user to specify a new URL, if they want to.

If you notice an updated version of one of the data sets, feel free to submit a Pull Request with the new URL. It would be great if you make sure that the function still works, as the data set’s authors may change the format breaking the Getter function.

Suggestions

Please feel free to suggest other data set downloading and variable creating functions. To do this just leave a note on the package’s Issues page.

Also feel free to make a pull request with a new Getter or Variable Builder. Please make the pull request on a branch other than the master.


Examples

To download only the polity2 variable from Polity IV:

library(psData)
PolityData <- PolityGet(vars = "polity2")

head(PolityData)
#>   iso2c standardized_country     country year polity2
#> 1    AF          Afghanistan Afghanistan 1800      -6
#> 2    AF          Afghanistan Afghanistan 1801      -6
#> 3    AF          Afghanistan Afghanistan 1802      -6
#> 4    AF          Afghanistan Afghanistan 1803      -6
#> 5    AF          Afghanistan Afghanistan 1804      -6
#> 6    AF          Afghanistan Afghanistan 1805      -6

Note that the iso2c variable refers to the ISO two letter country code country ID. This standardised country identifier could be used to easily merge the Polity IV data with another data set. Another country ID can be selected with the OutCountryID argument. See the package documentation for details.

To create winset (W) and selectorate (ModS) data use the following code:

library(psData)

WinData <- WinsetCreator()

head(WinData)
#>   iso2c              country year    W ModS
#> 1    AE United Arab Emirates 1975 0.25 0.25
#> 2    AE United Arab Emirates 1976 0.25 0.25
#> 3    AE United Arab Emirates 1977 0.25 0.25
#> 4    AE United Arab Emirates 1978 0.25 0.25
#> 5    AE United Arab Emirates 1979 0.25 0.25
#> 6    AE United Arab Emirates 1980 0.25 0.25