martigso / stortingscrape

Scrape and structure raw data from the Norwegian parliament's API.
https://martigso.github.io/stortingscrape/
11 stars 0 forks source link
r-package scraping stortinget

stortingscrape

CRAN
Version Github
Version Downloads Total
Downloads R-CMD-check

stortingscrape is an R package for retrieving data from the Norwegian parliament (Stortinget) through their easily accessible back-end API. The data requested using the package require little to no further structuring. The scope of the package ranges from general data on the parliament itself (rules, session info, committees, etc) to data on the parties, bibliographies of the MPs, questions, hearings, debates, votes, and more.

The main goal of stortingscrape is to allow researchers to access any data from the Norwegian parliament easily, but also still be able to structure the data according to ones need. Most importantly, the package is facilitated for weaving together different parts of the data.stortinget.no API.

Installation

Because I’m not in control of the API itself, the CRAN package might at times be outdated. Submitting to CRAN is quite time consuming and will not be done more than once or twice a year.

CRAN (stable version)

The latest stable version of the stortingscrape package can be installed from by running CRAN:

install.packages("stortingscrape")

Github (development version)

The development version of the package can be installed either by cloning this repository and building the package in R or by installing via the devtools::install_github() function:

devtools::install_github("martigso/stortingscrape")
library(stortingscrape)

Usage examples

Request all interpellations for a parliamentary session:

library(stortingscrape)

parl_sessions |> # sessions data are built into the package
  head()         # but can also be retrieved with `get_parlsessions()`

qsesh <- get_session_questions(parl_sessions$id[4], q_type = "interpellasjoner")

int1213 <- list()

for(i in qsesh$id) {

  message("Getting ", i)

  int1213[[i]] <- get_question(i, good_manners = 2)

}

int1213 <- do.call(rbind, int1213)

head(int1213)

Get biographies of all MPs for a given parliamentary period (will take ~30min to run):

parl_periods # parliamentary periods (4 years) are built into the package,
             # but can also be retrieved with `get_parlperiods()`

mps <- get_parlperiod_mps(parl_periods$id[1], substitute = TRUE)

mps_bios <- lapply(mps$mp_id, get_mp_bio, good_manners = 2)

# Expand by all periods the MP has been in parliament
mps_periods <- lapply(mps_bios, function(x){

  data.frame(x$root,
             x$parl_periods)

})

mps_periods <- do.call(rbind, mps_periods)

# Expand by all positions held in parliament
mps_positions <- lapply(mps_bios, function(x){

  if(nrow(x$parl_positions) < 1) return()

  data.frame(x$root,
             x$parl_positions)

})

mps_positions <- do.call(rbind, mps_positions)

Data description

The back-end data is described in detail in the API of Stortinget.

Functions

You can find a list of all functions here.

Citation

Please use the following citation when using the package in research:

Søyland M (2024). “stortingscrape: An R package for accessing data from the Norwegian parliament.” URL: https://martigso.github.io/stortingscrape/articles/stortingscrape.html.

This can, as with all R-packages, also be generated using the citation function:

citation("stortingscrape")