Enhance pull_unaids() function to return global estimates using range_speedread()

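Issue

It takes around 400 seconds to read in the full global HIV estimates (HIV Estimates - Integer). What if we use the range_speedread() function to speed up the reading process? In the single-run benchmark below, range_speedread() took about 8 seconds versus about 398 seconds for the current read. Jenny Bryan explains the gist of this here.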

Unit: seconds
  expr       min        lq      mean    median        uq       max neval
 speed   7.99622   7.99622   7.99622   7.99622   7.99622   7.99622     1
  pull 397.76969 397.76969 397.76969 397.76969 397.76969 397.76969     1

Actions suggested

[ ] Enhance the pull_unaids() function with an additional parameter, pepfar_only = TRUE / FALSE. If TRUE, run pull_unaids() as is; if FALSE, use range_speedread() to grab results for all countries/regions. This will likely also require switching the google_id to the global sheet.
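A rough sketch of how the switch could look. This is not the current pull_unaids() code: the function name read_unaids_estimates(), the pepfar_id and glbl_id arguments, and the readers argument are all hypothetical, and it assumes the existing PEPFAR path reads through read_sheet(). The readers are passed in as a list so the branching can be tried without hitting Google Sheets.

```r
# Hypothetical sketch only; names and arguments are assumptions,
# not the mindthegap API.
read_unaids_estimates <- function(pepfar_only = TRUE,
                                  pepfar_id = "[update to PEPFAR sheet id]",
                                  glbl_id = "[update to global sheet id]",
                                  sheet = "HIV Estimates - Integer",
                                  readers = list(
                                    pepfar = googlesheets4::read_sheet,
                                    global = googlesheets4::range_speedread
                                  )) {
  # Require a single TRUE/FALSE so the branch below is unambiguous
  stopifnot(is.logical(pepfar_only),
            length(pepfar_only) == 1,
            !is.na(pepfar_only))

  if (pepfar_only) {
    # Existing behavior (assumed): read the PEPFAR sheet via the Sheets API
    readers$pepfar(pepfar_id, sheet = sheet)
  } else {
    # Proposed behavior: switch the sheet id and use the faster reader
    readers$global(glbl_id, sheet = sheet)
  }
}
```

With real sheet ids filled in, read_unaids_estimates(pepfar_only = FALSE) would take the range_speedread() path. Keeping TRUE as the default leaves existing calls unchanged.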

Here is a code snippet if you want to test the speed of each function:

library(googlesheets4)
library(mindthegap)
library(microbenchmark)

# Placeholder: replace with the actual Google Sheet id
glbl_id <- "UNAIDS 2021 Clean Estimates [update to sheet id]"

# Fast path: range_speedread() bypasses the Sheets API and parses a CSV export
speed_read <- function() {
  range_speedread(glbl_id, sheet = "HIV Estimates - Integer")
}

# Baseline: read_sheet() goes through the Sheets API
pull_read <- function() {
  read_sheet(glbl_id, sheet = "HIV Estimates - Integer")
}

# Run each reader once and compare timings
microbenchmark(
  speed = speed_read(),
  pull = pull_read(),
  times = 1
)

USAID-OHA-SI / mindthegap, issue #13