sailuh / kaiaulu

An R package for mining software repositories
http://itm0.shidler.hawaii.edu/kaiaulu
Mozilla Public License 2.0
20 stars 13 forks source link

Mbox Download Notebook Producing Empty Folders #249

Open lh-zhan opened 1 year ago

lh-zhan commented 1 year ago

Config File:

conf <- yaml::read_yaml("../conf/hadoop.yml") save_path_mbox <- conf[["mailing_list"]][["mbox"]] 
mod_mbox_url <- conf[["mailing_list"]][["domain"]] 
mailing_list <- conf[["mailing_list"]][["list_key"]] 
start_year <- 2009 
end_year <- 2023

While downloading the mbox using this notebook, I noticed that if I specify a start year or end year that's outside of the actual mbox coverage, the notebook would create empty folders within that specified time coverage. Constraints should be added to avoid creating empty folders.

carlosparadis commented 1 year ago

Thank you for this!