Closed nathansam closed 2 years ago
Dear @nathansam, Thank you for your pre-submission. Thanks for mentioning linelist::guess_dates(), could you also expand this section and discuss lubridate::guess_formats()? Thanks, Julia
Thanks @jooolia, Sorry, I should have mentioned that before! I have now added a section on lubridate's date guessing functions above.
Dear @nathansam, We have determined that this package is in-scope for rOpenSci and we will welcome your full submission. Please add information about the other packages with similar (but not the same) functionality in your Readme. Another editor also mentioned {anytime} as another package worth describing in comparison to yours. Thanks, Julia
That is great news, thank you! My thanks to you and the rest of the team. I will of course do a full submission ASAP.
Submitting Author Name: Nathan Constantine-Cooke Submitting Author Github Handle: !--author1-->@nathansam<!--end-author1-- Repository: https://github.com/nathansam/datefixR Submission type: Pre-submission Language: en
Scope
Please indicate which category or categories from our package fit policies or statistical package categories this package falls under. (Please check an appropriate box below):
Data Lifecycle Packages
[ ] data retrieval
[ ] data extraction
[ ] database access
[x] data munging
[ ] data deposition
[ ] workflow automation
[ ] version control
[ ] citation management and bibliometrics
[ ] scientific software wrappers
[ ] database software bindings
[ ] geospatial data
[ ] text data
Statistical Packages
[ ] Bayesian and Monte Carlo Routines
[ ] Dimensionality Reduction, Clustering, and Unsupervised Learning
[ ] Machine Learning
[ ] Regression and Supervised Learning
[ ] Exploratory Data Analysis (EDA) and Summary Statistics
[ ] Spatial Analyses
[ ] Time Series Analyses
Explain how and why the package falls under these categories (briefly, 1-2 sentences). Please note any areas you are unsure of:
datefixR takes date data in which has been stored in many different formats (01/01/2001, 5 April 2020, Dec 2015 etc.) and converts them to R's
Date
type.NA
Any researchers using data entered via a questionnaire which (unfortunately) asked for a date as free-text. Given the nature of this data generation, this mainly affects those who work with human subjects.
lubridate::guess_formats()
can be used to guess a date format andlubridate::parse_date_time()
calls this function when it attempts to parse a vector into a POSIXct date-time object. However:{lubridate}
then the user is simply told how many dates failed to parse. In{datefixR}
the user is told the ID (assumed to be the first column by default but can be user-specified) corresponding to the date which failed to parse and reports the considered date: making it much easier to figure out which dates supplied failed to parse and why.{lubridate}
. In{datefixR}
, this behaviour can be controlled by themonth.impute
argument.orders
argument, which may result in a date format not being considered if the user forgets to list one of the possible formats. By contrast,{datefixR}
only needs a format to be specified if month-first is to be preferred over day-first when guessing a date.linelist::guess_dates()
appears to have performed a somewhat similar role. However, this function did not leave the experimental lifecycle phase and the package itself is no longer available on CRAN.The package is on CRAN with no reverse dependencies.