Submission: samplingsimulatorr (R)

Submitting Author: Holly Williams(@hwilliams10), Lise Braaten(@lisebraaten), Tao Gup (@tguo9), Yue (Alex) Jiang (@YueJiangMDSV ), Repository: samplingsimulatorr Version submitted: 1.1.0 Editor: Varada Kolhatkar (@kvarada) Reviewer 1: Ryan Homer (@ryanhomer) Reviewer 2: Jaekeun Lee (@agdal1125) Archive: TBD
Version accepted: TBD

Package: samplingsimulatorr
Title: What the Package Does (One Line, Title Case)
Version: 0.0.0.9000
Authors@R: 
    c(person(given = "Tao",
             family = "Guo",
             role = c("aut", "cre"),
             email = "tguo9@dons.usfca.edu"),
      person(given = "Yue",
             family = "Jiang",
             role = c("aut"),
             email = "yue856@gmail.com"),
      person(given = "Lise",
             family = "Braaten",
             role = c("aut"),
             email = "lisebraaten@gmail.com"),
      person(given = "Holly",
             family = "Williams",
             role = c("aut"),
             email = "Holly.Rourke@gmail.com"))
Description: What the package does (one paragraph).
License: MIT + file LICENSE
Encoding: UTF-8
LazyData: true
Roxygen: list(markdown = TRUE)
RoxygenNote: 7.0.2
Imports: 
    rlang,
    vctrs,
    lifecycle,
    pillar,
    dplyr,
    infer,
    magrittr,
    gridExtra,
    ggplot2
Suggests: 
    testthat (>= 2.1.0),
    covr,
    knitr,
    rmarkdown
URL: https://github.com/UBC-MDS/samplingsimulatorr
BugReports: https://github.com/UBC-MDS/samplingsimulatorr/issues
VignetteBuilder: knitr

Scope

Please indicate which category or categories from our package fit policies this package falls under: (Please check an appropriate box below. If you are unsure, we suggest you make a pre-submission inquiry.):
- [ ] data retrieval
- [ ] data extraction
- [ ] data munging
- [ ] data deposition
- [ ] workflow automataion
- [ ] version control
- [ ] citation management and bibliometrics
- [x] scientific software wrappers
- [ ] database software bindings
- [ ] geospatial data
- [ ] text analysis
Explain how and why the package falls under these categories (briefly, 1-2 sentences):

This package is intended to assist in teaching and/or learning basic statistical inference by allowing users to generate virtual populations to compare and contrast sampling vs sample distributions and parameters.

Who is the target audience and what are scientific applications of this package?
The target audience is instructors and/or students teaching or learning basic statistical inference.
Are there other R packages that accomplish the same thing? If so, how does yours differ or meet our criteria for best-in-category?

To the best of our knowledge, there is currently no existing R package with the specific functionality to create virtual populations and make the specific sample and sampling distributions described above. We do make use of many existing R packages and expand on them to make very specific functions. These include: built-in r distribution functions such as rnorm to sample from distributions rep_sample_n to generate random samples， and ggplot2 to create plots

If you made a pre-submission enquiry, please paste the link to the corresponding issue, forum post, or other discussion, or @tag the editor you contacted.

N/A

Technical checks

Confirm each of the following by checking the box.

[x] I have read the guide for authors and rOpenSci packaging guide.

This package:

[x] does not violate the Terms of Service of any service it interacts with.
[x] has a CRAN and OSI accepted license.
[x] contains a README with instructions for installing the development version.
[x] includes documentation with examples for all functions, created with roxygen2.
[x] contains a vignette with examples of its essential functions and uses.
[x] has a test suite.
[x] has continuous integration, including reporting of test coverage using services such as Travis CI, Coveralls and/or CodeCov.

Publication options

[ ] Do you intend for this package to go on CRAN?
[ ] Do you intend for this package to go on Bioconductor?
[ ] Do you wish to automatically submit to the Journal of Open Source Software? If so:

JOSS Options

- [ ] The package has an **obvious research application** according to [JOSS's definition](https://joss.readthedocs.io/en/latest/submitting.html#submission-requirements). - [ ] The package contains a `paper.md` matching [JOSS's requirements](https://joss.readthedocs.io/en/latest/submitting.html#what-should-my-paper-contain) with a high-level description in the package root or in `inst/`. - [ ] The package is deposited in a long-term repository with the DOI: - (*Do not submit your package separately to JOSS*)

[ ] Do you wish to submit an Applications Article about your package to Methods in Ecology and Evolution? If so:

MEE Options

- [ ] The package is novel and will be of interest to the broad readership of the journal. - [ ] The manuscript describing the package is no longer than 3000 words. - [ ] You intend to archive the code for the package in a long-term repository which meets the requirements of the journal (see [MEE's Policy on Publishing Code](http://besjournals.onlinelibrary.wiley.com/hub/journal/10.1111/(ISSN)2041-210X/journal-resources/policy-on-publishing-code.html)) - (*Scope: Do consider MEE's [Aims and Scope](http://besjournals.onlinelibrary.wiley.com/hub/journal/10.1111/(ISSN)2041-210X/aims-and-scope/read-full-aims-and-scope.html) for your manuscript. We make no guarantee that your manuscript will be within MEE scope.*) - (*Although not required, we strongly recommend having a full manuscript prepared when you submit here.*) - (*Please do not submit your package separately to Methods in Ecology and Evolution*)

Code of conduct

[x] I agree to abide by rOpenSci's Code of Conduct during the review process and in maintaining my package should it be accepted.

Package Review

Please check off boxes as applicable, and elaborate in comments below. Your review is not limited to these topics, as described in the reviewer guide

[x] As the reviewer I confirm that there are no conflicts of interest for me to review this work (If you are unsure whether you are in conflict, please speak to your editor before starting your review).

Documentation

The package includes all the following forms of documentation:

[x] A statement of need clearly stating problems the software is designed to solve and its target audience in README
[x] Installation instructions: for the development version of package and any non-standard dependencies in README
[x] Vignette(s) demonstrating major functionality that runs successfully locally
[x] Function Documentation: for all exported functions in R help
[x] Examples for all exported functions in R Help that run successfully locally
[x] Community guidelines including contribution guidelines in the README or CONTRIBUTING, and DESCRIPTION with URL, BugReports and Maintainer (which may be autogenerated via Authors@R).

For packages co-submitting to JOSS

[ ] The package has an obvious research application according to JOSS's definition

The package contains a paper.md matching JOSS's requirements with:

[ ] A short summary describing the high-level functionality of the software

[ ] Authors: A list of authors with their affiliations

[ ] A statement of need clearly stating problems the software is designed to solve and its target audience.

[ ] References: with DOIs for all those that have one (e.g. papers, datasets, software).

Functionality

[x] Installation: Installation succeeds as documented.
[ ] Functionality: Any functional claims of the software been confirmed.
[ ] Performance: Any performance claims of the software been confirmed.
[ ] Automated tests: Unit tests cover essential functions of the package and a reasonable range of inputs and conditions. All tests pass on the local machine.
[x] Packaging guidelines: The package conforms to the rOpenSci packaging guidelines

Final approval (post-review)

[ ] The author has responded to my review and made changes to my satisfaction. I recommend approving this package.

Estimated hours spent reviewing:

2 hours
[x] Should the author(s) deem it appropriate, I agree to be acknowledged as a package reviewer ("rev" role) in the package DESCRIPTION file.

Review Comments

General Comments

The logo looks awesome and the installation process was super smooth!
Function names in the package are straight forward and self-explanatory. Their purposes were clear and it was easy to understand how to work with them in general.
It didn't take long for me to get familiar with the package.
The examples are mostly well written helping users navigate through the functions.

Test

I see that your test-stat_summary.R uses arbitrary values as input instead of objects created by your functions. The package would be more stable if you try to test with the output of your function objects as input.

Suggestions

It would be better if the authors can add default arguments to the functions to make them more convenient. samplingsimulatorr differentiate from the basic function by shortening the process. Thus providing default arguments would add more values to the package.
The input types are not controlled nor specified in functions. For example, argument n_s in draw_samples() can accept double values instead of an array. In stats_summary the argument parameter the documentation doesn't tell what type of input it takes.
The error messages were unclear to me in general. It would be better if the functions notify the users about the correct type of inputs. I could not figure out how stats_summary() works, although I followed your examples.
In generate_virtual_pop documentation, description about dist seems a bit unclear. For example, dnorm(), dpois() will not work although it is a function of a distribution. Specifying a list of possible functions that you can use for dist in the error message would be super helpful
In draw_samples(), the names of the columns were a bit confusing. replicate, size, rep_size were not very intuitive. Instead, names such as sample group, sample size or using the same argument name reps and n_s would be more informative.

Package Review

Please check off boxes as applicable, and elaborate in comments below. Your review is not limited to these topics, as described in the reviewer guide

[X] As the reviewer I confirm that there are no conflicts of interest for me to review this work (If you are unsure whether you are in conflict, please speak to your editor before starting your review).