UBC-MDS / software-review-2022

0 stars 0 forks source link

Submission Group 4: slimReda (R) #29

Open y248guo opened 2 years ago

y248guo commented 2 years ago

name: slimReda about: A slim R package for EDA


Submitting Author Names and Github Handles: Simon Guo ( !--author1-->@y248guo<!--end-author1--) Anthea Chen ( !--author2-->@anthea98<!--end-author2--) Khalid Abdilahi ( !--author3-->@khalidcawl<!--end-author3--) Taiwo Owoseni ( !--author4-->@thayeylolu<!--end-author4--)

Submitting Author Github Handle: !--author1-->@y248guo<!--end-author1-- Other Package Authors Github handles: (comma separated, delete if none) @anthea98, @khalidcawl, @thayeylolu Repository: https://github.com/UBC-MDS/slimreda Version submitted: 0.2.0 Submission type: Standard Editor: Simon Guo Reviewers: Alex Guo, Nagraj Rao, Adrianne Leung, Rakesh Pandey

Language: en

Package: slimreda
Title: Exploratory Data Analysis 
Version: 0.0.0.9000
Authors@R: 
      c(person("Anthea", "Chen",, "anthea.chen@ubc.ca", role = c("ctb")),
      person("Khalid", "Abdilahi", ,  "khalid.abdilahi@ubc.ca", role = "ctb"),
      person("Simon", "Guo", , "simon.guo@ubc.ca", role = "ctb"),
      person("Taiwo", "Owosenii", , "thayehas@ubc.ca", role = c("aut","ctb","cre")))
Description: 
    Exploratory Data Analysis is an important preparatory work to help data scientists understand and clean up 
    data sets before machine learning begins. However, this step also involves a lot of repetitive tasks. 
    In this context, slimeda will help data scientists quickly complete the initial work of EDA and gain a preliminary understanding of the data.
License: MIT + file LICENSE
Encoding: UTF-8
LazyData: true
Roxygen: list(markdown = TRUE)
RoxygenNote: 7.1.1
Suggests: 
    testthat (>= 3.0.0)
Config/testthat/edition: 3
Imports: 
    purrr,
    dplyr,
    lubridate,
    tibble,
    ggplot2,
    palmerpenguins,
    rlang,
    tidyr

Scope

The package slimreda explores the dataset by creating histograms and correlation plots, in addition to that we have functions for counting missing values and counting unique values

Researchers or data analysts who would like to work with EDA

Yes, but our R package aims to provide a simple straightforward modifiable objects with maximum flexibility. Also comparing many mainstream packages, our package is lighter and simpler.

No

Technical checks

Confirm each of the following by checking the box.

This package:

Publication options

MEE Options - [ ] The package is novel and will be of interest to the broad readership of the journal. - [ ] The manuscript describing the package is no longer than 3000 words. - [ ] You intend to archive the code for the package in a long-term repository which meets the requirements of the journal (see [MEE's Policy on Publishing Code](http://besjournals.onlinelibrary.wiley.com/hub/journal/10.1111/(ISSN)2041-210X/journal-resources/policy-on-publishing-code.html)) - (*Scope: Do consider MEE's [Aims and Scope](http://besjournals.onlinelibrary.wiley.com/hub/journal/10.1111/(ISSN)2041-210X/aims-and-scope/read-full-aims-and-scope.html) for your manuscript. We make no guarantee that your manuscript will be within MEE scope.*) - (*Although not required, we strongly recommend having a full manuscript prepared when you submit here.*) - (*Please do not submit your package separately to Methods in Ecology and Evolution*)

Code of conduct

adrianne-l commented 2 years ago

Package Review

Please check off boxes as applicable, and elaborate in comments below. Your review is not limited to these topics, as described in the reviewer guide

Documentation

The package includes all the following forms of documentation:

Functionality

Estimated hours spent reviewing: 1 hour


Review Comments

Great job guys! I find slimreda a very useful package for initial EDA of any numerical analysis. It is a fairly universal package for application in many fields. The package is organized well with clear differentiation between functions so that user can pick for their own usage.

nrao944 commented 2 years ago

Package Review

Please check off boxes as applicable, and elaborate in comments below. Your review is not limited to these topics, as described in the reviewer guide

Documentation

The package includes all the following forms of documentation:

Functionality

Estimated hours spent reviewing: 1.5 hours


Review Comments.

EDA is so critical for any analysis, and to that extent, I really appreciate the efforts made by the team to make the EDA process clean, straightforward, and intuitive. Thank you for your splendid work!

The downloading process works, and the commands work as displayed in the README. Furthermore, just to double check, I downloaded the project, and ran a check and test command, and everything went smoothly.

While the package is already an A+ level in my assessment, here are a few suggestions for your kind consideration, which would make it even more intuitive for a first time user.

1) Since we did not need to officially publish the package on CRAN, it might be worthwhile to remove this portion from the README.

2) During installation, I was prompted a message - do you want to install the latest version of the dependencies used for your package. It might be useful for the user to specify what option to pick (Yes, No, from Cran etc.).

3) It might be useful to have automated titles for the images or correlation plot being generated.

4) For the miss_count function, the percentages that are displayed could be multiplied by 100 instead of showcasing in decimals.

5) I noticed that the code coverage badge is not included in the README. I would consider adding the same.

6) In the Code of Conduct, under Enforcement, the [INSERT CONTACT METHOD] should be filled.

alexYinanGu0 commented 2 years ago

Package Review

Please check off boxes as applicable, and elaborate in comments below. Your review is not limited to these topics, as described in the reviewer guide

Documentation

The package includes all the following forms of documentation:

Functionality

Estimated hours spent reviewing: 1.5 hour


Review Comments

A version of this package for your version of R might be available elsewhere, see the ideas at https://cran.r-project.org/doc/manuals/r-patched/R-admin.html#Installing-packages


- In code of conduct, there is a part `[INSERT CONTACT METHOD]` needed to be filled out.
- It would be better to include r code coverage badge as well in README.
veerupandey commented 2 years ago

Reviewer: Rakesh Pandey

Package Review

Please check off boxes as applicable, and elaborate in comments below. Your review is not limited to these topics, as described in the reviewer guide

Documentation

The package includes all the following forms of documentation:

Functionality

Estimated hours spent reviewing: 1 hour


Review Comments

Excellent work, Team! Great package; I like the idea of making EDA as simple as possible.

  1. The installation instructions for install_github are clear and easy to follow. It installs without any issue on Ubuntu and Debian.

  2. Because the package is not available on CRAN, the installation instructions should not refer to CRAN.

  3. A link to the contributing file in the README would be beneficial.

  4. Only one developer's name appears in the README Licence section. Ideally, it should include all of the developers.

  5. It would be helpful to include a badge for code coverage. It provides confidence in using an unknown package because it has been thoroughly tested.

  6. In R/cat_unique_counts_script.R, libraries dplyr, purrr, and lubricate were manually imported. However, with R packages, this should be added as an overall dependency, and the function should be referred to with a package name like dplyr::select.