UBC-MDS / software-review-2021


Submission: eaziReda (R) #20

Open arashshams opened 3 years ago

arashshams commented 3 years ago

Submitting Author:

Repository: eaziReda Version submitted: 0.2.0 (TBD) Editor: TBD Reviewers: TBD

Archive: TBD Version accepted: TBD


Package: eaziReda
Title: A Quick And Easy Way To Do EDA And Preprocessing
Version: 0.0.0.9000
Authors@R:
    c(
    person(given = "Dustin",
           family = "Andrews",
           role = c("aut", "cre"),
           email = "dandrew9@student.ubc.ca"),
    person(given = "Vignesh",
           family = "Rajakumar",
           role = c("aut")),
    person(given = "Arash",
           family = "Shamseddini",
           role = c("aut")),
    person(given = "Yuyan",
           family = "Guo",
           role = c("aut"))
    )
Description: Almost every data analysis project involves some exploratory data analysis (EDA) and data preprocessing.
  These steps are a crucial and unavoidable part of a data analysis workflow.
  Common EDA tasks include checking for missing values, detecting outliers, plotting correlations between features,
  and plotting histograms or bar plots for each individual feature.
  Typically these steps are followed by some preprocessing, such as imputation and handling outliers.
  Together, these steps can require a lot of code that is repeated across projects.
  To solve this issue, we designed the R package eaziReda, which wraps that code into four convenient
  functions that let you quickly and easily carry out EDA along with some simple preprocessing using just four lines of code!
License: MIT + file LICENSE
Encoding: UTF-8
LazyData: true
Roxygen: list(markdown = TRUE)
RoxygenNote: 7.1.1
Imports: 
    magrittr,
    ggplot2,
    dplyr,
    cowplot,
    tidyr,
    tidyselect,
    rlang,
    vdiffr (>= 0.3.3),
    tibble,
    isotree,
    data.table,
    purrr
Suggests: 
    testthat (>= 3.0.0),
    covr
Config/testthat/edition: 3
Remotes: 
    r-lib/vdiffr
URL: https://ubc-mds.github.io/eaziReda, https://github.com/UBC-MDS/eaziReda
BugReports: https://github.com/UBC-MDS/eaziReda/issues

Scope

eaziReda produces interactive plots (e.g. histograms and correlation plots) that show the distribution and correlation of features in a given dataset. It also supports data wrangling: at its core, it is designed to deal with missing data and outliers.

The target audience is anyone who wants an interactive visualization of the dataset at hand, as well as people who want to do some quick data munging, especially if their dataset contains missing values and outliers.

There are similar R packages such as "SmartEDA" and "dlookr", but eaziReda focuses on handling the most commonly needed EDA and data wrangling tasks quickly and conveniently. Another difference is that eaziReda is quite lightweight.
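To make the tasks named above concrete, here is a minimal base-R sketch of the kind of manual code the package is meant to replace: counting missing values, flagging outliers, computing correlations, and imputing. It does not use eaziReda and does not reflect its API. The toy data frame, the helper names `flag_outliers` and `impute_median`, the 1.5 × IQR rule, and median imputation are all assumptions chosen for illustration.

```r
# Toy data (hypothetical): one missing value per column, one extreme height.
df <- data.frame(
  height = c(150, 160, NA, 170, 165, 400),
  weight = c(58, 60, 64, NA, 62, 61)
)

# 1. Missing values: number of NA entries in each column.
missing_counts <- colSums(is.na(df))

# 2. Outliers: flag values more than 1.5 * IQR beyond the quartiles.
#    (An assumed rule for this sketch, not necessarily the package's method.)
flag_outliers <- function(x) {
  q <- unname(quantile(x, c(0.25, 0.75), na.rm = TRUE))
  fence <- 1.5 * (q[2] - q[1])
  !is.na(x) & (x < q[1] - fence | x > q[2] + fence)
}
outlier_flags <- lapply(df, flag_outliers)

# 3. Correlation between features, using pairwise complete observations.
cor_matrix <- cor(df, use = "pairwise.complete.obs")

# 4. Simple preprocessing: replace each NA with the column median.
impute_median <- function(x) {
  x[is.na(x)] <- median(x, na.rm = TRUE)
  x
}
df_imputed <- as.data.frame(lapply(df, impute_median))
```

Even for two columns this takes several helper functions, which is the repetition that a small set of wrapper functions would remove.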

Technical checks

Confirm each of the following by checking the box.

This package:

Publication options

MEE Options

  • [ ] The package is novel and will be of interest to the broad readership of the journal.
  • [ ] The manuscript describing the package is no longer than 3000 words.
  • [ ] You intend to archive the code for the package in a long-term repository which meets the requirements of the journal (see [MEE's Policy on Publishing Code](http://besjournals.onlinelibrary.wiley.com/hub/journal/10.1111/(ISSN)2041-210X/journal-resources/policy-on-publishing-code.html))
  • (*Scope: Do consider MEE's [Aims and Scope](http://besjournals.onlinelibrary.wiley.com/hub/journal/10.1111/(ISSN)2041-210X/aims-and-scope/read-full-aims-and-scope.html) for your manuscript. We make no guarantee that your manuscript will be within MEE scope.*)
  • (*Although not required, we strongly recommend having a full manuscript prepared when you submit here.*)
  • (*Please do not submit your package separately to Methods in Ecology and Evolution*)

Code of conduct

dusty736 commented 3 years ago

Package Review

I am a colleague of the authors in the UBC MDS program and have worked with them in the past. There is no conflict of interest.

Documentation

The package includes all the following forms of documentation:

For packages co-submitting to JOSS

The package contains a paper.md matching JOSS's requirements with:

  • [x] A short summary describing the high-level functionality of the software
  • [x] Authors: A list of authors with their affiliations
  • [x] A statement of need clearly stating problems the software is designed to solve and its target audience.
  • [x] References: with DOIs for all those that have one (e.g. papers, datasets, software).

Functionality

Estimated hours spent reviewing: 3


Review Comments

Before getting to the suggestions for improvement, I just want to say that I think you all did a great job. The install was smooth, all of the examples ran for each function, and the vignette was easy to follow. The target audience and purpose of this package were very clear to me. Overall, I had to get very nit-picky to find feedback.

Checks:

Constructive Feedback:

dbandrews commented 3 years ago

Thanks @dusty736 for the review - we'll definitely be fixing the majority of issues you've raised here this week.

dusty736 commented 3 years ago

@dbandrews - Absolutely! Great job!

ssyayayy commented 3 years ago

Package Review

Documentation

The package includes all the following forms of documentation:

For packages co-submitting to JOSS

The package contains a paper.md matching JOSS's requirements with:

  • [ ] A short summary describing the high-level functionality of the software
  • [ ] Authors: A list of authors with their affiliations
  • [ ] A statement of need clearly stating problems the software is designed to solve and its target audience.
  • [ ] References: with DOIs for all those that have one (e.g. papers, datasets, software).

Functionality

Estimated hours spent reviewing: 3 hours


Review Comments

Hi eaziReda developers,

Thanks for delivering this great package to the R community. I really enjoyed reading through your vignette, and I was inspired by the ideas in your package. I was able to install the package and run the vignette file successfully. The design of the histograms is neat and professional. The code for all functions is easy to follow and well commented. Here are a few thoughts that you may consider implementing in the eaziReda package in the future.

Overall, you all did a great job on this project and I can see analysts using it in the future! Hope my thoughts above are not hard to follow. Please feel free to contact me if you have any questions or concerns!

Thanks, Ivy