Open AraiYuno opened 2 years ago
Please check off boxes as applicable, and elaborate in comments below. Your review is not limited to these topics, as described in the reviewer guide
The package includes all the following forms of documentation:
URL
, BugReports
and Maintainer
(which may be autogenerated via Authors@R
).Estimated hours spent reviewing: 1
When I saw that code coverage of 100% I got excited that I will be reviewing a well-put-together package, and I am happy to say that I am still very pleased with the package. It is very light weight and easy to use. Here are some minor comments with room for improvement:
detect_outliers
, but both your results and code imply that you have chosen a boxplot. This should be an easy fix.detect_outliers()
to also generate a title.install.packages("snapedautilityR")
. This script does not work and by checking CRAN I realized this package does not exist in their repository. The statement should be taken out. Note: The installation instructions from Github repo worked perfect. plot_histograms
function. The extra arguments should be taken out from plot <- plot_histograms(df, c("species", "bill_length_mm", "island"), 2, 100, 100)
. plot_corr
function in the Usage section of README generates a plot but without colors (just labels). Maybe replace it with the one from your Vignette as that one is working fine.Please check off boxes as applicable, and elaborate in comments below. Your review is not limited to these topics, as described in the reviewer guide
The package includes all the following forms of documentation:
URL
, BugReports
and Maintainer
(which may be autogenerated via Authors@R
).Estimated hours spent reviewing: 1.5
Congrats team on the great development of EDA package in R! The main functions of package are easy e to understand, install, and use. Same as your Python package, this project also performed very well overall, but I do have a few suggestions for improvement :
detect_outlier
, you mentioned that it returns a violin plot; however, it plots a boxplot.detect_outlier
, the first argument s
say that it should be a list of doubles, but users should pass a vector as per your examples. This can be easily fixed by changing the type of s
argument to a vector of doubles in the documentation.plot_histograms
, it says that Detect outliers in the given list
which I think this description is for the detect_outlier
function. In addition, for argument feature
the function accepts a vector, but in documentation, it says List of string feature names
.README.md
file to engage users with your output.README.md
file plot_distribution has been introduced with 4 arguments but in the article section of the website(vignette
), It has been introduced with 2 arguments which I think that just the usage part of README.md
file needs to be updated.Please check off boxes as applicable, and elaborate in comments below. Your review is not limited to these topics, as described in the reviewer guide
The package includes all the following forms of documentation:
URL
, BugReports
and Maintainer
(which may be autogenerated via Authors@R
).Estimated hours spent reviewing: 1.5
Great work on the package guys. One can definitely generate plots within a snap! I reviewed the package and listed below are some suggestions:
The rendered documentation looks good. I noticed that the version next to the title does not match with the current release. Should be an easy fix I suppose.
In the README, the CONTRIBUTING section has a reference to the guidelines Interested in contributing? Check out the contributing guidelines.
, however, I do not see the CONTRIBUTING.md file in the package root folder. Will be good to include the file and link it as well in the README.
The 100% code coverage implies you have done a perfect job of thoroughly testing. There are multiple test that functions in each test script which confused me since there is no overall description as to what aspect of the code is it testing. Should be an easy update to include comments for each test that function.
The USAGE section in the README could include sample output plots generated using the examples provided in the code that will help demonstrate the functions.
There are references to violin plots in the detect_outlier
function, however, the actual code is for the boxplot. Should be an easy fix.
Please check off boxes as applicable, and elaborate in comments below. Your review is not limited to these topics, as described in the reviewer guide
The package includes all the following forms of documentation:
URL
, BugReports
and Maintainer
(which may be autogenerated via Authors@R
).Estimated hours spent reviewing:
Hey team! Nice work. I am late to the game but both my fellow reviewers and myself had to work pretty hard to find issue, which speaks to the good work you all have done. This package is dead easy to use and would save users a nice chunk of time regardless of their project. A few nitpicks below.
100, 100
as arguments but those dont appear to be accurate for this function.
name: snapedautilityR about: snapedautilityR is an open-source library that generate useful function to kickstart EDA (Exploratory Data Analysis) with just a few lines of code.
Submitting Author Name: Kyle Ahn Submitting Author Github Handle: !--author1-->@AraiYuno<!--end-author1-- Other Package Authors Github handles: @harryyikhchan, @dol23asuka Repository: https://github.com/UBC-MDS/snapedautilityR Version submitted: 0.2.0 Submission type: Standard
Editors:
Reviewers:
Reviewer 1: @iamMoid
Reviewer 2: @artanzand
Reviewer 3: @gfairbro
Reviewer 4: @rezam747
Archive: TBD Version accepted: TBD Language: en
Paste the full DESCRIPTION file inside a code block below:
Scope
Please indicate which category or categories from our package fit policies this package falls under: (Please check an appropriate box below. If you are unsure, we suggest you make a pre-submission inquiry.):
Explain how and why the package falls under these categories (briefly, 1-2 sentences): This package offers utility functions that provide the basic EDA visualizations including histograms, outlier detection, and correlation plot.
Who is the target audience and what are the scientific applications of this package? Any data professionals at the entry-level who would like to conduct a quick exploratory data analysis.
Are there other R packages that accomplish the same thing? If so, how does yours differ or meet our criteria for best-in-category? R packages such as
Data Explorer
anddlookr
provide a more broad range of utility functions to conduct data extraction, transformation, and modeling in addition to exploratory data analysis. snapedautilityR aims to only concentrate on the EDA by keeping the package lightweight and less risky dependency conflicts.(If applicable) Does your package comply with our guidance around Ethics, Data Privacy and Human Subjects Research?
If you made a pre-submission inquiry, please paste the link to the corresponding issue, forum post, or other discussion, or @tag the editor you contacted.
Explain reasons for any
pkgcheck
items which your package is unable to pass.Technical checks
Confirm each of the following by checking the box.
This package:
Publication options
[x] Do you intend for this package to go on CRAN?
[x] Do you intend for this package to go on Bioconductor?
[x] Do you wish to submit an Applications Article about your package to Methods in Ecology and Evolution? If so:
MEE Options
- [ ] The package is novel and will be of interest to the broad readership of the journal. - [ ] The manuscript describing the package is no longer than 3000 words. - [ ] You intend to archive the code for the package in a long-term repository which meets the requirements of the journal (see [MEE's Policy on Publishing Code](http://besjournals.onlinelibrary.wiley.com/hub/journal/10.1111/(ISSN)2041-210X/journal-resources/policy-on-publishing-code.html)) - (*Scope: Do consider MEE's [Aims and Scope](http://besjournals.onlinelibrary.wiley.com/hub/journal/10.1111/(ISSN)2041-210X/aims-and-scope/read-full-aims-and-scope.html) for your manuscript. We make no guarantee that your manuscript will be within MEE scope.*) - (*Although not required, we strongly recommend having a full manuscript prepared when you submit here.*) - (*Please do not submit your package separately to Methods in Ecology and Evolution*)Code of conduct