Open hjw0703 opened 2 years ago
Please check off boxes as applicable, and elaborate in comments below. Your review is not limited to these topics, as described in the reviewer guide
Briefly describe any working relationship you have (had) with the package authors.: We are in the same cohort of the UBC MDS Program. I'm the reviewer of this package under the course DSCI524
The package includes all the following forms of documentation:
URL
, BugReports
and Maintainer
(which may be autogenerated via Authors@R
).Estimated hours spent reviewing: 1 hour
Keep up the great work! You made a great improvement to build up the R package for everyone. Here are my suggestions.
return()
only for early returns and rely on the final result of the last evaluated expression. Here are the details of when to use the return()
hereget_correlated_features()
, I think the input argument threshold can be assigned to a default value. It makes the function more intuitive and user-friendlyflag_outliers
, I tried a dataframe with categorical columns. It didn't return a proper error message. It would be great to return error messages such as "The input data must be numeric".flag_outliers
, it would be better if you can include a brief description of how the threshold value works under the hood. Some readers may want to get more details.Overall, I was just being picky and I really find this package very useful!
Please check off boxes as applicable, and elaborate in comments below. Your review is not limited to these topics, as described in the reviewer guide
The package includes all the following forms of documentation:
URL
, BugReports
and Maintainer
(which may be autogenerated via Authors@R
).Estimated hours spent reviewing: 1 hr
Great work again everyone! Please find my suggestions below:
missing_imputer
, it would be nice to list the possible methods of imputation in the roxygen2 comments, similar to how it was shown in the python version of the package.flag_outliers
, it would be useful to clarify how NA values are handled by the function in the percentage calculations (i.e. are they included or excluded?). correlated-features
, exception handling could be added for the consider_sign
argument. correlated-features
, an error could be raised when a data frame contains a non-numeric column to improve usability.Best, Jennifer
Please check off boxes as applicable, and elaborate in comments below. Your review is not limited to these topics, as described in the reviewer guide
The package includes all the following forms of documentation:
URL
, BugReports
and Maintainer
(which may be autogenerated via Authors@R
).Estimated hours spent reviewing: 1hr
Great package and everything worked as expected! Great job again everyone! (I had to be super picky to find any improvements you guys may need XD) please find my comments below:
data
parameter in the Roxygen2 comments of the missing_imputer
functionoverview
function. So far you’ve checked the sum of the means and sum of variance, maybe you can include an additional one for the medians and stds, or just one for the entire df. Overview
function like how you guys did in the other three functions.flag_outliers
function.Please check off boxes as applicable, and elaborate in comments below. Your review is not limited to these topics, as described in the reviewer guide
The package includes all the following forms of documentation:
URL
, BugReports
and Maintainer
(which may be autogenerated via Authors@R
).Estimated hours spent reviewing:
Your package is impressive guys! I just have a few minor comments:
Submitting Author Name: Michelle Wang(@michelle-wms), Jiwei Hu (@hjw0703), Anupriya Srivastava (@Anupriya-Sri), Sam Quist(@squisty)
Repository: (https://github.com/UBC-MDS/nedahelpeR) Version submitted:1.0.0 Submission type: Standard Editor: Michelle Wang(@michelle-wms), Jiwei Hu (@hjw0703), Anupriya Srivastava (@Anupriya-Sri), Sam Quist(@squisty)
Reviewer 1: Ruben De la Garza Macias Reviewer 2: Jennifer Hoang Reviewer 3: Harry Chan Reviewer 4: Junrong Zhu
Scope
* Please fill out a pre-submission inquiry before submitting a data visualization package. For more info, see [notes on categories][NotesOnCategories] of our guidebook.
The package includes functions which can complete the following tasks:
Data scientists who often spend a lot of time preprocessing data to extract useful parts for their analysis.
Technical checks
Confirm each of the following by checking the box.
This package:
Publication options
[x] Do you intend for this package to go on CRAN?
[ ] Do you intend for this package to go on Bioconductor?
[ ] Do you wish to submit an Applications Article about your package to Methods in Ecology and Evolution? If so:
MEE Options
- [ ] The package is novel and will be of interest to the broad readership of the journal. - [ ] The manuscript describing the package is no longer than 3000 words. - [ ] You intend to archive the code for the package in a long-term repository which meets the requirements of the journal (see [MEE's Policy on Publishing Code](http://besjournals.onlinelibrary.wiley.com/hub/journal/10.1111/(ISSN)2041-210X/journal-resources/policy-on-publishing-code.html)) - (*Scope: Do consider MEE's [Aims and Scope](http://besjournals.onlinelibrary.wiley.com/hub/journal/10.1111/(ISSN)2041-210X/aims-and-scope/read-full-aims-and-scope.html) for your manuscript. We make no guarantee that your manuscript will be within MEE scope.*) - (*Although not required, we strongly recommend having a full manuscript prepared when you submit here.*) - (*Please do not submit your package separately to Methods in Ecology and Evolution*)Code of conduct