UBC-MDS / software-review

MDS Software Peer Review of MDS-created packages
1 stars 0 forks source link

Submission: RDataPeek (R) #2

Open moniquewong opened 4 years ago

moniquewong commented 4 years ago

Submitting Author: Monique Wong (@moniquewong), Alistair Clark (@alistair-clark) , Miro Hu (@mirohu), Thomas Pin (@MrThomasPin) Repository: https://github.com/UBC-MDS/RDataPeek Version submitted: 1.1.1 Editor: @kvarada Reviewer 1: @cgostic Reviewer 2: @HanyingZhang Archive: TBD Version accepted: TBD


Package: RDataPeek
Title: Generates a Visual Summary of a Dataset
Version: 0.0.0.9000
Authors@R: 
    c(person(given = "Alistair",
           family = "Clark",
           role = c("aut", "cre", "ctb", "cph"),
           email = "alistair.jl.clark@gmail.com"),
       person(given = "Shangjing",
           family = "Hu",
           role = c("aut", "ctb", "cph")),
        person(given = "Thomas",
           family = "Pin",
           role = c("aut", "ctb", "cph")),
        person(given = "Monique",
           family = "Wong",
           role = c("aut", "ctb", "cph")))
Description: RDataPeek is a package that enables data scientists to efficiently generate a visual summary of a dataset. This package includes functions that show the size of the dataset, a visual summary of missing data, a sample of the dataset showing the data types as well as exploratory visualizations for quantitative and qualitative data.This package is also useful for business users who have to interact with data and want to begin exploring the data without using too much code or having to open a potentially large dataset on Excel. 
License: MIT + file LICENSE
Encoding: UTF-8
LazyData: true
Suggests: 
    testthat (>= 2.1.0),
    covr,
    knitr,
    rmarkdown
RoxygenNote: 7.0.2
Imports: 
    magrittr,
    tools,
    readxl,
    reshape2,
    readr,
    ggplot2,
    tm,
    wordcloud,
    dplyr
URL: https://github.com/UBC-MDS/RDataPeek
BugReports: https://github.com/UBC-MDS/RDataPeek/issues
VignetteBuilder: knitr

Scope

  1. Base R’s summary(): This function computes summary statistics for R dataframes. Our package differs in that it aims to offer summary statistics dependent on data type, including long form text data.
  2. R ggplot2: Our package will leverage ggplot2 to create visualizations that summarize the dataset as well as user-defined features in the dataset. There are existing recommended visualizations for exploratory data analysis such as missing data visualizations which will be adapted where appropriate.
  3. R Word Cloud: This package will also be used to create summary visualizations for long form text data. - If you made a pre-submission enquiry, please paste the link to the corresponding issue, forum post, or other discussion, or @tag the editor you contacted.

Technical checks

Confirm each of the following by checking the box.

This package:

Publication options

JOSS Options - [ ] The package has an **obvious research application** according to [JOSS's definition](https://joss.readthedocs.io/en/latest/submitting.html#submission-requirements). - [ ] The package contains a `paper.md` matching [JOSS's requirements](https://joss.readthedocs.io/en/latest/submitting.html#what-should-my-paper-contain) with a high-level description in the package root or in `inst/`. - [ ] The package is deposited in a long-term repository with the DOI: - (*Do not submit your package separately to JOSS*)
MEE Options - [ ] The package is novel and will be of interest to the broad readership of the journal. - [ ] The manuscript describing the package is no longer than 3000 words. - [ ] You intend to archive the code for the package in a long-term repository which meets the requirements of the journal (see [MEE's Policy on Publishing Code](http://besjournals.onlinelibrary.wiley.com/hub/journal/10.1111/(ISSN)2041-210X/journal-resources/policy-on-publishing-code.html)) - (*Scope: Do consider MEE's [Aims and Scope](http://besjournals.onlinelibrary.wiley.com/hub/journal/10.1111/(ISSN)2041-210X/aims-and-scope/read-full-aims-and-scope.html) for your manuscript. We make no guarantee that your manuscript will be within MEE scope.*) - (*Although not required, we strongly recommend having a full manuscript prepared when you submit here.*) - (*Please do not submit your package separately to Methods in Ecology and Evolution*)

Code of conduct

cgostic commented 4 years ago

In Progress

Package Review

Please check off boxes as applicable, and elaborate in comments below. Your review is not limited to these topics, as described in the reviewer guide

Documentation

The package includes all the following forms of documentation:

For packages co-submitting to JOSS

The package contains a paper.md matching JOSS's requirements with:

  • [ ] A short summary describing the high-level functionality of the software
  • [ ] Authors: A list of authors with their affiliations
  • [ ] A statement of need clearly stating problems the software is designed to solve and its target audience.
  • [ ] References: with DOIs for all those that have one (e.g. papers, datasets, software).

Functionality

Final approval (post-review)

Estimated hours spent reviewing:


Review Comments

Functions

README

Vignette

Documentation

Folder Structure

Note that a left a few boxes unchecked above, and I think I addressed the reasoning for each within the comments above. If you're unsure about any unchecked box, feel free to shoot me a message!

HanyingZhang commented 4 years ago

Package Review

Please check off boxes as applicable, and elaborate in comments below. Your review is not limited to these topics, as described in the reviewer guide

Documentation

The package includes all the following forms of documentation:

For packages co-submitting to JOSS

The package contains a paper.md matching JOSS's requirements with:

  • [ ] A short summary describing the high-level functionality of the software
  • [ ] Authors: A list of authors with their affiliations
  • [ ] A statement of need clearly stating problems the software is designed to solve and its target audience.
  • [ ] References: with DOIs for all those that have one (e.g. papers, datasets, software).

Functionality

Final approval (post-review)

Estimated hours spent reviewing:


Review Comments

Functionality

The functions are all well structured. And all run well for me. A little suggestion would be: I noticed that all the functions save the outputs to files. Would it be better to have another option to save outputs as returned objects for users who do not want to save them to files?

Readme

Documentation

Other suggestions

Overall, the package seems to be very useful and practical to me. The helper functions make the main functions very well-structured and hard to break. I couldn't find any major issues. Great work! I am open to any discussion if you have any doubts on these comments.

MrThomasPin commented 4 years ago

Hello Hanying,

I have addressed some of your concerns they have been checked off:

Functionality "Would it be better to have another option to save outputs as returned objects for users who do not want to save them to files?"

Unforchanently some of our plots are base R plots which are unable to be saved as objects. Furthermore, we made the decision to let users download them for their business reports.

Readme

Documentation

Other suggestions

MrThomasPin commented 4 years ago

Hello Cari,

I have addressed some of your concerns they have been checked off:

Functions

README

Vignette

Folder Structure