UBC-MDS / software-review-2022

0 stars 0 forks source link

Group 20: clevercloudr (R) #60

Open Arushi282 opened 2 years ago

Arushi282 commented 2 years ago

name: clevercloudr about: Creating meaningful and customized word clouds


Submitting Author Names: Arushi Ahuja @Arushi282, Amelia Tang @aimee0317, Victor Francis @Vikiano, Adrianne Leung @adrianne-l Repository: https://github.com/UBC-MDS/clevercloudr Version submitted: v1.0.1 Submission type: Standard Editors: Arushi Ahuja @Arushi282, Amelia Tang @aimee0317, Victor Francis @Vikiano, Adrianne Leung @adrianne-l Reviewers: Mukund Iyer @miyer26, Hatef Rahmani @hatefr, Arijeet CHATTERJEE @arijc76, John Lee @johnwslee Language: R


Package: clevercloudr
Title: Creating Meaningful and Visually Appealing Word Clouds!
Version: 0.0.0.9000
Authors@R: 
  c(person(given = "Victor",
           family = "Francis",
           role = c("aut", "cre"),
           email = "francisvictorvictor47@gmail.com"),
    person(given = "Amelia",
           family = "Tang",
           role = "ctb",
           email = "aimee.tv21@gmail.com"),
    person(given = "Adrianne",
           family = "Leung",
           role = "ctb",
           email = "adrianne.l.wk@gmail.com"),
    person(given = "Arushi",
           family = "Ahuja",
           role = "ctb",
           email = "arushiahuja28@gmail.com"))
Description: This package is developed to serve as the one-step solution to create meaningful and visually appealing word clouds. To create meaningful word clouds, data scientists typically takes multiple steps to clean the data, such as removing stopwords, removing punctuation and digits, making the letters lower cases, conducting lemmatization and stemming. This package will help data scientists clean the data easily following the common practices and also allow to create more visually appealing word clouds in relevant shapes.
License: MIT + file LICENSE
Encoding: UTF-8
Roxygen: list(markdown = TRUE)
RoxygenNote: 7.1.2
Config/testthat/edition: 3
Imports: 
    corpus,
    dplyr,
    htmlwidgets,
    RColorBrewer,
    stopwords,
    stringr,
    testthat,
    tm,
    usethis,
    vecsets,
    webshot,
    wordcloud,
    wordcloud2

Scope

The package extracts the frequency of words and displays them in a word cloud image that is easy to interpret.

People who need to do basic visualization for Natural Language Processing (NLP).

WordCloud: a word count generator that emphasis more frequently used words from an array of strings and represents them in the form of an image.

What we do differently: (1) Our aim is to improve on the pre-processing of strings before creating a wordcloud in order to make it more user specific and efficient. (2) Word cloud only eliminates limited amount of stopwords, but with our package we are giving users the opportunity to add more stopwords that cater to their analysis. (3) We are focused on removing as many redundant and duplicate words by setting strings to lower case, removing punctuation, lemmatizing and stemming the text.

Technical checks

Confirm each of the following by checking the box.

This package:

Publication options

MEE Options - [ ] The package is novel and will be of interest to the broad readership of the journal. - [ ] The manuscript describing the package is no longer than 3000 words. - [ ] You intend to archive the code for the package in a long-term repository which meets the requirements of the journal (see [MEE's Policy on Publishing Code](http://besjournals.onlinelibrary.wiley.com/hub/journal/10.1111/(ISSN)2041-210X/journal-resources/policy-on-publishing-code.html)) - (*Scope: Do consider MEE's [Aims and Scope](http://besjournals.onlinelibrary.wiley.com/hub/journal/10.1111/(ISSN)2041-210X/aims-and-scope/read-full-aims-and-scope.html) for your manuscript. We make no guarantee that your manuscript will be within MEE scope.*) - (*Although not required, we strongly recommend having a full manuscript prepared when you submit here.*) - (*Please do not submit your package separately to Methods in Ecology and Evolution*)

Code of conduct

johnwslee commented 2 years ago

Package Review

Please check off boxes as applicable, and elaborate in comments below. Your review is not limited to these topics, as described in the reviewer guide

Documentation

The package includes all the following forms of documentation:

Functionality

Estimated hours spent reviewing: 1.5 hrs


Review Comments

  1. Installation guide was simple and easy to follow. Installation was successful without any difficulties. The example code in README was very helpful for me to check the functionality of your package.

  2. The function codes are well written and well documented. Like the case of python package, addition of more codes for testing would increase the robustness of the package.

  3. The badges in the README are really helpful for others to quickly find out the status of the package and the repo. It is great that the code coverage is higher than 90%.

  4. I am just wondering why JavaScript is being displayed as the mostly used language in your GitHub repo. You can find the languages section in the righthand side of your GitHub repo.

  5. I would be better to clean up the To do lists from the project board. Overall, your package and repo looks great.

arijeetchatterjee commented 2 years ago

Package Review

Please check off boxes as applicable, and elaborate in comments below. Your review is not limited to these topics, as described in the reviewer guide

Documentation

The package includes all the following forms of documentation:

Functionality

Estimated hours spent reviewing: 1 hour


Review Comments

I installed the package without any issues and I could follow the example as provided in the README. The functionality of the package looks great. I have mentioned my observations below:

From the code and functionality perspective, I don't think I can suggest any improvements that are required in this package at this stage. Once again, a great job on developing this package.

hatefr commented 2 years ago

Package Review

Please check off boxes as applicable, and elaborate in comments below. Your review is not limited to these topics, as described in the reviewer guide

Documentation

The package includes all the following forms of documentation:

Functionality

Estimated hours spent reviewing:


Review Comments

I was able to successfully install the package without any issues. In general, the package looks great and is very useful. I only have some minor suggestions to further improve the package:

Overall it is a very awesome package, and great work Group 20!

miyer26 commented 2 years ago

Package Review

Please check off boxes as applicable, and elaborate in comments below. Your review is not limited to these topics, as described in the reviewer guide

Documentation

The package includes all the following forms of documentation:

Functionality

Estimated hours spent reviewing: 1 hour


Review Comments

Great work on this package! I had no issues installing it and running the example provided under the usage section. Please find below some comments that maybe useful for further development:

1) I see that the CONTRIBUTING file is missing from the root directory. It will be useful to have, and link this to the Contributing section in the README.

2) It will be nice to see the rendered page of the package documentation including the vignette linked on the README.

3) The LICENSE.md file does not list the name of the authors explicitly.

4) The fit within the R ecosystem is well explained. However, other packages that carry out text processing can be mentioned as they are also similar to some of the functions included in this package.

5) The worldcloud2 files are currently under the test folder - perhaps it would make more sense for it to reside under the src folder?

All in all this package provides fantastic utility so kudos to all of you!