Open mmaidana24318 opened 2 years ago
Please check off boxes as applicable, and elaborate in comments below. Your review is not limited to these topics, as described in the reviewer guide
The package includes all the following forms of documentation:
`URL`, `BugReports` and `Maintainer` (which may be autogenerated via `Authors@R`).
Estimated hours spent reviewing: 1 hour
Unit tests run/verified via local instance of package repo.
Summary: Really cool package! I did not run into any issues with installation.
The `extract_ngram` function might be missing some possible n-grams. For example:
> textprepr::extract_ngram(c("one", "two", "three", "four"), n=2)
[1] "one two" "two three" "three four"
Is "four one" also expected to be a valid n-gram returned by this function?
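For context on the question above: under the usual definition, an n-gram is a contiguous window of n tokens, so a vector of k words yields k - n + 1 n-grams, and "four one" would only appear if the vector were treated as circular. The sketch below illustrates both readings. `ngrams_sketch` and its `wrap` argument are hypothetical names used for illustration; this is not the textprepr implementation.

```r
# Hypothetical helper for illustration only; not the textprepr code.
ngrams_sketch <- function(words, n, wrap = FALSE) {
  stopifnot(is.character(words), n >= 1, length(words) >= n)
  # With wrap = TRUE, append the first n - 1 words so that windows
  # can run past the end of the vector and back to the start.
  padded <- if (wrap) c(words, head(words, n - 1)) else words
  starts <- seq_len(length(padded) - n + 1)
  vapply(
    starts,
    function(i) paste(padded[i:(i + n - 1)], collapse = " "),
    character(1)
  )
}

words <- c("one", "two", "three", "four")
linear <- ngrams_sketch(words, n = 2)                 # contiguous windows only
circular <- ngrams_sketch(words, n = 2, wrap = TRUE)  # also includes "four one"
```

If the contiguous reading is the intended one, the output shown in the review is complete and the documentation could simply state that n-grams do not wrap around.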
Estimated hours spent reviewing: 2 hours
A cool and useful package!
Here are some suggestions:
if (length(tweets) < n) {
  stop("length of ngrams should be less than number of words in vector of tweets")
}
if (!is.character(tweets)) {
  stop("input should be a character vector")
}
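One way the suggested checks could be packaged and exercised is sketched below. `validate_ngram_input` is a hypothetical helper name, not an existing textprepr function. The error messages are kept as written in the suggestion, and the type check is placed first as a design choice so that the length check only runs on character input.

```r
# Hypothetical wrapper around the suggested checks; the name
# validate_ngram_input is illustrative and not part of textprepr.
validate_ngram_input <- function(tweets, n) {
  # Check the type first so the length check runs on a character vector.
  if (!is.character(tweets)) {
    stop("input should be a character vector")
  }
  if (length(tweets) < n) {
    stop("length of ngrams should be less than number of words in vector of tweets")
  }
  invisible(TRUE)
}

# Capture the error messages instead of halting the script.
bad_type <- tryCatch(validate_ngram_input(1:3, n = 2), error = conditionMessage)
too_short <- tryCatch(validate_ngram_input(c("one"), n = 2), error = conditionMessage)
ok <- validate_ngram_input(c("one", "two"), n = 2)
```

Capturing the messages with `tryCatch` makes it easy to assert on them in unit tests.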
Estimated hours spent reviewing: 35 minutes
First of all, congratulations on creating a wonderful and useful package. The team has done a great job, and I found the documentation, docstrings and examples very good; they guided me easily through working with the package. My recommendations below are minor changes that I feel could make this already very good package slightly better:
In general, this is great work and I enjoyed using your package!
Estimated hours spent reviewing:
`remove_punct()` function in the documentation website.
`test-generate_cloud.R`. The unit tests appear to test not only the input parameters but also the behaviour of the function. This would prevent someone from accidentally changing the function's behaviour in the wrong way!
Team 21 is rocking! Great work :>
Package Name: textprepr
One-Line Description of Package: Text preprocessing functions specifically designed for tweet data.
Submitting Author Name/ Github Handle:
Repository: https://github.com/UBC-MDS/textprepr
Version submitted: v1.0.0
Submission type: Standard
Editor: @arijc76, @joshsia, @mmaidana24318, @PhilsChan
Reviewers:
Language: en
Scope
Please indicate which category or categories from our package fit policies this package falls under: (Please check an appropriate box below. If you are unsure, we suggest you make a pre-submission inquiry.):
Explain how and why the package falls under these categories (briefly, 1-2 sentences): The package bundles functions to help with cleaning and gaining insight into tweet data, providing additional resources for EDA and enabling feature engineering.
Who is the target audience and what are scientific applications of this package? This package is for people interested in performing data analysis on Twitter data.
Are there other R packages that accomplish the same thing? If so, how does yours differ or meet our criteria for best-in-category? There are no similar R packages available.
(If applicable) Does your package comply with our guidance around Ethics, Data Privacy and Human Subjects Research? N/A
If you made a pre-submission inquiry, please paste the link to the corresponding issue, forum post, or other discussion, or @tag the editor you contacted. N/A
Explain reasons for any `pkgcheck` items which your package is unable to pass. N/A
Technical checks
Confirm each of the following by checking the box.
This package:
Publication options
[ ] Do you intend for this package to go on CRAN?
[ ] Do you intend for this package to go on Bioconductor?
[ ] Do you wish to submit an Applications Article about your package to Methods in Ecology and Evolution? If so:
MEE Options
- [ ] The package is novel and will be of interest to the broad readership of the journal.
- [ ] The manuscript describing the package is no longer than 3000 words.
- [ ] You intend to archive the code for the package in a long-term repository which meets the requirements of the journal (see [MEE's Policy on Publishing Code](http://besjournals.onlinelibrary.wiley.com/hub/journal/10.1111/(ISSN)2041-210X/journal-resources/policy-on-publishing-code.html))
- (*Scope: Do consider MEE's [Aims and Scope](http://besjournals.onlinelibrary.wiley.com/hub/journal/10.1111/(ISSN)2041-210X/aims-and-scope/read-full-aims-and-scope.html) for your manuscript. We make no guarantee that your manuscript will be within MEE scope.*)
- (*Although not required, we strongly recommend having a full manuscript prepared when you submit here.*)
- (*Please do not submit your package separately to Methods in Ecology and Evolution*)

Code of conduct