Open calsvein opened 3 years ago
Please check off boxes as applicable, and elaborate in comments below. Your review is not limited to these topics, as described in the reviewer guide
I am currently taking classes with the UBC MDS program alongside the authors of this package.
[x] As the reviewer I confirm that there are no conflicts of interest for me to review this work
The package includes all the following forms of documentation:
URL
, BugReports
and Maintainer
(which may be autogenerated via Authors@R
).For packages co-submitting to JOSS
- [ ] The package has an obvious research application according to JOSS's definition
The package contains a
paper.md
matching JOSS's requirements with:
- [ ] A short summary describing the high-level functionality of the software
- [ ] Authors: A list of authors with their affiliations
- [ ] A statement of need clearly stating problems the software is designed to solve and its target audience.
- [ ] References: with DOIs for all those that have one (e.g. papers, datasets, software).
Estimated hours spent reviewing: 1.5 hours
After installing and running your package, I can say that everything on my end is running smoothly. Good Job!
My main feedback falls into 2 categories, repository quality and consistency between functions. Keep in mind that I am being picky here and that overall the project is very good.
Repository
The link in your CONTRIBUTING.md that directs users to your repository CODE_OF_CONDUCT.md is broken. (I believe you might have shifted file structure and forgotten to alter links)
I found the Fixing Typos section of the CONTRIBUTING.md a little misleading at first. To me it suggests that I should be able to directly edit your repository on GitHub if I want to make a typo fix. In reality users still must fork the repo. Consider either making this more explicit in the section or moving the Fixing Typos section to the bottom of the file, after an explanation has been given about forking the repo.
My suggestion would be to include all authors within the LICENSE.md. Currently Syad is the only one.
Consider changing the file structure. Specifically, I struggled to find the CONTRIBUTING.md as it was in the .github directory. Please keep in mind that if you make this change, you will need to alter all links that are referencing it at it's current location.
Function Consistency
I notice that some functions explicitly use the return
keyword, while others just call the returned object. Consider keeping this consistent between functions for a more professional feel to your code.
I notice that although the roxygen2 style is followed in general, the content of the docstrings are slightly inconsistent. Specifically, some @params
will list the object and it's type, while others will list the object along with a brief explanation of the object. Consider keeping this consistent between functions for a more professional feel to your code.
Whilst reading through your docstrings, I made small edits that you may or may not want to implement. The 3 things that I did were:
#'
)#' #'
)You can find my suggested changes here in a PR.
Overall great work. I look forward to hearing from you.
So nice of you to open a PR to send your suggested changes @nphaterp!
Please check off boxes as applicable, and elaborate in comments below. Your review is not limited to these topics, as described in the reviewer guide
The package includes all the following forms of documentation:
URL
, BugReports
and Maintainer
(which may be autogenerated via Authors@R
).For packages co-submitting to JOSS
- [ ] The package has an obvious research application according to JOSS's definition
The package contains a
paper.md
matching JOSS's requirements with:
- [ ] A short summary describing the high-level functionality of the software
- [ ] Authors: A list of authors with their affiliations
- [ ] A statement of need clearly stating problems the software is designed to solve and its target audience.
- [ ] References: with DOIs for all those that have one (e.g. papers, datasets, software).
Estimated hours spent reviewing: 1
clean_df
function, exception is handled for the raw_tweets_df
, consider exception handling for other parameters?
name: rtweetclean about: Use this template to submit software for review
Submitting Author:
Repository: https://github.com/UBC-MDS/rtweetclean Version submitted: 0.3 Editor: Tiffany Timbers(@ttimbers ) Reviewers: TBD
Archive: TBD Version accepted: TBD
Scope
Please indicate which category or categories from our package fit policies this package falls under: (Please check an appropriate box below. If you are unsure, we suggest you make a pre-submission inquiry.):
Explain how and why the package falls under these categories (briefly, 1-2 sentences):
The tweepy package extracts tweet data, but it is not in a format that it is ready for analysis. Tweepyclean performs functions to convert tweepy extracted data into a machine-readable dataframe, performs feature engineering, and creates summary statistics and basic visualizations.
The audience is strictly intended for those who are already using the tweepy package and have a Twitter API key.
Not that I am aware of.
(If applicable) Does your package comply with our guidance around Ethics, Data Privacy and Human Subjects Research?
If you made a pre-submission enquiry, please paste the link to the corresponding issue, forum post, or other discussion, or @tag the editor you contacted.
Technical checks
Confirm each of the following by checking the box.
This package:
Publication options
[ ] Do you intend for this package to go on CRAN?
[ ] Do you intend for this package to go on Bioconductor?
[ ] Do you wish to submit an Applications Article about your package to Methods in Ecology and Evolution? If so:
MEE Options
- [ ] The package is novel and will be of interest to the broad readership of the journal. - [ ] The manuscript describing the package is no longer than 3000 words. - [ ] You intend to archive the code for the package in a long-term repository which meets the requirements of the journal (see [MEE's Policy on Publishing Code](http://besjournals.onlinelibrary.wiley.com/hub/journal/10.1111/(ISSN)2041-210X/journal-resources/policy-on-publishing-code.html)) - (*Scope: Do consider MEE's [Aims and Scope](http://besjournals.onlinelibrary.wiley.com/hub/journal/10.1111/(ISSN)2041-210X/aims-and-scope/read-full-aims-and-scope.html) for your manuscript. We make no guarantee that your manuscript will be within MEE scope.*) - (*Although not required, we strongly recommend having a full manuscript prepared when you submit here.*) - (*Please do not submit your package separately to Methods in Ecology and Evolution*)Code of conduct