UBC-MDS / software-review-2022

0 stars 0 forks source link

Submission Group 03: gdphelper (Python) #13

Open gfairbro opened 2 years ago

gfairbro commented 2 years ago

Submitting Author: Name @aldojasb, @gfairbro, @ramiromejia, @gloriawyy Package Name: gdphelper One-Line Description of Package: Data extractor, cleaner and summarizer of Canadian historical GDP Data. Repository Link: https://github.com/UBC-MDS/gdphelper Version submitted: v 1.1.8 ( https://github.com/UBC-MDS/gdphelper/tree/v1.1.8)


Description

This package is designed to take the url of any of the several dozen GDP-related csv datasets from the Canadian Government Open Data Portal and download, clean load, summarize and visualize the data contained within.

Scope

* Please fill out a pre-submission inquiry before submitting a data visualization package. For more info, see notes on categories of our guidebook.

This package is used to download, extract and format data from the open government portal.

Data analysts, statisticians or any interested party looking to use data on historical GDP across Canada.

Not with specific scope for this data.

Technical checks

For details about the pyOpenSci packaging requirements, see our packaging guide. Confirm each of the following by checking the box. This package:

Publication options

JOSS Checks - [ ] The package has an **obvious research application** according to JOSS's definition in their [submission requirements][JossSubmissionRequirements]. Be aware that completing the pyOpenSci review process **does not** guarantee acceptance to JOSS. Be sure to read their submission requirements (linked above) if you are interested in submitting to JOSS. - [ ] The package is not a "minor utility" as defined by JOSS's [submission requirements][JossSubmissionRequirements]: "Minor ‘utility’ packages, including ‘thin’ API clients, are not acceptable." pyOpenSci welcomes these packages under "Data Retrieval", but JOSS has slightly different criteria. - [ ] The package contains a `paper.md` matching [JOSS's requirements][JossPaperRequirements] with a high-level description in the package root or in `inst/`. - [ ] The package is deposited in a long-term repository with the DOI: *Note: Do not submit your package separately to JOSS*

Are you OK with Reviewers Submitting Issues and/or pull requests to your Repo Directly?

This option will allow reviewers to open smaller issues that can then be linked to PR's rather than submitting a more dense text based review. It will also allow you to demonstrate addressing the issue via PR links.

Code of conduct

P.S. *Have feedback/comments about our review process? Leave a comment here

Editor and Review Templates

Editor and review templates can be found here

nobbynguyen commented 2 years ago

Package Review

Please check off boxes as applicable, and elaborate in comments below. Your review is not limited to these topics, as described in the reviewer guide

Documentation

The package includes all the following forms of documentation:

Readme requirements The package meets the readme requirements below:

The README should include, from top to bottom:

Usability

Reviewers are encouraged to submit suggestions (or pull requests) that will improve the usability of the package as a whole. Package structure should follow general community best-practices. In general please consider:

Functionality

For packages co-submitting to JOSS

Note: Be sure to check this carefully, as JOSS's submission requirements and scope differ from pyOpenSci's in terms of what types of packages are accepted.

The package contains a paper.md matching JOSS's requirements with:

Final approval (post-review)

Estimated hours spent reviewing: 2hours


Review Comments

Hi team gdhelper! I have reviewed your package and I want to congratulate you on the release of this package. It is clearly that you have put a lot of effort on this. I think the package would be useful for a lot of people who are working with GDP-related data. In the next part, I would like to make some comments and I hope the comments would contribute to your team's success.

Functionality:
Documentation:

Let me know your thoughts and if anything needs clarification! Nobby Nguyen

gfairbro commented 2 years ago

Thanks so much for the review Nobby, I wanted to let you know that we fixed the example which may help you run the package; when you call the modules you either have to call it like so gdpimporter.gdpimporter(URL), or you have to import the individual functions (from gdphelper import gdpimporter.gdpimporter) .

This is owing to the fact that the files are within their own scripts rather than part of one big .py file. We are looking into how to improve this syntax.

Thanks for your other documentation recommendations, I have added the readthedocs link as you suggested.

jcasoli commented 2 years ago

Package Review

Please check off boxes as applicable, and elaborate in comments below. Your review is not limited to these topics, as described in the reviewer guide

Documentation

The package includes all the following forms of documentation:

Readme requirements The package meets the readme requirements below:

The README should include, from top to bottom:

Usability

Reviewers are encouraged to submit suggestions (or pull requests) that will improve the usability of the package as a whole. Package structure should follow general community best-practices. In general please consider:

Functionality

For packages co-submitting to JOSS

Note: Be sure to check this carefully, as JOSS's submission requirements and scope differ from pyOpenSci's in terms of what types of packages are accepted.

The package contains a paper.md matching JOSS's requirements with:

Final approval (post-review)

Estimated hours spent reviewing: 1


Review Comments

Love the package! I think you've done a great job making it easy to work with data from the open government portal. I only have a few minor ideas for improvement

  1. In the gdpplotterr() function, I think it would be really cool if you could also pass in a vector of province names as an argument. This would easily allow me to get a quick glance at the GDP's of the provinces i'm interested in. At the same time It's a bit hard to read the current graph output as there are 11 different lines that show up.

  2. Although there is a brief description in the README file, I think it would be super helpful to have an even more flushed out statement of need. ie even though it may be somewhat obvious.. what problem does your package solve? Why is it needed?

  3. Overall really nice job on the documentation! I found it super easy to install the package and get started right away using the different functions. The only thing I noticed was that maybe the ReadTheDocs website could provide some additional context over and above just how the package functions work. For example, it would be great if it provided some additional information about what kind of data the package works on and where to access the data.

gauthampughazhendhi commented 2 years ago

Package Review

Please check off boxes as applicable, and elaborate in comments below. Your review is not limited to these topics, as described in the reviewer guide

Documentation

The package includes all the following forms of documentation:

Readme requirements The package meets the readme requirements below:

The README should include, from top to bottom:

Usability

Reviewers are encouraged to submit suggestions (or pull requests) that will improve the usability of the package as a whole. Package structure should follow general community best-practices. In general please consider:

Functionality

For packages co-submitting to JOSS

Note: Be sure to check this carefully, as JOSS's submission requirements and scope differ from pyOpenSci's in terms of what types of packages are accepted.

The package contains a paper.md matching JOSS's requirements with:

Final approval (post-review)

Estimated hours spent reviewing:

1 hour 30 minutes

Review Comments

Working with government data is often challenging in terms of choosing the right dataset and pre-processing. I think you guys did a great job in building this package to analyze the GDP data.

Pros:

  1. The function APIs are simple, intuitive and it's very easy to use if someone with no previous experience using this package wants to incorporate the functionalities provided into their work. The usage section of the README.md is well-written, including a concise example of using all the functions sequentially. This will definitely help the user to get started with the package quickly.
  2. Good work in allowing the user to pick the statistics that they wish to explore with good defaults as part of the gdpdescribe() function.

Improvements:

  1. Code formatting can be improved in a lot of ways such as restricting the code and documentation lines to 79 characters as per PEP standard, including two line spaces between imports and the functions.
  2. Though gdpdescribe() function allows the user to pick the stats, it calculates all the stats irrespective of what the user wanted and then filters the results. This can be further improved by calculating only the stats that the user wanted.
  3. The example notebook in the readthedocs website shows only how to check the version number. It would be great if you could add some code examples of your package there.
rrrohit1 commented 2 years ago

Package Review

Please check off boxes as applicable, and elaborate in comments below. Your review is not limited to these topics, as described in the reviewer guide

Documentation

The package includes all the following forms of documentation:

Readme requirements The package meets the readme requirements below:

The README should include, from top to bottom:

Usability

Reviewers are encouraged to submit suggestions (or pull requests) that will improve the usability of the package as a whole. Package structure should follow general community best-practices. In general please consider:

Functionality

For packages co-submitting to JOSS

Note: Be sure to check this carefully, as JOSS's submission requirements and scope differ from pyOpenSci's in terms of what types of packages are accepted.

The package contains a paper.md matching JOSS's requirements with:

Final approval (post-review)

Estimated hours spent reviewing: 1 hour

Review Comments

Great job guys! The package's theme of GDP is relevant and would be useful for users interested in this domain. The package installs smoothly in my local MAC machine.