openjournals / joss-reviews

Reviews for the Journal of Open Source Software
Creative Commons Zero v1.0 Universal
725 stars 38 forks source link

[REVIEW]: rdataretriever: An R package for downloading, cleaning, and installing publicly available datasets #2800

Closed whedon closed 3 years ago

whedon commented 4 years ago

Submitting author: @henrykironde (henry senyondo) Repository: https://github.com/ropensci/rdataretriever Version: v3.0.1 Editor: @fboehm Reviewer: @rmhogervorst, @jsgalan Archive: 10.5281/zenodo.4314115

:warning: JOSS reduced service mode :warning:

Due to the challenges of the COVID-19 pandemic, JOSS is currently operating in a "reduced service mode". You can read more about what that means in our blog post.

Status

status

Status badge code:

HTML: <a href="https://joss.theoj.org/papers/1a121f31d1ef48bcef1c971539888ea0"><img src="https://joss.theoj.org/papers/1a121f31d1ef48bcef1c971539888ea0/status.svg"></a>
Markdown: [![status](https://joss.theoj.org/papers/1a121f31d1ef48bcef1c971539888ea0/status.svg)](https://joss.theoj.org/papers/1a121f31d1ef48bcef1c971539888ea0)

Reviewers and authors:

Please avoid lengthy details of difficulties in the review thread. Instead, please create a new issue in the target repository and link to those issues (especially acceptance-blockers) by leaving comments in the review thread below. (For completists: if the target issue tracker is also on GitHub, linking the review thread in the issue or vice versa will create corresponding breadcrumb trails in the link target.)

Reviewer instructions & questions

@rmhogervorst & @jsgalan, please carry out your review in this issue by updating the checklist below. If you cannot edit the checklist please:

  1. Make sure you're logged in to your GitHub account
  2. Be sure to accept the invite at this URL: https://github.com/openjournals/joss-reviews/invitations

The reviewer guidelines are available here: https://joss.readthedocs.io/en/latest/reviewer_guidelines.html. Any questions/concerns please let @fboehm know.

Please start on your review when you are able, and be sure to complete your review in the next six weeks, at the very latest

Review checklist for @rmhogervorst

Conflict of interest

Code of Conduct

General checks

Functionality

Documentation

Software paper

Review checklist for @jsgalan

Conflict of interest

Code of Conduct

General checks

Functionality

Documentation

Software paper

whedon commented 4 years ago

Hello human, I'm @whedon, a robot that can help you with some common editorial tasks. @rmhogervorst, @jsgalan it looks like you're currently assigned to review this paper :tada:.

:warning: JOSS reduced service mode :warning:

Due to the challenges of the COVID-19 pandemic, JOSS is currently operating in a "reduced service mode". You can read more about what that means in our blog post.

:star: Important :star:

If you haven't already, you should seriously consider unsubscribing from GitHub notifications for this (https://github.com/openjournals/joss-reviews) repository. As a reviewer, you're probably currently watching this repository which means for GitHub's default behaviour you will receive notifications (emails) for all reviews 😿

To fix this do the following two things:

  1. Set yourself as 'Not watching' https://github.com/openjournals/joss-reviews:

watching

  1. You may also like to change your default settings for this watching repositories in your GitHub profile here: https://github.com/settings/notifications

notifications

For a list of things I can do to help you, just type:

@whedon commands

For example, to regenerate the paper pdf after making changes in the paper's md or bib files, type:

@whedon generate pdf
whedon commented 4 years ago
Reference check summary (note 'MISSING' DOIs are suggestions that need verification):

OK DOIs

- 10.1371/journal.pone.0065848 is OK

MISSING DOIs

- None

INVALID DOIs

- None
whedon commented 4 years ago

:point_right::page_facing_up: Download article proof :page_facing_up: View article proof on GitHub :page_facing_up: :point_left:

fboehm commented 4 years ago

@jsgalan and @RMHogervorst - please check boxes above as you complete your review. For any boxes that you can't check right now, due to needed edits or fixes, please discuss the issues here or open issues in the submission repository. And please let me know if you have any questions. Thank you again!

RMHogervorst commented 4 years ago

@henrykironde some notes about the JOSS paper:

The description in Statement of need is really beautiful and only after reading that part I finally understood what the problem is that this package is solving. I really really wish you'd include something like that in the Readme (and maybe Description) too.

REQUIRED:

OPTIONAL / SUGGESTIONS:

RMHogervorst commented 4 years ago

Required Issues opened:

Optional issues opened:

whedon commented 4 years ago

:wave: @rmhogervorst, please update us on how your review is going.

whedon commented 4 years ago

:wave: @jsgalan, please update us on how your review is going.

fboehm commented 4 years ago

@jsgalan - I hope that the review is going well. As you work through the checklist above, please check the boxes to reflect your approval of the checklist items. Please let me know if there's something that I might do to assist you. Thanks again!

henrykironde commented 4 years ago

Just an update, @RMHogervorst made some great suggestions and I am working to resolve those. Thanks to the issues btw, I guess @jsgalan is inline with the issues raised.

fboehm commented 4 years ago

Thank you for the update, @henrykironde ! Please let me know how the resolution of issues goes, and don't hesitate to contact me when I might help with something.

RMHogervorst commented 4 years ago

Just an update: I feel @henrykironde and others are doing great work. I really look forward to the finished version!

henrykironde commented 4 years ago

Thanks folks. We have covered all the issues, I am happy to get more recommendations to consider.

jsgalan commented 4 years ago

Hi all, is there any confirmation I might be missing?

I am unable to check the ticks for the revision (Just checked in two distinct browsers to discard a software issue).

Thanks

Ps. has the invitation link expired? Attn: @fboehm

jsgalan commented 4 years ago

Hi all,

No problem in the Conda installation

Screen Shot 2020-11-21 at 10 37 20 AM

But a problem was found in the R installation.

Screen Shot 2020-11-21 at 10 43 18 AM

Did not found any description on the repository, are there any set requirements for the R installation?

Best

EDIT: I installed version 3.6.3 and everything installed correctly.

jsgalan commented 4 years ago

Hi all, It seems I was not in a top level repository.

Screen Shot 2020-11-21 at 11 12 33 AM

All datasets are visible from Python

Screen Shot 2020-11-21 at 11 13 19 AM

Had a bit of trouble to finally set up the Reticulate package but everything was set at the end (Extra information that helped https://docs.ropensci.org/rdataretriever/ , https://rstudio.github.io/reticulate/articles/versions.html and https://stackoverflow.com/questions/40788645/how-to-create-renviron-file)

Screen Shot 2020-11-21 at 12 32 46 PM

All the tabular datasets and updates were collected correctly

Screen Shot 2020-11-21 at 12 38 39 PM

Screen Shot 2020-11-21 at 12 38 48 PM

jsgalan commented 4 years ago

Hi all,

I had problems installing the spatial datasets

Screen Shot 2020-11-21 at 1 19 29 PM

Even though I have everything set up in Postgres

Screen Shot 2020-11-21 at 1 16 41 PM

Any ideas?

Best

henrykironde commented 4 years ago

Thanks @jsgalan for testing this. Make sure you are creating the extensions in the same database that you want to install the data. The default is postgres. In case you, are using a different database, please try to provide the database name in the function call. Let me know if that works

jsgalan commented 4 years ago

@whedon assign @jsgalan

whedon commented 4 years ago

I'm sorry human, I don't understand that. You can see what commands I support by typing:

@whedon commands
jsgalan commented 4 years ago

@whedon commands

whedon commented 4 years ago

Here are some things you can ask me to do:

# List Whedon's capabilities
@whedon commands

# List of editor GitHub usernames
@whedon list editors

# List of reviewers together with programming language preferences and domain expertise
@whedon list reviewers

EDITORIAL TASKS

# Compile the paper
@whedon generate pdf

# Compile the paper from alternative branch
@whedon generate pdf from branch custom-branch-name

# Ask Whedon to check the references for missing DOIs
@whedon check references

# Ask Whedon to check repository statistics for the submitted software
@whedon check repository
jsgalan commented 4 years ago

@whedon list reviewers

whedon commented 4 years ago

Here's the current list of reviewers: https://bit.ly/joss-reviewers

jsgalan commented 4 years ago

Thanks @jsgalan for testing this. Make sure you are creating the extensions in the same database that you want to install the data. The default is postgres. In case you, are using a different database, please try to provide the database name in the function call. Let me know if that works

Hi all, the extensions are created in all databases. Now I got a problem with finding shp2pgsql

Screen Shot 2020-11-22 at 12 01 13 PM

even thought the bin is listed in the $PATH

Screen Shot 2020-11-22 at 12 03 21 PM

Plus the binaries are accesible throughout the command line

Screen Shot 2020-11-22 at 12 11 46 PM

Any way this can be overridden?

jsgalan commented 4 years ago

@fboehm I am unable to click any of the ticks, and I somehow managed to exclude myself from the reviewers :(

Any way this could change?

jsgalan commented 4 years ago

Hi @henrykironde

Why are there no references in the article proof?

Screen Shot 2020-11-22 at 12 05 45 PM

Am I reading the latest version?

Best

henrykironde commented 4 years ago

even thought the bin is listed in the $PATH

Can you try to run psql and or shp2pgsql from command line

About the citation, I will update that to render correctly

jsgalan commented 4 years ago

even thought the bin is listed in the $PATH

Can you try to run psql and or shp2pgsql from command line

They are accesible from the command line

Screen Shot 2020-11-22 at 12 11 46 PM
fboehm commented 4 years ago

@fboehm I am unable to click any of the ticks, and I somehow managed to exclude myself from the reviewers :(

Any way this could change?

Yes, Let me fix that @jsgalan.

fboehm commented 4 years ago

@whedon reinvite @jsgalan as reviewer

whedon commented 4 years ago

I'm sorry human, I don't understand that. You can see what commands I support by typing:

@whedon commands
fboehm commented 4 years ago

@whedon commands

whedon commented 4 years ago

Here are some things you can ask me to do:

# List all of Whedon's capabilities
@whedon commands

# Assign a GitHub user as the sole reviewer of this submission
@whedon assign @username as reviewer

# Add a GitHub user to the reviewers of this submission
@whedon add @username as reviewer

# Re-invite a reviewer (if they can't update checklists)
@whedon re-invite @username as reviewer

# Remove a GitHub user from the reviewers of this submission
@whedon remove @username as reviewer

# List of editor GitHub usernames
@whedon list editors

# List of reviewers together with programming language preferences and domain expertise
@whedon list reviewers

# Change editorial assignment
@whedon assign @username as editor

# Set the software archive DOI at the top of the issue e.g.
@whedon set 10.0000/zenodo.00000 as archive

# Set the software version at the top of the issue e.g.
@whedon set v1.0.1 as version

# Open the review issue
@whedon start review

EDITORIAL TASKS

# All commands can be run on a non-default branch, to do this pass a custom 
# branch name by following the command with `from branch custom-branch-name`.
# For example:

# Compile the paper
@whedon generate pdf

# Compile the paper from alternative branch
@whedon generate pdf from branch custom-branch-name

# Remind an author or reviewer to return to a review after a
# certain period of time (supported units days and weeks)
@whedon remind @reviewer in 2 weeks

# Ask Whedon to do a dry run of accepting the paper and depositing with Crossref
@whedon accept

# Ask Whedon to check the references for missing DOIs
@whedon check references

# Ask Whedon to check repository statistics for the submitted software
@whedon check repository

EiC TASKS

# Invite an editor to edit a submission (sending them an email)
@whedon invite @editor as editor

# Reject a paper
@whedon reject

# Withdraw a paper
@whedon withdraw

# Ask Whedon to actually accept the paper and deposit with Crossref
@whedon accept deposit=true
fboehm commented 4 years ago

@whedon re-invite @jsgalan as reviewer

whedon commented 4 years ago

OK, the reviewer has been re-invited.

@jsgalan please accept the invite by clicking this link: https://github.com/openjournals/joss-reviews/invitations

fboehm commented 4 years ago

@jsgalan - sorry for the difficulty. Please click on the link in the previous comment to confirm your addition as a reviewer. After doing that, you should be able to check the boxes for the review, once you're comfortable doing so.

jsgalan commented 4 years ago

@jsgalan - sorry for the difficulty. Please click on the link in the previous comment to confirm your addition as a reviewer. After doing that, you should be able to check the boxes for the review, once you're comfortable doing so.

Thanks @fboehm we will be done with the revision soon

henrykironde commented 4 years ago

@jsgalan, I see you are using a Conda environment, make sure that R has access to the same Python paths as well. Usually, changing to a virtual environment may change the paths. R may be communicating with another python rather that where the programs are installed.

Check the paths in R

library(reticulate)
sys <- import("sys")
sys$path
#Or  
reticulate::py_config()

Compare to the ones when python is used from the terminal

import sys
sys.path
henrykironde commented 4 years ago

@whedon generate pdf

whedon commented 4 years ago

:point_right::page_facing_up: Download article proof :page_facing_up: View article proof on GitHub :page_facing_up: :point_left:

jsgalan commented 4 years ago

@jsgalan, I see you are using a Conda environment, make sure that R has access to the same Python paths as well. Usually, changing to a virtual environment may change the paths. R may be communicating with another python rather that where the programs are installed.

Check the paths in R

library(reticulate)
sys <- import("sys")
sys$path
#Or  
reticulate::py_config()

Compare to the ones when python is used from the terminal

import sys
sys.path

Hi that is pointing to the same place

Screen Shot 2020-11-23 at 9 07 11 AM

and

Screen Shot 2020-11-23 at 9 07 32 AM
henrykironde commented 4 years ago

reticulate::py_config() can we try this too

jsgalan commented 4 years ago

reticulate::py_config() can we try this too

Screen Shot 2020-11-23 at 9 21 05 AM
henrykironde commented 4 years ago

Could you please try this in the terminal

import subprocess
subprocess.call('shp2pgsql -d ', shell=True)

This must be a path problem but it is hard for me to replicate on my side. Could you try running it on another machine or in a virtual environment.

jsgalan commented 4 years ago

Could you please try this in the terminal

import subprocess
subprocess.call('shp2pgsql -d ', shell=True)

This must be a path problem but it is hard for me to replicate on my side. Could you try running it on another machine or in a virtual environment.

Hi,

Screen Shot 2020-11-24 at 12 04 08 PM

I think it is doing exactly what I am telling it to do .. 👍

I am finding a way to change it from /sh to /bash where everything is set up. That will probably solve it

Best

RMHogervorst commented 4 years ago

@fboehm All my concerns where addressed and the article proof seems in order too (I would advise another read through before publishing, because I think some words are missing, but I'm no native speaker). For me the review is done. It looks great, and you have my approval.

Anything else I need to do?

fboehm commented 4 years ago

@RMHogervorst - Thank you so much for your work on this review! There is nothing left for you to do. I'll proofread the article before publishing.

fboehm commented 4 years ago

@jsgalan - Thanks for your work on this review! Do you have any remaining concerns? or have they all been resolved satisfactorily?

fboehm commented 4 years ago

@whedon generate pdf