ropensci / refsplitr

R package for processing, organizing, and visualizing reference records downloaded from the Web of Science.
https://docs.ropensci.org/refsplitr
Other
55 stars 6 forks source link

ROpenSci Review - Documentation #61

Closed aurielfournier closed 5 years ago

aurielfournier commented 5 years ago

After downloading the data, my first attempts loading the file into R failed:

library(refnet)
my_data <- references_read(data = "wos_ropensci.txt")

## Error in references_read(data = "wos_ropensci.txt"): ERROR:  The specified file or directory does not contain any 
##          Web of Knowledge or ISI Export Format records!

It took me a while (and many manual downloads from the WoS) to realize, that the param dir needs to be set to FALSE when I want to load just one file.

my_data <- references_read(data = "wos_ropensci.txt", dir = FALSE)

I feel that the average R user is not as patient when appropriate starting instructions are missing. My main request as reviewer would be therefore to improve high-level documentation, as well as to provide a sample dataset to play with.

I suggest expanding the README and to present an overview and some details in a refnet-package.Rd file, which is currently missing, so that users can type ?refnet-package for help.

Runnable documentation

Although the long-form documentation nicely explains the motivation and the workflow, it seems that the vignette does not process code chunks with functions from the package. I would suggest to add executable examples to successfully demonstrate to the users what can be done with the package. It would also be helpful to include an Rmarkdown file used to generate README.md with at least one runnable example.

In the Documentation, the brief "About" refers to Thomson Reuters as company behind the Web of Science. Ownership changed recently to Clarivate Analytics.

reviewer 2

-> address_column has value -> retry limit not discussed

In documentation: not clear when which geocoding application is used when (sequentially?).
http://www.datasciencetoolkit.org/ and/or
https://developers.google.com/maps/documentation/.

In documentation, it is stated 'an output/file of references that refnet was unable to georeference, which the user can review, manually correct, and import back into the file of georeferenced author locations
-> file seems to contain all lines (with and without lat/long resolved)
-> unclear how 'import back into file' should be performed