ropensci / refsplitr

R package for processing, organizing, and visualizing reference records downloaded from the Web of Science.
https://docs.ropensci.org/refsplitr
Other
55 stars 6 forks source link

Vignette tweaks #70

Closed njahn82 closed 5 years ago

njahn82 commented 5 years ago

Please find some suggestions to improve the vignette made during the rOpenSci software review. See the commit messages for more details.

bmkramer commented 5 years ago

Hi all,

Wow, following this process has shown me a lot about the large effort that goes into creating a full package, including making sure it builds right. Really impressed by all the work and commitment!

Going through the package's workflow from a user perspective, I noticed that there a still a few inconsistencies/omissions in the description of the functions in the vignette, mostly to do with the arguments listed and the code examples given. I've listed these below.

I was able to run all functionality with the test dataset (example_data.txt), including all visualizations. When I tested with a user-generated dataset from Web of Science (.txt file, 490 records), I encountered two instances that gave error messages:

  1. In the step 'splitting author records' in authors_clean, the following error message was encountered at 48% progress and execution was halted:

_Error in data.frame(names = unique(unlist(strsplit(C1names, "; "))), : row names contain missing values

When using only a subset of the file (1st 100 records), this error did not occur.

  1. Running plot_net_country() gave the following error: Error in Ops.data.frame(fromC, toC) : ‘+’ only defined for equally-sized data frames

Let me know if you want to have those files for testing (can't link them here b/c of WoS restrictions)

Below my comments on my observations on the vignette for the different functions in the package:

2.1 references_read()

Appendix 1

2.2. authors_clean()

2.2.1-2.2.3 authors_refine()

2.4.1 plot_addresses_country()

2.4.2 plot_addresses_points()

2.4.3 plot_net_coauthor()

2.4.5 plot_net_address()

Hope this is still helpful!