jdblischak / ctv-project-workflows

R Task View: Project Workflows
https://jdblischak.github.io/ctv-project-workflows/
Creative Commons Attribution 4.0 International

Initial comments: Additional topics #2

Closed by annakrystalli 4 years ago

annakrystalli commented 5 years ago

Just throwing a few topics out for discussion.

The frameworks effectively offer higher-level approaches to some of the elements mentioned above. I guess the question is how much of the task view, if any, should address those elements directly?

jdblischak commented 5 years ago

@annakrystalli Thanks for the suggestions!

  • dependency management? (i.e., should we include things like packrat?)

I agree this is an important topic, and I had originally included it in my initial draft. But because it is such a big topic in its own right, and @nuest has already started a Task View dedicated to this topic (Computational Environments and Reproducibility), I decided to provide a link to his Task View instead (commit).

  • data management: data being an important part of project management?

Definitely. Could you suggest some example packages to consider?

  • documentation: I know many of the framework packages address workflow documentation, but I'd argue management of functions can be an important part. So things like roxygen2, pkgdown

For these packages we'll need to make a decision on the scope of the Task View. To me, these are all packages that assist with R package development. My initial reaction is to not include these for the following two reasons:

  1. Package development packages (devtools, roxygen2, testthat, pkgdown, etc.) are already widely known and well documented (e.g. the r-pkgs book). They are also well supported in RStudio. So basically, is it worth the effort of expanding the scope of the Task View to include them?
  2. A data analysis project may or may not include creating an R package. Many of the frameworks in this Task View are essentially trying to give a typical data analysis project the structure and tooling that an R package already has. In essence, I see a fundamental difference between writing code to analyze data and writing a reusable toolbox of functions (though of course the lines can get blurred).

Also see here for my previous thoughts on package development packages: https://github.com/jdblischak/r-project-workflows/issues/2#issuecomment-422080619

  • maybe even rmarkdown in its own right, could be included?

Literate programming is already covered in the existing Task View ReproducibleResearch, which is included in the section Related Links.

  • archiving?

Definitely. Do you mean packages like archivist? Maybe this could be combined with data management?

jdblischak commented 5 years ago

@benmarwick @nuest What are your thoughts on potentially including software development packages or the other suggested categories?

jdblischak commented 4 years ago

Since our task view was merged with the ReproducibleResearch Task View, the workflow packages are now included with packages on dependency management and archiving, so that worked out well.