datacarpentry / genomics-r-intro

Intro to R and RStudio for Genomics
https://datacarpentry.org/genomics-r-intro/
Other
28 stars 89 forks source link

Why cloud R? #105

Closed NadineBestard closed 10 months ago

NadineBestard commented 3 years ago

The content of this lesson is an abreviate version of R for Reproducible scientific Analysis with a focus in genomics. However, it is very dependent on the shell lessons intro to understand the data and the AWS setup.

In many other Carpentries lessons there is a big focus in getting students ready to go on their own computers. Recently a carpentries-discuss highlited this as an argument to keep having students install the programs/packages needed instead of using docker images or similar (that could make teaching easier but is less beneficial for the learners).

I had a close look at this lesson and I do not see anything that especially requires big computational steps (unlike genomics-shell). Why not teaching this as R for Reproducible scientific Analysis?

With some modifications this lesson could even be stand alone. Just teaching R, for people that would like to learn R but is already familiar with the command line and cloud computing.

I understand this might suppose a big change, at least a first step would add some sections to make it easier to separate from the rest of the Genomics lessons:

naupaka commented 2 years ago

@NadineBestard the vision is to have this be at the end of the shell-genomics lessons in a sequence, so the data output from those lessons are the inputs to these (since this is a Data Carpentry lesson vs a Software Carpentry lesson, there is more emphasis on integration vs modularity of content). Hence, the use of RStudio Server instead of local R. In a real genomics processing workflow where datasets are larger than the ones used here, there might be a good reason to do the R stuff on a remote server as well.

naupaka commented 10 months ago

Thank you for the feedback. The goal of this set of lessons (Data Carpentry Genomics) is specifically to train folks to work on cloud servers. I would direct people who want to have materials focused on doing things locally to the other R lessons hosted by Software Carpentry. I am going to close this since unfortunately we are not planning to change the whole scope for this material.