mauro3 / CORDS

Other
4 stars 3 forks source link

Workshops and materials on reproducible research and geodata processing

This repository holds the teaching material for two workshops we held at WSL in June 2024 (maybe there will be future editions too). They were conducted in the framework of the CORDS project, funded by WSL.

The provided study material is complete, thus enabling the self-study of these two courses.

image

Figure: example of a proposed main-script to reproduce all research outputs (left), example map incorporating geodata processing of both raster and vector data (right)

The two workshops were:

We also planned to run a workshop on GPU computing but in the end we didn't. However, if interested we refer you to other workshops and resources of ours (by Ludovic Räss @luraess & Ivan Utkin @utkinis), on that topic[^gpu-resources].

The resources here in this Github repository are intended to be sufficient to self-study the material. Indeed, we intend to let our future PhD students work through them when they start. The material consists of:

How to self study

There are (at least) two ways to self study this material:

  1. detailed study: watching the lectures and reading the material and working through the hands-on exercises
  2. cursory study: looking through the lecture slides and the solutions to the hands-on exercises

The Reproducible Research material will take about 1.5 to 3 days using approach (1) or about 1/2 day using (2).

The Geodata Processing material will take about 1/2 to 1 day using approach (1) or about 3 hours using (2)

Reproducible Research Course

This workshop/material is intended for students and scientists handling data and/or running simulations aiming to make their programming and data workflow reproducible.

Contents:

Get started

Note that the taught approach deliberately favors a straightforward, minimalistic and low-tech strategy for achieving reproducible research. It is based on only essential tools: the programming language, a dependency management tool, and version control with Git. We feel that once the concepts are mastered and if the researcher's work warrants it (very large or very many data dependencies, very many processing steps), then more advanced and integrated tools are easily learned.

Geodata Processing in Python Course

This workshop/material is intended for scientists and students wanting to move their interactive GIS work to fully scripted workflows.

Contents:

Get started

[^gpu-resources]: Some resources of ours on GPU computing: our master level course on solving PDEs on GPUs at ETH "Solving PDEs on GPUs"; talk and workshop about GPU, Autodiff and inversions (talk, workshop).