posit-conf-2023 / dataviz-ggplot2

1-day workshop on advanced ggplot2 for posit::conf(2023) by Cédric Scherer
https://posit-conf-2023.github.io/dataviz-ggplot2/
Other
43 stars 26 forks source link
data-visualization ggplot2 posit positconf positconf2023 r rstats visualization workshop

Engaging and Beautiful Data Visualizations with ggplot2

posit::conf 2023

by Cédric Scherer


:spiral_calendar:  September 18, 2023
:alarm_clock:  09:00 - 17:00
:hotel:  Acapulco
:link:  posit-conf-2023.github.io/dataviz-ggplot2


Overview

Description

Creating effective and easily accessible data visualizations of high quality in an efficient and preferably reproducible way is an essential skill for everyone working in a data-related field. Luckily, by leveraging the functionality of ggplot2, the most famous package for data visualization with R, and related extension packages one can create highly customized data visualization without the need for post-processing.

This workshop provides everything one needs to know to create and customize numerous chart types with ggplot2. Participants will learn the most important steps and helpful tips to create visually appealing and informative graphics with a code-only approach. The power of ggplot2 and related extension packages will be illustrated with advanced real–life examples that help to understand useful coding tricks and the process of creating engaging and effective visualizations. The workshop will particularly focus on more advanced tasks with ggplot2 such as styling labels and titles, customizing themes and visual aesthetics, and using less-common chart types.

Audience

This course is for you if you:

Schedule

Time Activity
09:00 - 10:30 Fundamentals & Workflows
10:30 - 11:00 Coffee break
11:00 - 12:30 Working with Text
12:30 - 13:30 Lunch break
13:30 - 15:00 Working with Colors
15:00 - 15:30 Coffee break
15:30 - 17:00 Working with Themes

Prework

As this workshop focuses on (more) advanced topics, a solid understanding of the fundamentals of ggplot2 and also data preparation in R is helpful. The concepts of how to initialize a ggplot, map variables to aesthetics, and how aesthetics relate to scales should be familiar to you. If you want to refresh your knowledge, I suggest going through the first two chapters of the ggplot2 book by Hadley Wickham et al. (introduction and getting started). The “R for Data Science” book also offers a good overview of ggplot2 as well as on data wrangling with the tidyverse.

You will need to bring your own laptop to the workshop with a recent version of R (4.3.1 or newer) and RStudio (2023.06.1 or newer) as well as a bunch of R packages and a few typefaces installed.

Install R and Rstudio:

Download the most recent version of R (cloud.r-project.org, at least version 4.2) and Rstudio (rstudio.com/products/rstudio/download, at least version 2023.06.1) and follow the installation steps.

Install R Libraries

You can install all required R packages at once by running the following code in the R command line:

pkgs <- c("ggplot2", "dplyr", "tibble", "tidyr", "readr", "forcats", "stringr", "lubridate", "here", "scales", "ragg", "systemfonts", "rcartocolor", "scico", "prismatic", "patchwork", "ggtext", "ggforce", "ggrepel")
unavailable <- setdiff(pkgs, rownames(installed.packages()))
install.packages(unavailable)

To run the code, open up Rstudio. Copy–paste the code in the console pane (by default in the lower left) and hit enter. Several messages should pop up. Scan these messages that are returned for errors and troubleshoot them if necessary. Warnings and other informational messages can be ignored.

A few packages are not strictly needed but are used in some of the course material. If you wish to install all packages, run the following code as well:

pkgs_opt <- c("viridis", "RColorBrewer", "MetBrewer", "ggthemes", "ggsci", "camcorder", "colorspace", "remotes")
unavailable <- setdiff(pkgs_opt, rownames(installed.packages()))
install.packages(unavailable)
remotes::install_github("clauswilke/colorblindr")

If, for some reason, you cannot install these packages or the latest versions of R and RStudio, we will have a Posit Cloud workspace set up that can be used instead. But you will still need to bring your laptop.

Install Typefaces

We will make use of custom fonts during the course. If you want to follow all contents locally, make sure the following typefaces are installed:

You can download all typefaces as a single zip file here.

For Mac OS Users

If you want to save your visualization to PDF, please make sure that XQuartz is installed which is needed to use the cairo pdf device.

Course Communication

We will be using Discord as our main communication method during the workshop. To make the process go smoothly:

Closer to the start of the conference, you will be invited to join the posit::conf Discord server. Once you’ve accepted the invite, we will add you to the channel(s) for the workshop.

Instructor

Dr Cédric Scherer is a data visualization designer, consultant, and instructor helping clients and workshop participants to create engaging and effective graphics. As a graduated ecologist, he has acquired extensive hypothesis–driven research experience and problem–solving expertise in data wrangling, statistical analysis, and model development. As an independent data visualization designer, Cédric later combined his expertise in analyzing large data sets with his passion for design, colors, and typefaces. Cédric has designed graphics across all disciplines, purposes, and styles applying a code–first approach, and regularly talks about data visualization design and ggplot2 techniques. Due to regular participation in social data challenges such as #TidyTuesday, he is now well known for complex and visually appealing figures, entirely made with ggplot2, that look as if they have been created with a vector design tool.

BlogEmailTwitterInstagramBehanceLinkedIn   Buy Me A Coffee


This work is licensed under a Creative Commons Attribution 4.0 International License.