Anyone who wants to explore their data in R with the tidyverse. This
includes all analysts at 2DII and beyond.
This chapter will show you how to use visualisation and transformation
to explore your data in a systematic way, a task that statisticians
call exploratory data analysis, or EDA for short. EDA is an iterative
cycle. You:
Generate questions about your data.
Search for answers by visualising, transforming, and modelling
your data.
Use what you learn to refine your questions and/or generate new
questions.
Why is this important?
EDA is an important part of any data analysis, even if the questions
are handed to you on a platter, because you always need to investigate
the quality of your data. Data cleaning is just one application of EDA:
you ask questions about whether your data meets your expectations or
not. To do data cleaning, you'll need to deploy all the tools of EDA:
visualisation, transformation, and modelling.
Who is the audience?
Anyone who wants to explore their data in R with the tidyverse. This includes all analysts at 2DII and beyond.
Why is this important?
What should be covered?
Resources
Checklist
2h before
[x] Open meetup issue, paste this template and edit as necessary
[x] Announce on #coding
[x] Add meetup date, title and link to the invitation on 2dii's calendar
10' before
[x] Announce on #coding:
The ds-incubator starts in 10'; please join early so we start on time. The meeting room is already open at:
<LINK WITH PASSWORD>
Start