csch0lz / UsingRTutorials2425

Tutorials for the Using R course at UvA
MIT License
0 stars 0 forks source link

Session 4: check overview slide #7

Closed csch0lz closed 2 months ago

csch0lz commented 4 months ago

Check slide "Overview tidying steps"

xgaouva commented 3 months ago

My comments in (Italic):

Overview Data Tidying Steps

This concludes data tidying.

csch0lz commented 2 months ago

Thanks @xgaouva ! I made some adjustments. Couple of responses to your comments:

  1. They should check primary keys before doing anything to the dataset. That is part of determining what the structure of their dataset is. If they don't understand exactly what is in their data and how its structure, they can't tidy it effectively.
  2. Checking for duplicate rows is part of checking for a primary key (i.e. there is not primary key if there are duplicates).
  3. Joining is not part of data tidying. Data tidying ends with separate dibbles for different types of data. At the end of tidying, they should be able to describe how different data sources CAN be joined, but the actual joining would be part of a specific analysis, not the data tidying process.