alan-turing-institute / rds-course

Materials for Turing's Research Data Science course
https://alan-turing-institute.github.io/rds-course/
31 stars 13 forks source link

Module 2 Delivery (2023) #162

Open jack89roberts opened 1 year ago

jack89roberts commented 1 year ago

Taught Material

Rough timings:

Section  Title  Start Time notes
Overview  13:05  
2.1.1 Where to find data
2.1.2 Legality and Ethics
2.1.3 Pandas intro 13:26 as in 2021 this is a rapid switch from legal/ethics issues to technical, could do with a pause/breather/smoother transition somehow
SHORT BREAK 13:48 (after ~10 mins of questions)  
2.1.4 Data sources & formats 14:00
2.1.5 Controlling access  
2.2.1 Data consistency up to null values
LONG BREAK 14:53 (after ~10 mins of questions)
2.2.1 Data consistency 15:15 from null values
2.2.2 Modifying columns & indices  
2.2.3 Feature engineering rushed through (from binning onwards
2.2.4.1 Time & Date rushed through 
2.2.4.2 Text Data rushed through
SHORT BREAK 16:02  
2.2.4.3 Categorical Data 16:15  
2.2.4.4 Image Data  
2.2.5 Privacy & Anonymisation  
2.2.6 Linking Datasets  
2.2.7 Missing Data  
  Wrap-up (final Qs, pre-reqs for hands-on)  
  End 17:02 (after ~10 mins of questions)  

Hands-on