alan-turing-institute / rds-course

Materials for Turing's Research Data Science course
https://alan-turing-institute.github.io/rds-course/

Meta-issue: Module 1 (taught) #25

Closed by gmingas 2 years ago

gmingas commented 3 years ago

Outline of Module 1 (taught material):

This is a preliminary high-level outline of what module 1 (intro to research data science) will look like. Each section will have its own issue with more details and a development timeline.

1. What is data science?

2. Project life cycle

Basic stages in a data science project and common hurdles in each stage. This lesson will contain multiple examples of real-world project situations to demonstrate common issues and ways to address them. It will focus on scoping, especially how a question can be translated into a technical task, the data scientist's role in that translation, and how to tackle ambiguity. Some of the material will overlap with the Turing Commons lifecycle material.

3. Intro to EDI for data science

This module

4. Collaboration and reproducibility

How to work collaboratively in data science projects, and the principles of reproducibility, partially using material from The Turing Way.

Resources

Tools

Useful books/references:

Connection to the hands-on session

In the hands-on session, we will apply some of the principles learnt to scope a research project, including interrogating its purpose, methodology, data, and EDI questions.

Duration of the module

4 hours, including two 10-minute breaks and one 30-minute break

Schedule:

Time to write this module