Junkdnalab / acg_2023

Resources for Applied Computational Genomics course at Cedars-Sinai (2023)
Creative Commons Attribution Share Alike 4.0 International
0 stars 3 forks source link

Intro to Applied Computational Genomics

Note: This Syllabus is subject to change.

Course Abstract

The goal of this course is to initiate wetlab biologists on the Ph.D. track to some basic skills in bioinformatics and computational genomics, and acquire basic computer literacy needed to perform analyses of next generation sequencing experiments. We will attend fundamental concepts in the analysis and interpretation of genomics data with a focus on single-cell and microbiome technologies, and explore the structure of key data types and their associated file formats. Twelve lectures is far to short a course to be considered comprehensive. Nonetheless we strive to empower students with basic skills to advocate for and analyze their own data.

Course Format

The course takes place over 12 weeks with 1.5 hour lectures on Fridays 10:30 AM - 12:00 PM. Most lectures are accompanied by either readings or some light homework or both. Any assignments must be completed on time for full credit, no exceptions. There is no final exam.

Expectations: Attendance, Homework & Grading

Late assignments will be given a maximum of 50% of full credit up to one lecture after the original due date. When homework is not assigned, points will be given for attendance. Readings are indicated on the syllabus. Each question set is worth 10 points and will be graded based on completeness. Links to lecture slides and readings will be provided in the syllabi below.

Attendance is required no exceptions. You may obtain permission ahead of time or with extenuating circumstances after the fact from the graduate school (email to Emma Yates Kassler). Homework assignments are required on time (see schedule below) regardless of attendance or for half credit one lecture late. Unexcused absences result in an incomplete grade.

After each lecture, a link will be posted to provide feedback. Attendance will be recorded by the completion of the "5 minute feedback". There will be three questions, with space for brief answers:

Frequently asked questions or topics of confusion will be addressed in the subsequent lecture.

Lecture & assignment schedule:


May 19th History of Bioinformatics (Hazelett)

Lecture topics:

Lecture slides:

Lecture 01

readings:

feedback:

5 minute feedback


May 26th Basic Unix Skills (Hazelett)

Lecture topics:

Lecture slides:

Lecture 02 - starts slide 36

feedback:

5 minute feedback

Homework Assignment 1:

Due June 2nd


June 2nd Version control with Git (Hazelett)

Lecture topics:

Lecture slides:

Lecture 3

Readings:

Resources:

Coursera

feedback:

5 minute feedback

Homework Assignment 2:

Due June 9th


June 9th Principle components analysis, heatmaps and clusterProfiler (Hazelett)

Lecture topics:

Lecture slides:

Lecture 4

Readings:

Principal components analysis primer (2022)

the dendsort paper

Resources:

PCA explained

Step-by-step PCA (same author)

UMAP explained

feedback:

5 min feedback

Homework Assignment 3:

Due June 16th


June 16th Genomics File Formats (Coetzee)

Lecture topics:

Lecture slides:

Lecture 04

feedback:

5 min feedback

Homework Assignment 4:

Due June 23rd


June 23rd Working with data: Searching NGS File Formats and Using Genome Arithmetic (Coetzee)

Lecture topics:

Lecture slides:

Lecture 05

Lecture 05 markdown

feedback

5 min feedback

Homework Assignment 5:

Due June 30th


June 30th Basics of Alignment and Quality Control (Coetzee)

Lecture topics:

Lecture slides:

Lecture 06

Lecture 06 markdown

feedback

5 min feedback

Additional files

Differential Expression Page

Differential Expression Rmd

MultiQC Report

Homework Assignment 6:

Due July 7th


July 7th scRNA-seq I: analysis of a single sample (Coetzee)

Lecture topics:

Lecture slides:

Lecture 08

Lecture 08 markdown

feedback

5 min feedback

Readings and websites:


July 14th scRNA-seq II: analysis of a patient cohort (Coetzee)

Lecture topics:

Lecture Slides:

Lecture 09

Lecture 09 markdown

feedback

5 min feedback

Readings and websites:

Homework Assignment 7:

Due July 28th


July 21st BREAK


July 28th Analysis of the Microbiome, part I (Vujkovic-Cvijin)

Lecture topics:

16S rRNA sequence data processing and analysis

Homework Assignment 8: Follow and complete the tutorial for dada2 prior to July 28th. Please also install the following R packages: vegan, phyloseq, lmerTest, lme4, ggplot2, dplyr, ape, reshape2

https://www.dropbox.com/scl/fo/eepu0rvg25n4ycml7livv/h?rlkey=yedzeixm8192u9wn9zgks3fux&dl=0

5 min feedback July 28

lecture slides

Microbiome Analysis Part 1 Homework, due August 15


Aug 4th Analysis of the Microbiome, part II (Vujkovic-Cvijin)

5 min feedback Aug 4

lecture slides

Microbiome Analysis Part 2 Homework


August 11th Visualization and Color Palettes (Coetzee)

Lecture topics:

Lecture slides:

Lecture 12

5 minute feedback

Overall Course Feedback

Helpful Books on the Subject:

Schedule and Due Dates:

day date lecturer topic due
Friday 05/19 HAZELETT History of Bioinformatics
Friday 05/26 HAZELETT Basic Unix Skills
Friday 06/02 HAZELETT Version Control & Git Assignment 1
Friday 06/09 HAZELETT PCA, heatmaps, clusterProfiler Assignment 2
Friday 06/16 COETZEE NGS File Formats Assignment 3
Friday 06/23 COETZEE Searching in files & Genome Arithmetic Assignment 4
Friday 06/30 COETZEE Advanced Search Assignment 5
Friday 07/07 COETZEE scRNA-seq I (single sample) Assignment 6
Friday 07/14 COETZEE scRNA-seq II (patient cohort)
Friday 07/21 BREAK Assignment 8
Friday 07/28 VUJKOVIC-CVIJIN Microbiome I Assignment 7
Friday 08/04 VUJKOVIC-CVIJIN Microbiome II Assignmnet 8
Friday 08/11 COETZEE Visualization & Color Assignment 9