jawilliams3000 / DAT_SF_12

0 stars 0 forks source link

DAT SF 12

<<<<<<< HEAD

Instructor:Alessandro Gagliardi
EiRs:Ramesh Sampath
Otto Stegmaier
Alex Chao
Classes:6:30pm-9:30pm, Tuesday and Thursdays
January 15 – March 31
Office Hours:Alex Chao, 5:30 - 6:30 before class at GA
Otto Stegmaier, 9:30 - 10:00 after class at GA
Ramesh Sampath, 4:00 - 6:00 Saturdays remote
Can also set by appointment

Homework is to be submitted by posting it to your own github repo. Then post the URL and folder where the homework lives at here.


Tentative Course Outline

  1. Intro to Data Science, Relational Databases & SQL
  2. Getting started with IPython & Git
  3. APIs and semi-structured data
  4. IPython.parallel & StarCluster
  5. Hadoop Distributed File System and Spark
  6. Intro to ML: k-Nearest Neighbor Classification
  7. Clustering: Hierarchical and K-Means
  8. Probability, A/B Tests & Statistical Significance
  9. Multiple Linear Regression and ANOVA
  10. Project Elevator Pitches
  11. Logistic Regression and Generlized Linear Models
  12. Time Series Analysis & Midterm Review
  13. Principal Components Analysis
  14. Text Mining & Naïve Bayes
  15. Nonlinear Models
  16. Grid Search and Parameter Selection
  17. Bringing it Together
  18. Final Project Working Session
  19. Final Project Working Session
  20. Final Project Presentations (12 min. each)
  21. Final Project Presentations (12 min. each)
  22. Future Directions

Project Schedule

Date Due Returned
1/22 Preliminary Project Proposals Due (3-4 sentences)
1/27 Homework 1
1/29 EiR Feedback on Project Proposals
2/3 EiR Feedback on Homework 1
2/5 Formal Proposals (including data and methods chosen)
2/10 Homework 2 Assigned
2/12 EiR Feedback on Formal Proposals
2/17 Homework 2 Due
2/19 Homework 3 Assigned and
Project Elevator Pitch in class (4 minutes each)
Project Live on Github
2/24 Homework 3 Due EiR Feedback on Homework 2
2/26 Peer Feedback of Projects Peer Feedback on Project
3/3 Peer Feedback of Homework 3 Peer Feedback on Homework 3
3/10 Midterm Assessment Due
3/17 At least one working model
3/24-26 Final Presentations (12 minutes each) Midterm Graded

======= Note: I am working through the feedback on my proposal and hope to have the formal proposal posted on 2/12/15

acf05634219dc9c9a09f522b1f5ffe1987a17e1f