comp-machine-learning-spring2021 / portfolio-krhablutzel

Machine learning (CSC294) final portfolio, including supervised, unsupervised, and deep learning
0 stars 0 forks source link

Kathleen Hablutzel's Machine Learning Portfolio

CSC 294 Computational Machine Learning

Spring 2021

Hello and welcome to my machine learning portfolio! I'm wrapping up my sophomore year with a portfolio to showcase the skills I've picked up in my machine learning course this semester. This portfolio is a final project for the CSC294 Computational Machine Learning course at Smith College under Dr. Katherine Kinnaird, but I intend for this portfolio to be a useful reference for my future machine learning projects, as well as a preview of some of my machine learning skills for potential employers. Thoughout the project, I explore a weather dataset pulled from NOAA's Global Surface Summary of the Day for the Raleigh-Durham International Airport (RDU) in order to explore various aspects of the weather in the region I call home, the Research Triangle of North Carolina.

In this porfolio, I've chosen to highlight three (four) broad areas of experience that we covered this semester:

  1. Data Wrangling: dimension reduction (PCA), pandas, numpy, matplotlib, seaborn
  2. Clustering: k-Means clustering, spectral clustering, benchmarking, sklearn
  3. Classification: random forests, decision trees, k-nearest neighbors (kNN), cross-validation, sklearn
  4. Deep Learning: neural networks, recurrent neural networks (RNNs), tensorflow

We covered many foundational machine learning techniques this semester, and many of them found their way into this portfolio. The only major techniques we covered that are missing from this portfolio are Singular Value Decomposition (SVD), Gradient Descent, Support Vector Machines (SVM), perceptrons, and Convolutional Neural Networks (CNNs). I've enjoyed taking a deep dive into all of these techniques this semester, both applying them with sklearn/tensorflow and implementing many of them in Python from scratch!

As a Computer Science + Statistical and Data Sciences double major and Zollman Scholar at Smith College ('23), I hope to continue applying many of these machine learning techniques in my future endeavors. I currently work as a software engineer in the Grubb Lab through the STRIDE research program, and I'm an incoming data engineer at Bank of America for Summer 2021. I'm excited to have all these machine learning techniques in my toolkit for future projects! Outside machine learning, my interests include data science, networks, cryptography, and complex systems, as well as running, theater, and 16th century British politics.