Caltech CS 155 Winter 2018
Code for projects in Caltech's CS155: Machine Learning and Data Mining, developed by team Aw Young Qingzhuo, Ola Kalisz, and Riley Patterson.
The first project is a Kaggle competition to detect sentiment in Amazon reviews. Details on the task are provided on the Kaggle project page, and our code is in the src directory, in particular in a jupyter notebook for training individual models and in this source file defining a stacked neural net using these results.
Initial summaries of results for individual models were tracked in this issue, and further results will be in project report.
The second project involves finding matrix factorizations for the MovieLens dataset and projecting it into two dimensions for visualization and interpretation of how and why movies differ from each other.
We applied three different implementations of matrix factorization:
For each of these, we produced 2D visualizations with various colorings and labelings to help infer what the two extracted dimensions correlated to.
The code for this project is in the source directory for project2, and the results are found in the project2 report directory.