arunma / ScalaDataAnalysisCookbook

Code for Packt Publishing's Scala Data Analysis Cookbook.
https://www.packtpub.com/big-data-and-business-intelligence/scala-data-analysis-cookbook
49 stars 52 forks source link

Scala Data Analysis Cookbook

Samples for Packt Publishing's Scala Data Analysis Cookbook

The samples in this project were written with JDK 1.7, Scala 2.10.4 and SBT .13.8

1.  Getting started with Breeze - Vectors, Matrices and RNGs
2.  Getting started with Spark - DataFrames, Vectors and Matrices
3.  Data Loading and Preparation - Spark DataFrames
4.  Data Visualization with Zeppelin and Bokeh Scala
5.  Learning from Data - Spark MLLib (Linear Regression, Classification, Clustering and PCA)
6.  Scaling up - Deploying Spark on Standalone cluster, EC2, Mesos and YARN
7.  Going further - Streaming from Twitter, Kafka, Streaming Logistic Regression and Twitter CC analysis using GraphX