This repository contains hands-on labs and data files for a full-day Apache Spark workshop. This file is also the index for the hands-on labs.
Lab 0 - Python Fundamentals
Lab 1 - Multi-File Word Count
Lab 2 - Analyzing Flight Delays
Lab 3 - Analyzing Startup Companies
Lab 4 - Analyzing UK Property Prices
Lab 5 - Streaming Tweet Analysis
Lab 6 - PageRank over Movie References
Lab 7 - Plagiarism Detection
Lab 0 - Scala Fundamentals
Copyright (C) Sasha Goldshtein, 2016. All rights reserved.