goldshtn / spark-workshop

Labs and data files for a full-day Spark workshop
MIT License
24 stars 23 forks source link

Spark Workshop

This repository contains hands-on labs and data files for a full-day Apache Spark workshop. This file is also the index for the hands-on labs.


Python Labs

  1. Lab 0 - Python Fundamentals

  2. Lab 1 - Multi-File Word Count

  3. Lab 2 - Analyzing Flight Delays

  4. Lab 3 - Analyzing Startup Companies

  5. Lab 4 - Analyzing UK Property Prices

  6. Lab 5 - Streaming Tweet Analysis

  7. Lab 6 - PageRank over Movie References

  8. Lab 7 - Plagiarism Detection


Scala Labs (under development)

  1. Lab 0 - Scala Fundamentals

  2. Lab 1 - Multi-File Word Count

  3. Lab 2 - Analyzing Flight Delays

  4. Lab 3 - Analyzing Startup Companies

  5. Lab 4 - Analyzing UK Property Prices

  6. Lab 6 - PageRank over Movie References


Copyright (C) Sasha Goldshtein, 2016. All rights reserved.