mahmoudparsian / data-algorithms-book

MapReduce, Spark, Java, and Scala for Data Algorithms Book
http://mapreduce4hackers.com
Other
1.07k stars 666 forks source link
apache-hadoop apache-spark data-algorithms design-patterns distributed-algorithms distributed-computing hadoop-mapreduce java machine-learning mappers mapreduce partitioning pyspark python reducers scala

Data Algorithms Book

Git Repository

The book's codebase can also be downloaded from the git repository at:

git clone https://github.com/mahmoudparsian/data-algorithms-book.git

2nd Edition! Coming Out @ the End of 2021

Upgraded to Spark-3.1.2

Production Version is Available NOW!

Data Algorithms Book

Java 8's LAMBDA Expressions to Spark...

Scala Spark Solutions

How To Build using Apache's Ant

How To Build using Apache's Maven

Machine Learning Algorithms using Spark

Spark for Cancer Outlier Profile Analysis

Webinars and Presentions on Data Algorithms

Introduction to MapReduce

Bonus Chapters

Author Book Signing

[How To Run Spark/Hadoop Programs](./misc/run_spark/README.md) ================================== [Submit a Spark Job from Java Code](./misc/how-to-submit-spark-job-from-java-code.md) =========================================== How To Run Python Programs ========================== To run python programs just call them with `spark-submit` together with the arguments to the program. [My favorite quotes...](./misc/favorite_quotes/README.md) ========================================================= Questions/Comments ================== * [View Mahmoud Parsian's profile on LinkedIn](http://www.linkedin.com/in/mahmoudparsian) * Please send me an email: * [Twitter: @mahmoudparsian](http://twitter.com/mahmoudparsian) Thank you! ```` best regards, Mahmoud Parsian ```` [![Data Algorithms Book](./misc/large-image.jpg)](http://shop.oreilly.com/product/0636920033950.do)