Open vpodgurschi-mm opened 8 years ago
apache flink - distributed stream processing
selections from the talks of the recent FlinkForward conference http://flink-forward.org/?post_type=session
Excellent talks on streaming data processing, and Apache Flink
Volker Markl: Big Data Management and Apache Flink https://www.youtube.com/watch?v=iL8plQ3vWf8 -> origins of Flink (Stratosphere); shortcomings of MapReduce; memory management, out-of-core, delta iterations, cyclic data processing graphs (iterations), schema-on-read etc
Seif Haridi: Apache Flink Streaming https://www.youtube.com/watch?v=P4ERkq0rnoI http://www.slideshare.net/dato-inc/sics-apache-flink-streaming?qid=7db7d379-29b9-4d8a-aa7b-ee857e117164 -> great touch on all aspects, including fault tolerance, flow control, windowing
Matthias Sax: Feeding a Squirrel in Time - Windows in Flink https://www.youtube.com/watch?v=ZVZyg0aWbMU -> comparison with other streaming systems; windowing; code samples
Fabian Hueske: Apache Flink Batch Advanced https://www.youtube.com/watch?v=1yWKZ26NQeU
Paris Carbone: Data Stream Processing with Apache Flink https://www.youtube.com/watch?v=L3DLV0GtMSM -> more generic talk on distributed streaming processing: windowing, state (synopsis), partitioning; comparison of systems
Gyula Fora: Building Stream Processing Pipelines https://www.youtube.com/watch?v=ONKDfgrE9ng
http://highscalability.com/blog/2015/12/9/free-red-book-readings-in-database-systems-5th-edition.html https://www.dropbox.com/sh/9wlcv9shyijfa3k/AAByp0jIRDdxYWP_xYjYAE3xa?dl=0