vpodgurschi-mm / rap-external-docs

papers, articles etc on big data, databases, streaming/batch processing etc
0 stars 0 forks source link

docs #1

Open vpodgurschi-mm opened 8 years ago

vpodgurschi-mm commented 8 years ago

http://highscalability.com/blog/2015/12/9/free-red-book-readings-in-database-systems-5th-edition.html https://www.dropbox.com/sh/9wlcv9shyijfa3k/AAByp0jIRDdxYWP_xYjYAE3xa?dl=0

vpodgurschi-mm commented 8 years ago

apache flink - distributed stream processing

selections from the talks of the recent FlinkForward conference http://flink-forward.org/?post_type=session

vpodgurschi-mm commented 8 years ago

Excellent talks on streaming data processing, and Apache Flink

Volker Markl: Big Data Management and Apache Flink https://www.youtube.com/watch?v=iL8plQ3vWf8 -> origins of Flink (Stratosphere); shortcomings of MapReduce; memory management, out-of-core, delta iterations, cyclic data processing graphs (iterations), schema-on-read etc

Seif Haridi: Apache Flink Streaming https://www.youtube.com/watch?v=P4ERkq0rnoI http://www.slideshare.net/dato-inc/sics-apache-flink-streaming?qid=7db7d379-29b9-4d8a-aa7b-ee857e117164 -> great touch on all aspects, including fault tolerance, flow control, windowing

Matthias Sax: Feeding a Squirrel in Time - Windows in Flink https://www.youtube.com/watch?v=ZVZyg0aWbMU -> comparison with other streaming systems; windowing; code samples

Fabian Hueske: Apache Flink Batch Advanced https://www.youtube.com/watch?v=1yWKZ26NQeU

Paris Carbone: Data Stream Processing with Apache Flink https://www.youtube.com/watch?v=L3DLV0GtMSM -> more generic talk on distributed streaming processing: windowing, state (synopsis), partitioning; comparison of systems

Gyula Fora: Building Stream Processing Pipelines https://www.youtube.com/watch?v=ONKDfgrE9ng