Apache:Spark computing system #36

Closed joyhuang9473 closed 8 years ago

joyhuang9473 commented 8 years ago

Apache Spark is a fast, general-purpose cluster computing system for large-scale data processing. It provides high-level APIs in Scala, Java, Python, and R, and an optimized engine that supports general computation graphs for data analysis. It also supports a rich set of higher-level tools, including Spark SQL for SQL and DataFrames, MLlib for machine learning, GraphX for graph processing, and Spark Streaming for stream processing.
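
To make the phrase "general computation graphs" a little more concrete, here is a toy sketch in plain Python. It is not Spark's API and does not use Spark at all. The `LazyDataset` class and its `map`, `filter`, and `collect` methods are hypothetical names, chosen only to mirror the common pattern in which transformations are recorded as nodes in a graph and nothing runs until a result is requested. It runs in a single process and leaves out distribution, partitioning, and optimization, which are what a real engine is for.

```python
from typing import Callable, Iterable, Iterator, Optional


class LazyDataset:
    """Hypothetical toy node in a computation graph (NOT a Spark class)."""

    def __init__(
        self,
        source: Optional[Iterable] = None,
        parent: Optional["LazyDataset"] = None,
        op: Optional[Callable[[Iterator], Iterator]] = None,
    ) -> None:
        self.source = source  # input data; set only on the root node
        self.parent = parent  # upstream node this node depends on
        self.op = op          # deferred step applied to the parent's rows

    def map(self, fn: Callable) -> "LazyDataset":
        # Transformation: record the step as a new node, compute nothing yet.
        return LazyDataset(parent=self, op=lambda rows: (fn(r) for r in rows))

    def filter(self, pred: Callable) -> "LazyDataset":
        # Transformation: also only recorded, not executed.
        return LazyDataset(
            parent=self, op=lambda rows: (r for r in rows if pred(r))
        )

    def _rows(self) -> Iterator:
        # Walk from this node back to the root, chaining the deferred steps.
        if self.parent is None:
            return iter(self.source)
        return self.op(self.parent._rows())

    def collect(self) -> list:
        # Action: this is the point where the recorded graph actually runs.
        return list(self._rows())


numbers = LazyDataset(source=range(1, 11))
even_squares = numbers.map(lambda x: x * x).filter(lambda x: x % 2 == 0)

# Nothing has been computed so far; collect() triggers the evaluation.
result = even_squares.collect()
print(result)  # [4, 16, 36, 64, 100]
```

Because each transformation returns a new node rather than modifying its parent, `numbers` can be reused as the starting point for other pipelines, which is what turns the structure into a graph instead of a single chain.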