rajasekarv / vega

A new arguably faster implementation of Apache Spark from scratch in Rust
Apache License 2.0
2.23k stars 206 forks source link

Tracking issue: Implementation of lacking core RDD ops #55

Open iduartgomez opened 4 years ago

iduartgomez commented 4 years ago

For core RDD ops we understand those which spawn in the original Apache Spark from SparkContext and/or the base RDD class and friends: SC:

Non-goals for this tracking issue are any I/O related ops as we are tracking those elsewhere and doing things a little bit differently:

iduartgomez commented 4 years ago

Intersection completed in #66

iduartgomez commented 4 years ago

range done in #82

GavrielPlotke commented 4 years ago

@iduartgomez - Isn't substract a misspelling of subtract ?

rajasekarv commented 4 years ago

fixed @GavrielPlotke

ajprabhu09 commented 4 years ago

what would the subtract operation entail, can someone give an example?

GavrielPlotke commented 4 years ago

Doc: https://spark.apache.org/docs/1.0.2/api/java/org/apache/spark/rdd/RDD.html#subtract(org.apache.spark.rdd.RDD) Example: