add count by value - Githubissues

rajasekarv / vega

A new arguably faster implementation of Apache Spark from scratch in Rust

Apache License 2.0

2.23k stars 206 forks source link

Closed frqc closed 4 years ago

frqc commented 4 years ago

use rdd.map(x => (x, 1L)).reduceByKey( + ) instead of countByKey() as hinted by comments in spark

iduartgomez commented 4 years ago

Could you add the documentation for the method? It can be a straight port from the original method, specially the note is important.

Thanks!

frqc commented 4 years ago

Yep remove unnecessary box and add a short comment