Closed frqc closed 4 years ago
use rdd.map(x => (x, 1L)).reduceByKey( + ) instead of countByKey() as hinted by comments in spark
Could you add the documentation for the method? It can be a straight port from the original method, specially the note is important.
Thanks!
Yep remove unnecessary box and add a short comment
use rdd.map(x => (x, 1L)).reduceByKey( + ) instead of countByKey() as hinted by comments in spark