Closed awdeorio closed 2 years ago
The current implementation relies on the CLI `sort` command. This change uses Python's built-in sort instead.
EDIT: It would be even better to improve the grouping strategy to use `hash(key) % num_reducers`.
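A minimal sketch of what the `hash(key) % num_reducers` grouping could look like, combined with an in-process Python sort. The function names `partition_index` and `group_lines` are hypothetical, and the tab-separated `key\tvalue` line format is an assumption, not something taken from the existing code. The sketch also swaps Python's built-in `hash()` for an MD5 digest, because `hash()` on strings is salted per process by default and would not give the same partition across separate runs.

```python
import hashlib


def partition_index(key: str, num_reducers: int) -> int:
    """Map a key to a reducer partition (hypothetical helper).

    Uses an MD5 digest rather than built-in hash() so the result is
    stable across processes and runs.
    """
    digest = hashlib.md5(key.encode("utf-8")).hexdigest()
    return int(digest, 16) % num_reducers


def group_lines(lines, num_reducers):
    """Assign each line to a partition by key, then sort each partition.

    Assumes each line looks like "key\tvalue" (an assumption for this sketch).
    """
    partitions = [[] for _ in range(num_reducers)]
    for line in lines:
        key = line.partition("\t")[0]
        partitions[partition_index(key, num_reducers)].append(line)

    # Sort in Python instead of shelling out to the CLI sort.
    # list.sort() is stable, so lines with equal keys keep their input order.
    for part in partitions:
        part.sort(key=lambda ln: ln.partition("\t")[0])
    return partitions
```

With this approach every line sharing a key lands in the same partition regardless of input order, and each reducer receives its keys in sorted order without depending on the locale settings of an external `sort`.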