eecs485staff / madoop

A lightweight MapReduce framework for education
MIT License

Group using hash and sort #12

Closed awdeorio closed 2 years ago

awdeorio commented 2 years ago

The current implementation of the group stage relies on the command-line sort utility. Use Python's built-in sort instead.

EDIT: it would be even better to change the group strategy so that each key is assigned to a reducer by hash(key) % num_reducers.
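
A minimal sketch of the proposed strategy, not madoop's actual code. It assumes map output arrives as tab-separated "key\tvalue" lines; the function names `partition_index` and `group` are hypothetical. It also swaps Python's built-in hash() for an MD5 digest, because hash() on strings is randomized per process by default, which would send the same key to different reducers on different runs.

```python
import hashlib


def partition_index(key, num_reducers):
    """Map a key to a reducer index using a hash that is stable across runs.

    Assumption: MD5 stands in for hash(key), which is salted per process
    for strings unless PYTHONHASHSEED is fixed.
    """
    digest = hashlib.md5(key.encode("utf-8")).hexdigest()
    return int(digest, 16) % num_reducers


def group(lines, num_reducers):
    """Split "key\\tvalue" lines into one key-sorted list per reducer."""
    partitions = [[] for _ in range(num_reducers)]
    for line in lines:
        key = line.partition("\t")[0]
        partitions[partition_index(key, num_reducers)].append(line)
    for part in partitions:
        # Python's stable sort replaces the external sort command;
        # lines with equal keys end up adjacent within a partition.
        part.sort(key=lambda item: item.partition("\t")[0])
    return partitions


if __name__ == "__main__":
    map_output = ["b\t1", "a\t1", "b\t2", "c\t1"]
    for index, part in enumerate(group(map_output, num_reducers=2)):
        print(index, part)
```

Every line with a given key lands in the same partition, and each partition is sorted by key, so a reducer can process its input one key group at a time.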