nathanmarz / cascalog

Data processing on Hadoop without the hassle.
Other
1.38k stars 179 forks source link

Parallel aggregation performance sucks #230

Closed ipostelnik closed 10 years ago

ipostelnik commented 10 years ago

ClojureMonoidAggregator de-serializes functions for every group rather than once in prepare().