dakusui / bredxbred

mapreduce in bash
Other
4 stars 1 forks source link

Execute awk function in a reduce task #6

Closed dakusui closed 9 years ago

dakusui commented 9 years ago

Currently reduce task is written as general shell script and you can specify your favorite interpreter. And in bred's reducer the code you give will be executed using the interpreter for each key. But since generally speaking there are a lot of keys processed in a reducer, this means so many context switches (external command executions) will be made. Making it possible to call a user defined awk function from inside reducer, which is written in an awk string, would be performance-wise very benefitial.

dakusui commented 9 years ago

Implemented in this commit. d15481bec5ecff185351455ade8e2cdf617f8968