Currently bred's output goes back to the master node when each job finishes and this is not ideal implementation.
A mechanism where a map job stores data set on the host on which it is running and following reduce jobs which run on the same host can reuse them would be preferrable.
Currently bred's output goes back to the master node when each job finishes and this is not ideal implementation. A mechanism where a map job stores data set on the host on which it is running and following reduce jobs which run on the same host can reuse them would be preferrable.