Closed habernal closed 8 years ago
See https://support.pivotal.io/hc/en-us/articles/202810986-Mapper-output-key-value-NullWritable-can-cause-reducer-phase-to-move-slowly
Confirmed. Runs faster with intermediate compression; no single-reducer bottleneck for certain keys observed (run over entire CommonCrawl)
See https://support.pivotal.io/hc/en-us/articles/202810986-Mapper-output-key-value-NullWritable-can-cause-reducer-phase-to-move-slowly