Page 1 of 1

You've written a MapReduce job that will process 500 million input records and generated 500 million key-value pairs. Th

Posted: Sat Jul 23, 2022 7:29 pm
by answerhappygod
You've written a MapReduce job that will process 500 million input records and generated 500 million key-value pairs. The data is not uniformly distributed. Your
MapReduce job will create a significant amount of intermediate data that it needs to transfer between mappers and reduces which is a potential bottleneck. A custom implementation of which interface is most likely to reduce the amount of intermediate data transferred across the network?

A. Partitioner
B. OutputFormat
C. WritableComparable
D. Writable
E. InputFormat
F. Combiner