You've written a MapReduce job that will process 500 million input records and generated 500 million key-value pairs. The data is not uniformly distributed. Your
MapReduce job will create a significant amount of intermediate data that it needs to transfer between mappers and reduces which is a potential bottleneck. A custom implementation of which interface is most likely to reduce the amount of intermediate data transferred across the network?
A. Partitioner
B. OutputFormat
C. WritableComparable
D. Writable
E. InputFormat
F. Combiner
You've written a MapReduce job that will process 500 million input records and generated 500 million key-value pairs. Th
-
answerhappygod
- Site Admin
- Posts: 899604
- Joined: Mon Aug 02, 2021 8:13 am
You've written a MapReduce job that will process 500 million input records and generated 500 million key-value pairs. Th
Join a community of subject matter experts. Register for FREE to view solutions, replies, and use search function. Request answer by replying!