You have an Apache Spark cluster in Azure HDInsight. You plan to join a large table and a lookup table. You need to minimi

Posted: Wed Aug 17, 2022 7:02 am
by answerhappygod
You have an Apache Spark cluster in Azure HDInsight. You plan to join a large table and a lookup table. You need to minimize data transfers during the join operation. What should you do?

A. Use the reduceByKey function.
B. Use a Broadcast variable.
C. Repartition the data.
D. Use the DISK_ONLY storage level.
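
For anyone weighing the options: option B corresponds to what is usually called a broadcast (map-side) join, where the small lookup table is copied to every executor so the large table's rows never have to be shuffled across the network. Below is a minimal plain-Python sketch of that idea, not Spark code; the table contents and the names `large_partitions`, `lookup`, and `join_partition` are made up for illustration. In PySpark the rough equivalents would be `sc.broadcast(lookup)` for RDDs, or `large_df.join(broadcast(lookup_df), "key")` with `broadcast` imported from `pyspark.sql.functions`.

```python
# Hypothetical data: the large table is already split into partitions,
# the way it would be spread across executors in a cluster.
large_partitions = [
    [(1, "order-a"), (2, "order-b")],
    [(3, "order-c"), (1, "order-d")],
]

# Hypothetical small lookup table (key -> country code).
lookup = {1: "US", 2: "DE", 3: "JP"}


def join_partition(rows, lookup_copy):
    """Join one partition locally against a full copy of the lookup table."""
    return [
        (key, value, lookup_copy[key])
        for key, value in rows
        if key in lookup_copy
    ]


# "Broadcast": every partition receives the whole lookup table, so each
# partition is joined in place and no large-table row changes partition.
joined = [join_partition(part, lookup) for part in large_partitions]

for part in joined:
    print(part)
```

The point of the sketch is that only the small table is copied around; a regular shuffle join would instead move rows of both tables so that matching keys land on the same node.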