You have an Apache Spark cluster in Azure HDInsight.You plan to join a large table and a lookup table.You need to minimi
Posted: Wed Aug 17, 2022 7:02 am
You have an Apache Spark cluster in Azure HDInsight.You plan to join a large table and a lookup table.You need to minimize data transfers during the join operation.What should you do?
A. Use the reduceByKey function
B. Use a Broadcast variable.
C. Repartition the data.
D. Use the DISK_ONLY storage level.
A. Use the reduceByKey function
B. Use a Broadcast variable.
C. Repartition the data.
D. Use the DISK_ONLY storage level.