You have an Apache Spark cluster in Azure HDInsight. You plan to join a large table and a lookup table. You need to minimize data transfers during the join operation. What should you do?
A. Use the reduceByKey function.
B. Use a Broadcast variable.
C. Repartition the data.
D. Use the DISK_ONLY storage level.
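
For context on option B, here is a minimal plain-Python sketch of the idea behind a broadcast (map-side) join: the small lookup table is copied to every partition of the large table, so each partition can be joined locally and the large table's rows never have to be shuffled. It is a simulation, not Spark code. The data, `join_partition`, and `broadcast_join` are hypothetical names made up for illustration. In Spark itself the same idea is exposed through `SparkContext.broadcast` for RDDs and the `broadcast` hint in `pyspark.sql.functions` for DataFrames.

```python
# Hypothetical data: the large table is already split into partitions,
# and the lookup table is small enough to copy to every partition.
large_table_partitions = [
    [(1, "order-a"), (2, "order-b")],
    [(1, "order-c"), (3, "order-d")],
]
lookup_table = {1: "Contoso", 2: "Fabrikam"}


def join_partition(rows, lookup):
    """Inner-join one partition against its local copy of the lookup table."""
    return [(key, value, lookup[key]) for key, value in rows if key in lookup]


def broadcast_join(partitions, lookup):
    # Each partition receives the lookup table and joins in place;
    # rows of the large table stay in the partition they started in.
    return [join_partition(rows, lookup) for rows in partitions]


joined = broadcast_join(large_table_partitions, lookup_table)
```

Rows whose key has no match in the lookup table (key 3 here) are dropped, as in an inner join.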
answerhappygod
- Site Admin
- Posts: 899604
- Joined: Mon Aug 02, 2021 8:13 am