You have an Apache Spark cluster in Azure HDInsight. You plan to join a large table and a lookup table. You need to minimi

Post by answerhappygod »

You have an Apache Spark cluster in Azure HDInsight. You plan to join a large table and a lookup table. You need to minimize data transfers during the join operation. What should you do?

A. Use the reduceByKey function.
B. Use a Broadcast variable.
C. Repartition the data.
D. Use the DISK_ONLY storage level.
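
For a join between a large table and a small lookup table, option B (a broadcast variable) is the usual way to cut data transfer: each executor receives one copy of the lookup table, so the large table's rows never have to be shuffled across the network. The sketch below is a plain-Python simulation of that idea, not Spark code. The function names, the sample data, and the "every shuffled row counts as one transfer" cost model are all assumptions made for illustration.

```python
from collections import defaultdict


def broadcast_join(partitions, lookup):
    """Map-side join: each partition receives a full copy of the small lookup table."""
    # Assumed cost model: one copy of the lookup table is sent to every partition.
    transferred = len(lookup) * len(partitions)
    joined = []
    for part in partitions:
        for key, value in part:
            # The large table's rows are joined where they already live.
            if key in lookup:
                joined.append((key, value, lookup[key]))
    return joined, transferred


def shuffle_join(partitions, lookup, num_reducers=2):
    """Reduce-side join: rows of both tables are repartitioned by key first."""
    transferred = 0
    row_buckets = defaultdict(list)
    for part in partitions:
        for key, value in part:
            row_buckets[hash(key) % num_reducers].append((key, value))
            transferred += 1  # simplification: every large-table row moves
    lookup_buckets = defaultdict(dict)
    for key, name in lookup.items():
        lookup_buckets[hash(key) % num_reducers][key] = name
        transferred += 1  # lookup rows move too
    joined = []
    for bucket, rows in row_buckets.items():
        for key, value in rows:
            if key in lookup_buckets[bucket]:
                joined.append((key, value, lookup_buckets[bucket][key]))
    return joined, transferred


# Hypothetical data: 4 partitions of 1000 rows each, and a 3-row lookup table.
lookup = {"US": "United States", "DE": "Germany", "JP": "Japan"}
codes = ["US", "DE", "JP", "FR"]  # "FR" has no match in the lookup table
partitions = [
    [(codes[i % 4], p * 1000 + i) for i in range(1000)]
    for p in range(4)
]

b_rows, b_moved = broadcast_join(partitions, lookup)
s_rows, s_moved = shuffle_join(partitions, lookup)

print("broadcast join moved", b_moved, "records")
print("shuffle join moved", s_moved, "records")
print("same result:", sorted(b_rows) == sorted(s_rows))
```

Both functions produce the same joined rows; only the number of records moved differs. In PySpark the same idea is typically written as `large_df.join(broadcast(lookup_df), "key")`, with `broadcast` imported from `pyspark.sql.functions`, or with `sc.broadcast(...)` when working with RDDs. The column name `key` and the DataFrame names here are placeholders.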