Note: This question is part of a series of questions that present the same Scenario.Each question I the series contains

Business, Finance, Economics, Accounting, Operations Management, Computer Science, Electrical Engineering, Mechanical Engineering, Civil Engineering, Chemical Engineering, Algebra, Precalculus, Statistics and Probabilty, Advanced Math, Physics, Chemistry, Biology, Nursing, Psychology, Certifications, Tests, Prep, and more.
Post Reply
answerhappygod
Site Admin
Posts: 899604
Joined: Mon Aug 02, 2021 8:13 am

Note: This question is part of a series of questions that present the same Scenario.Each question I the series contains

Post by answerhappygod »

Note: This question is part of a series of questions that present the same Scenario.Each question I the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution while others might not have correct solution.Start of Repeated Scenario:You are planning a big data infrastructure by using an Apache Spark Cluster in AzureHDInsight. The cluster has 24 processor cores and 512 GB of memory.The Architecture of the infrastructure is shown in the exhibit:The architecture will be used by the following users:* Support analysts who run applications that will use REST to submit Spark jobs.* Business analysts who use JDBC and ODBC client applications from a real-time view.The business analysts run monitoring quires to access aggregate result for 15 minutes.The result will be referenced by subsequent quires.* Data analysts who publish notebooks drawn from batch layer, serving layer and speed layer queries. All of the notebooks must support native interpreters for data sources that are bath processed. The serving layer queries are written in Apache Hive and must support multiple sessions. Unique GUIDs are used across the data sources, which allow the data analysts to use Spark SQL.The data sources in the batch layer share a common storage container. The Following data sources are used:* Hive for sales data* Apache HBase for operations data* HBase for logistics data by suing a single region server.End of Repeated scenario.You need to ensure that the support analysts can develop embedded analytics applications by using the least amount of development effort.Which technology should you implement?
Note This Quest 1
Note This Quest 1 (246.58 KiB) Viewed 78 times
A. Zeppelin
B. Jupyter
C. Apache Ambari
D. Livy
Join a community of subject matter experts. Register for FREE to view solutions, replies, and use search function. Request answer by replying!

This topic has 1 reply

You must be a registered member and logged in to view the replies in this topic.


Register Login
 
Post Reply