Amazon Elastic MapReduce (EMR) is used by an organization to perform a sequence of extract-transform-load (ETL) stages.

Business, Finance, Economics, Accounting, Operations Management, Computer Science, Electrical Engineering, Mechanical Engineering, Civil Engineering, Chemical Engineering, Algebra, Precalculus, Statistics and Probabilty, Advanced Math, Physics, Chemistry, Biology, Nursing, Psychology, Certifications, Tests, Prep, and more.
Post Reply
answerhappygod
Site Admin
Posts: 899604
Joined: Mon Aug 02, 2021 8:13 am

Amazon Elastic MapReduce (EMR) is used by an organization to perform a sequence of extract-transform-load (ETL) stages.

Post by answerhappygod »

Amazon Elastic MapReduce (EMR) is used by an organization to perform a sequence of extract-transform-load (ETL) stages. Each step's output must be completely processed in future stages or it will be discarded.

Which of the following methods will most effectively fulfill this requirement?

A. Use the EMR File System (EMRFS) to store the outputs from each step as objects in Amazon Simple Storage Service (S3).
B. Use the s3n URI to store the data to be processed as objects in Amazon S3.
C. Define the ETL steps as separate AWS Data Pipeline activities.
D. Load the data to be processed into HDFS, and then write the final output to Amazon S3.
Join a community of subject matter experts. Register for FREE to view solutions, replies, and use search function. Request answer by replying!

This topic has 1 reply

You must be a registered member and logged in to view the replies in this topic.


Register Login
 
Post Reply