Amazon S3 hosts thousands of text files. The files total 1 PB in size. The files include information on retail orders placed in the last two years. A data engineer must do many interactive searches in order to modify the data. The Data Engineer has access to Amazon Web Services (AWS) and is able to set up an Amazon EMR cluster. The data engineer must be able to
Utilize a clustered application to analyze the data and deliver the findings in an interactive time period.
Which cluster application should the data engineer use?
A. Oozie
B. Apache Pig with Tachyon
C. Apache Hive
D. Presto
Amazon S3 hosts thousands of text files. The files total 1 PB in size. The files include information on retail orders pl
-
answerhappygod
- Site Admin
- Posts: 899604
- Joined: Mon Aug 02, 2021 8:13 am
Amazon S3 hosts thousands of text files. The files total 1 PB in size. The files include information on retail orders pl
Join a community of subject matter experts. Register for FREE to view solutions, replies, and use search function. Request answer by replying!