A company is creating a mobile social application and need logs from all devices on which the program is installed. Amazon Kinesis Data Streams is being evaluated to push logs and Amazon EMR is being evaluated to handle data. They wish to store data on HDFS and replicate it throughout the cluster using the default replication factor, but they are worried about the data's longevity. They are now generating 300 GB of raw data each day, with surges during exceptional events. They will need to expand the Amazon EMR cluster to accommodate the additional streaming data.
Which method avoids data loss while still meeting computational demand?
A. Use multiple Amazon EBS volumes on Amazon EMR to store processed data and scale out the Amazon EMR cluster as needed.
B. Use the EMR File System and Amazon S3 to store processed data and scale out the Amazon EMR cluster as needed.
C. Use Amazon DynamoDB to store processed data and scale out the Amazon EMR cluster as needed.
D. use Amazon Kinesis Data Firehose and, instead of using Amazon EMR, stream logs directly into Amazon Elasticsearch Service.
A company is creating a mobile social application and need logs from all devices on which the program is installed. Amaz
-
answerhappygod
- Site Admin
- Posts: 899604
- Joined: Mon Aug 02, 2021 8:13 am
A company is creating a mobile social application and need logs from all devices on which the program is installed. Amaz
Join a community of subject matter experts. Register for FREE to view solutions, replies, and use search function. Request answer by replying!