A customer's machine learning process entails numerous rapid-fire cycles of reads-writes-reads on Amazon S3. The client

Post by answerhappygod »

A customer's machine learning process involves many rapid read-write-read cycles against Amazon S3. The customer needs to run the process on Amazon EMR but is concerned that reads in later cycles may miss new data written by earlier cycles.

How should the customer accomplish this?

A. Turn on EMRFS consistent view when configuring the EMR cluster.
B. Use AWS Data Pipeline to orchestrate the data processing cycles.
C. Set hadoop.data.consistency = true in the core-site.xml file.
D. Set hadoop.s3.consistency = true in the core-site.xml file.
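
For context on option A: EMRFS consistent view is a cluster-level setting, usually supplied through the `emrfs-site` configuration classification when the cluster is created. Below is a minimal sketch of what that configuration could look like, assuming it is built in Python and later passed as the `Configurations` argument to boto3's `run_job_flow`. The helper name `emrfs_consistent_view_config` is hypothetical and exists only for illustration; no AWS call is made here.

```python
import json


def emrfs_consistent_view_config(enabled: bool = True) -> list:
    """Build an EMR configuration list that toggles EMRFS consistent view.

    Hypothetical helper for illustration; it only returns the structure
    that EMR expects for the `emrfs-site` classification.
    """
    return [
        {
            "Classification": "emrfs-site",
            "Properties": {
                # EMR expects property values as strings, not booleans.
                "fs.s3.consistent": "true" if enabled else "false",
            },
        }
    ]


if __name__ == "__main__":
    # Print the JSON form, which could also be saved to a configurations file.
    print(json.dumps(emrfs_consistent_view_config(), indent=2))
```

The same list could be saved as JSON and supplied at cluster creation. Note that the property names shown in options C and D do not appear in this sketch.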