A customer's machine learning process entails numerous rapid-fire cycles of reads-writes-reads on Amazon S3. The client needs to run the process on the EMR but is worried that future cycles' readings may miss important new data from the previous cycles' machine learning.
How should the consumer go about doing this?
A. Turn on EMRFS consistent view when configuring the EMR cluster.
B. Use AWS Data Pipeline to orchestrate the data processing cycles.
C. Set hadoop.data.consistency = true in the core-site.xml file.
D. Set hadoop.s3.consistency = true in the core-site.xml file.
A customer's machine learning process entails numerous rapid-fire cycles of reads-writes-reads on Amazon S3. The client
-
answerhappygod
- Site Admin
- Posts: 899604
- Joined: Mon Aug 02, 2021 8:13 am
A customer's machine learning process entails numerous rapid-fire cycles of reads-writes-reads on Amazon S3. The client
Join a community of subject matter experts. Register for FREE to view solutions, replies, and use search function. Request answer by replying!