An company is presently running a big Hadoop system in its data center and is in the process of migrating to AWS through Amazon EMR.
They produce about 20 TB of data each month. Additionally, files must be aggregated and transferred to Amazon S3 on a monthly basis for usage in the Amazon EMR environment. They have a number of S3 buckets spread across different AWS accounts to which data must be transferred. Between their data center and AWS, they have a 10G AWS Direct Connect connection, and the network team has agreed to dedicate 50% of the AWS Direct Connect capacity to data transmission. The data transmission process cannot exceed two days.
What is the MOST EFFECTIVE method for transferring data to AWS on a monthly basis?
A. Use an offline copy method, such as an AWS Snowball device, to copy and transfer data to Amazon S3.
B. Configure a multipart upload for Amazon S3 on AWS Java SDK to transfer data over AWS Direct Connect.
C. Use Amazon S3 transfer acceleration capability to transfer data to Amazon S3 over AWS Direct Connect.
D. Setup S3DistCop tool on the on-premises Hadoop environment to transfer data to Amazon S3 over AWS Direct Connect.
An company is presently running a big Hadoop system in its data center and is in the process of migrating to AWS through
-
answerhappygod
- Site Admin
- Posts: 899604
- Joined: Mon Aug 02, 2021 8:13 am
An company is presently running a big Hadoop system in its data center and is in the process of migrating to AWS through
Join a community of subject matter experts. Register for FREE to view solutions, replies, and use search function. Request answer by replying!