A data engineer at a manufacturing firm is developing a platform for analyzing huge amounts of unstructured data. The data engineer must fill an Amazon Redshift star schema with well-structured data.
Which architectural approach is the most efficient for this purpose?
A. Transform the unstructured data using Amazon EMR and generate CSV data. COPY the CSV data into the analysis schema within Redshift.
B. Load the unstructured data into Redshift, and use string parsing functions to extract structured data for inserting into the analysis schema.
C. When the data is saved to Amazon S3, use S3 Event Notifications and AWS Lambda to transform the file contents. Insert the data into the analysis schema on Redshift.
D. Normalize the data using an AWS Marketplace ETL tool, persist the results to Amazon S3, and use AWS Lambda to INSERT the data into Redshift.
A data engineer at a manufacturing firm is developing a platform for analyzing huge amounts of unstructured data. The da
-
answerhappygod
- Site Admin
- Posts: 899604
- Joined: Mon Aug 02, 2021 8:13 am
A data engineer at a manufacturing firm is developing a platform for analyzing huge amounts of unstructured data. The da
Join a community of subject matter experts. Register for FREE to view solutions, replies, and use search function. Request answer by replying!