A data engineer needs to join data from multiple sources to perform a one-time analysis job. The data is store

Question

A data engineer needs to join data from multiple sources to perform a one-time analysis job. The data is stored in Amazon DynamoDB,
Amazon RDS, Amazon Redshift, and Amazon S3.
Which solution will meet this requirement MOST cost-effectively?

Accepted Answer

C. Use Amazon Athena Federated Query to join the data from all data sources.

Answer

A. Use an Amazon EMR provisioned cluster to read from all sources. Use Apache Spark to join the data and perform the analysis.

Answer

B. Copy the data from DynamoDB, Amazon RDS, and Amazon Redshift into Amazon S3. Run Amazon Athena queries directly on the S3 files.

Answer

D. Use Redshift Spectrum to query data from DynamoDB, Amazon RDS, and Amazon S3 directly from Redshift.

Q21 — AWS DEA-C01 Ch.1

Correct Answer: C. Use Amazon Athena Federated Query to join the data from all data sources.

Explanation