Q43 — AWS SAA-C03 Ch.3
Question 43 of 65 | ← Chapter 3
Q173. A company is preparing a new data platform that will ingest real-time streaming data from multiple sources. The company needs to transform the data before writing the data to Amazon S3. The company needs the ability to use SQL to query the transformed data.Which solutions will meet these requirements? (Select TWO.)
- A. Use Amazon Kinesis Data Streams to stream the data. Use Amazon Kinesis Data Analytics to transform the data. Use Amazon Kinesis Data Firehose to write the data to Amazon S3. Use Amazon Athena to query the transformed data from Amazon S3 ✓
- B. Use Amazon Managed Streaming for Apache Kafka (Amazon MSK) to stream the data. Use AWS Glue to transform the data and to write the data to Amazon S3. Use Amazon Athena to query the transformed data from Amazon S3 ✓
- C. Use AWS Database Migration Service (AWS DMS) to ingest the data. Use Amazon EMR to transform the data and to write the data to Amazon S3.Use Amazon Athena to query the transformed data from Amazon S3
- D. Use Amazon Managed Streaming for Apache Kafka (Amazon MSK) to stream the data. Use Amazon Kinesis Data Analytics to transform the data and towrite the data to Amazon S3. Use the Amazon RDS query editor to query the transformed data from Amazon S3
- E. Use Amazon Kinesis Data Streams to stream the data. Use AWS Glue to transform the data. Use Amazon Kinesis Data Firehose to write the data toAmazon S3. Use the Amazon RDS query editor to query the transformed data from Amazon S3
Correct Answer: A. Use Amazon Kinesis Data Streams to stream the data. Use Amazon Kinesis Data Analytics to transform the data. Use Amazon Kinesis Data Firehose to write the data to Amazon S3. Use Amazon Athena to query the transformed data from Amazon S3, B. Use Amazon Managed Streaming for Apache Kafka (Amazon MSK) to stream the data. Use AWS Glue to transform the data and to write the data to Amazon S3. Use Amazon Athena to query the transformed data from Amazon S3
Explanation
To meet the requirement of ingesting real-time streaming data from multiple sources, transforming the data before writing it to Amazon S3, and querying the transformed data with SQL, a solutions architect should use Amazon Kinesis Data Streams or Amazon Managed Streaming for Apache Kafka (Amazon MSK) to stream the data. AWS Glue or Amazon Kinesis Data Analytics can be used to transform the data, and then Amazon Kinesis Data Firehose can be used to write the data to Amazon S3. Finally, Amazon Athena can be used to query the transformed data from Amazon S3. Therefore, options A and B are correct.Option C suggests using AWS Database Migration Service (AWS DMS) to ingest the data, which is not designed for real-time streaming data ingestion. Amazon EMR is suggested for transforming the data and writing it to Amazon S3, but this approach is more complex than necessary and may not provide the same level of scalability and cost-effectiveness as using AWS Glue or Amazon Kinesis Data Analytics.Option D suggests using Amazon Kinesis Data Analytics to transform the data and Amazon RDS query editor to query the transformed data from Amazon S3. While this approach could work, using Amazon RDS would add an unnecessary layer of complexity to the solution and may not provide the same level of scalability and cost-effectiveness as using Amazon Athena.Option E suggests using AWS Glue to transform the data and the Amazon RDS query editor to query the transformed data from Amazon S3. Again, using Amazon RDS would add an unnecessary layer of complexity to the solution and may not provide the same level of scalability and cost-effectiveness as using Amazon Athena.Using Amazon Kinesis Data Streams or Amazon MSK provides a highly scalable and durable solution for ingesting real-time streaming data. AWS Glue or Amazon Kinesis Data Analytics can be used to transform the data before writing it to Amazon S3 with Amazon Kinesis Data Firehose.