Q95 — AWS DEA-C01 Ch.1
Question 95 of 100 | ← Chapter 1
A company plans to use Amazon Kinesis Data Firehose to store data in Amazon S3. The source data consists of 2 MB .csv files. The company must convert the .csv files to JSON format. The company must store the files in Apache Parquet format. Which solution will meet these requirements with the LEAST development effort?
- A. Use Kinesis Data Firehose to convert the .csv files to JSON. Use an AWS Lambda function to store the files in Parquet format.
- B. Use Kinesis Data Firehose to convert the .csv files to JSON and to store the files in Parquet format. ✓
- C. Use Kinesis Data Firehose to invoke an AWS Lambda function that transforms the .csv files to JSON and stores the files in Parquet format.
- D. Use Kinesis Data Firehose to invoke an AWS Lambda function that transforms the .csv files to JSON. Use Kinesis Data Firehose to store the files in Parquet format.
Correct Answer: B. Use Kinesis Data Firehose to convert the .csv files to JSON and to store the files in Parquet format.
Explanation
答案B是正确的。KinesisDataFirehose本身就支持将数据从.csv格式转换为JSON格式,并将其存储为Parquet格式,无需额外调用Lambda函数进行复杂的处理,从而以最小的开发努力满足需求。选项A中使用Lambda函数存储为Parquet格式增加了开发工作。选项C调用Lambda函数进行转换和存储增加了复杂度和开发工作量。选项D调用Lambda函数转换为JSON后,再用KinesisDataFirehose存储为Parquet格式,不如直接使用KinesisDataFirehose一步完成来得简便,开发工作量相对较大。综上所述,选项B是最优解。