Q40 — AWS DEA-C01 Ch.1
Question 40 of 100
A data engineer must orchestrate a data pipeline that consists of one AWS Lambda function and one AWS Glue job. The solution must integrate with AWS services. Which solution will meet these requirements with the LEAST management overhead?
- A. Use an AWS Step Functions workflow that includes a state machine. Configure the state machine to run the Lambda function and then the AWS Glue job. ✓
- B. Use an Apache Airflow workflow that is deployed on an Amazon EC2 instance. Define a directed acyclic graph (DAG) in which the first task is to call the Lambda function and the second task is to call the AWS Glue job.
- C. Use an AWS Glue workflow to run the Lambda function and then the AWS Glue job.
- D. Use an Apache Airflow workflow that is deployed on Amazon Elastic Kubernetes Service (Amazon EKS). Define a directed acyclic graph (DAG) in which the first task is to call the Lambda function and the second task is to call the AWS Glue job.
Correct Answer: A. Use an AWS Step Functions workflow that includes a state machine. Configure the state machine to run the Lambda function and then the AWS Glue job.
Explanation
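Option A is correct because AWS Step Functions is a serverless service built for coordinating and managing workflows. It integrates natively with AWS services and carries little management overhead. A state machine can run the AWS Lambda function and then the AWS Glue job in sequence, with no infrastructure to deploy or maintain. Options B and D both require additional infrastructure management and configuration: B runs Apache Airflow on an Amazon EC2 instance, and D runs it on Amazon EKS. Option C relies on an AWS Glue workflow alone, which may not meet the requirements as flexibly or as fully as Step Functions; Glue workflows orchestrate Glue crawlers, jobs, and triggers, and do not invoke Lambda functions directly. Overall, option A meets the requirements with the least management overhead.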
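To make option A concrete, here is a minimal sketch of a state machine definition in Amazon States Language, built as a Python dictionary. The function name and job name are hypothetical placeholders, since the question does not name them. The sketch assumes the Step Functions optimized integrations for Lambda (`lambda:invoke`) and AWS Glue (`glue:startJobRun.sync`, which waits for the job run to finish).

```python
import json

# Hypothetical resource names; the question does not specify them.
LAMBDA_FUNCTION_NAME = "example-preprocess-function"
GLUE_JOB_NAME = "example-transform-job"


def build_state_machine_definition(function_name: str, job_name: str) -> dict:
    """Return an Amazon States Language definition: Lambda first, then Glue."""
    return {
        "Comment": "Run a Lambda function, then an AWS Glue job",
        "StartAt": "InvokeLambda",
        "States": {
            "InvokeLambda": {
                "Type": "Task",
                # Optimized service integration for invoking a Lambda function.
                "Resource": "arn:aws:states:::lambda:invoke",
                "Parameters": {
                    "FunctionName": function_name,
                    # Pass the state input through as the Lambda payload.
                    "Payload.$": "$",
                },
                "Next": "RunGlueJob",
            },
            "RunGlueJob": {
                "Type": "Task",
                # The .sync suffix makes the state wait for the job run to complete.
                "Resource": "arn:aws:states:::glue:startJobRun.sync",
                "Parameters": {"JobName": job_name},
                "End": True,
            },
        },
    }


definition = build_state_machine_definition(LAMBDA_FUNCTION_NAME, GLUE_JOB_NAME)
definition_json = json.dumps(definition, indent=2)
print(definition_json)
```

The resulting JSON string could then be supplied as the `definition` when creating the state machine, for example through the console or boto3's `create_state_machine`, together with an IAM role that is allowed to invoke the function and start the job.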