Q98 — AWS DEA-C01 Ch.1

A data engineer is building an automated extract, transform, and load (ETL) ingestion pipeline by using AWS Glue. The pipeline ingests compressed files that are in an Amazon S3 bucket. The ingestion pipeline must support incremental data processing. Which AWS Glue feature should the data engineer use to meet this requirement?

Correct Answer: C. Job bookmarks

Explanation

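In this scenario, the data engineer needs an automated ETL ingestion pipeline that supports incremental data processing. The AWS Glue job bookmarks feature lets an ETL job persist state about what it has already processed, so the next run picks up where the previous run stopped instead of starting over. This is useful for continuously growing datasets because it saves time and resources by processing only new or changed data. Option C, "Job bookmarks", is therefore the correct answer.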
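
The bookmark idea can be illustrated with a small, self-contained Python sketch. This is a toy model and not AWS Glue's implementation: `S3Object`, `BookmarkStore`, and `run_incremental` are hypothetical names invented for illustration, and the sketch assumes new or changed files can be detected by a last-modified timestamp alone. In a real Glue job, bookmarks are typically enabled with the job parameter `--job-bookmark-option job-bookmark-enable`, the script calls `job.init(...)` and `job.commit()`, and each source sets a `transformation_ctx`.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class S3Object:
    """Hypothetical stand-in for one entry in an S3 object listing."""
    key: str
    last_modified: int  # epoch seconds


@dataclass
class BookmarkStore:
    """Toy bookmark state: newest timestamp processed, per job name."""
    state: Dict[str, int] = field(default_factory=dict)

    def get(self, job_name: str) -> int:
        # -1 means "no previous run", so every object counts as new.
        return self.state.get(job_name, -1)

    def commit(self, job_name: str, timestamp: int) -> None:
        self.state[job_name] = timestamp


def run_incremental(job_name: str, objects: List[S3Object],
                    store: BookmarkStore) -> List[str]:
    """Process only objects modified after the stored bookmark."""
    last_seen = store.get(job_name)
    pending = sorted(
        (o for o in objects if o.last_modified > last_seen),
        key=lambda o: o.last_modified,
    )
    processed = [o.key for o in pending]  # placeholder for the real transform/load
    if pending:
        # Advance the bookmark only after the batch has been handled.
        store.commit(job_name, pending[-1].last_modified)
    return processed
```

Calling `run_incremental` twice against the same listing returns every key on the first call and an empty list on the second; appending a newer object makes the next call return only that object, which is the behavior the question describes as incremental processing.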