Q47 — AWS DEA-C01 Ch.1

A data engineer needs to maintain a central metadata repository that users access through Amazon EMR and Amazon Athena queries. The repository needs to provide the schema and properties of many tables. Some of the metadata is stored in Apache Hive. The data engineer needs to import the metadata from Hive into the central metadata repository. Which solution will meet these requirements with the LEAST development effort?

Correct Answer: C. Use the AWS Glue Data Catalog.

Explanation

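The AWS Glue Data Catalog is a fully managed metadata store designed to work with AWS services such as Amazon EMR and Amazon Athena. It is compatible with the Apache Hive metastore, so the data engineer can import metadata from the Hive metastore into the AWS Glue Data Catalog without additional development work. Option C requires the least development effort because it relies on the Data Catalog's existing capabilities and its Hive compatibility. The other options either require additional setup (A and D) or are not optimized for this kind of integration (B).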
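
To make the "shared by Amazon EMR and Amazon Athena" part concrete, the following is a minimal sketch of the configuration an EMR cluster is typically given so that Hive and Spark use the AWS Glue Data Catalog as their metastore. The helper name `glue_metastore_configurations` is hypothetical and only for illustration. The classification names and the factory class reflect the Amazon EMR documentation as best understood here and should be checked against the current docs before use. The sketch only builds the configuration; it does not call AWS and does not perform the Hive metadata import itself.

```python
import json

# Client factory class that, per the Amazon EMR documentation, directs Hive
# to the AWS Glue Data Catalog instead of a self-managed Hive metastore.
# Assumption: verify this value against the current EMR release guide.
GLUE_FACTORY = (
    "com.amazonaws.glue.catalog.metastore.AWSGlueDataCatalogHiveClientFactory"
)


def glue_metastore_configurations(include_spark=True):
    """Build an EMR `Configurations` list that selects the Glue Data Catalog.

    Hypothetical helper: it returns plain dictionaries and makes no AWS calls.
    """
    properties = {"hive.metastore.client.factory.class": GLUE_FACTORY}

    # Hive on EMR reads this setting from the hive-site classification.
    configurations = [
        {"Classification": "hive-site", "Properties": dict(properties)}
    ]

    # Spark SQL on EMR reads the same setting from spark-hive-site.
    if include_spark:
        configurations.append(
            {"Classification": "spark-hive-site", "Properties": dict(properties)}
        )

    return configurations


if __name__ == "__main__":
    # Print the JSON that could be supplied as the cluster's configuration.
    print(json.dumps(glue_metastore_configurations(), indent=2))
```

With a cluster configured this way, tables registered in the Data Catalog are visible to Hive and Spark on EMR and to Athena alike, which is why a single catalog can serve as the central repository in this scenario.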