Q30 — AWS DEA-C01 Ch.1
A company is migrating its database servers from Amazon EC2 instances that run Microsoft SQL Server to Amazon RDS for Microsoft SQL Server DB instances. The company's analytics team must export large data elements every day until the migration is complete. The data elements are the result of SQL joins across multiple tables. The data must be in Apache Parquet format. The analytics team must store the data in Amazon S3. Which solution will meet these requirements in the MOST operationally efficient way?
- A. Create a view in the EC2 instance-based SQL Server databases that contains the required data elements. Create an AWS Glue job that selects the data directly from the view and transfers the data in Parquet format to an S3 bucket. Schedule the AWS Glue job to run every day.
- B. Schedule SQL Server Agent to run a daily SQL query that selects the desired data elements from the EC2 instance-based SQL Server databases. Configure the query to direct the output .csv objects to an S3 bucket. Create an S3 event that invokes an AWS Lambda function to transform the output format from .csv to Parquet.
- C. Use a SQL query to create a view in the EC2 instance-based SQL Server databases that contains the required data elements. Create and run an AWS Glue crawler to read the view. Create an AWS Glue job that retrieves the data and transfers the data in Parquet format to an S3 bucket. Schedule the AWS Glue job to run every day. ✓
- D. Create an AWS Lambda function that queries the EC2 instance-based databases by using Java Database Connectivity (JDBC). Configure the Lambda function to retrieve the required data, transform the data into Parquet format, and transfer the data into an S3 bucket. Use Amazon EventBridge to schedule the Lambda function to run every day.
Correct Answer: C. Use a SQL query to create a view in the EC2 instance-based SQL Server databases that contains the required data elements. Create and run an AWS Glue crawler to read the view. Create an AWS Glue job that retrieves the data and transfers the data in Parquet format to an S3 bucket. Schedule the AWS Glue job to run every day.
Explanation
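To meet the requirements most efficiently, the solution must select the data directly from the SQL Server databases on the EC2 instances, convert it to Parquet format, and store it in Amazon S3. Option C meets these requirements:
- It first uses a SQL query to create a view in the EC2 instance-based SQL Server databases that contains the required data elements.
- Next, an AWS Glue crawler reads the view and records its schema as metadata in the AWS Glue Data Catalog.
- Then, an AWS Glue job retrieves the data and transfers it in Parquet format to an S3 bucket.
- Finally, the AWS Glue job is scheduled to run every day, so the required data is exported daily.
This approach removes intermediate steps (such as exporting to CSV first and then converting the format) and lets AWS Glue handle both the transformation and the write to S3, which improves overall operational efficiency. Option C is therefore the most suitable solution.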
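As a rough illustration of how the crawler, job, and daily schedule in option C might be wired together, the sketch below builds the request payloads for the boto3 Glue client calls `create_crawler`, `create_job`, and `create_trigger`. It is a minimal sketch, not a definitive setup: every name in it (the JDBC connection, catalog database, IAM role, view path, script location, and cron time) is a hypothetical placeholder, and it assumes a Glue JDBC connection to the EC2-hosted SQL Server already exists. The job script itself, which would read the catalogued view and write Parquet to S3, is not shown.

```python
"""Sketch: Glue crawler, export job, and daily trigger for option C."""

# All values below are hypothetical placeholders, not taken from the question.
CONNECTION = "sqlserver-ec2-jdbc"            # existing Glue JDBC connection (assumed)
CATALOG_DB = "analytics_catalog"             # Glue Data Catalog database
ROLE_ARN = "arn:aws:iam::123456789012:role/GlueExportRole"
VIEW_PATH = "salesdb/dbo/vw_daily_export"    # <database>/<schema>/<view>
SCRIPT_S3 = "s3://example-bucket/scripts/export_view_to_parquet.py"


def crawler_request():
    """Payload for glue.create_crawler: catalog the view over JDBC."""
    return {
        "Name": "daily-export-view-crawler",
        "Role": ROLE_ARN,
        "DatabaseName": CATALOG_DB,
        "Targets": {
            "JdbcTargets": [{"ConnectionName": CONNECTION, "Path": VIEW_PATH}]
        },
    }


def job_request():
    """Payload for glue.create_job: a Spark ETL job that uses the connection."""
    return {
        "Name": "daily-export-to-parquet",
        "Role": ROLE_ARN,
        "Command": {
            "Name": "glueetl",
            "ScriptLocation": SCRIPT_S3,
            "PythonVersion": "3",
        },
        "Connections": {"Connections": [CONNECTION]},
        "GlueVersion": "4.0",
    }


def trigger_request(job_name):
    """Payload for glue.create_trigger: run the job once a day at 02:00 UTC."""
    return {
        "Name": "daily-export-trigger",
        "Type": "SCHEDULED",
        "Schedule": "cron(0 2 * * ? *)",
        "Actions": [{"JobName": job_name}],
        "StartOnCreation": True,
    }


def deploy(glue):
    """Create the crawler, run it once, then create the job and its schedule.

    `glue` is expected to be a boto3 Glue client, e.g. boto3.client("glue").
    """
    crawler = crawler_request()
    glue.create_crawler(**crawler)
    glue.start_crawler(Name=crawler["Name"])
    job = job_request()
    glue.create_job(**job)
    glue.create_trigger(**trigger_request(job["Name"]))
```

Keeping the payloads in plain functions makes them easy to inspect or unit test before anything is created in an account; calling `deploy(boto3.client("glue"))` would then issue the four API calls in order.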