Q17 — AWS SAA-C03 Ch.4
Q212. A hospital recently deployed a RESTful API with Amazon API Gateway and AWS Lambda. The hospital uses API Gateway and Lambda to upload reports that are in PDF format and JPEG format. The hospital needs to modify the Lambda code to identify protected health information (PHI) in the reports. Which solution will meet these requirements with the LEAST operational overhead?
- A. Use existing Python libraries to extract the text from the reports and to identify the PHI from the extracted text.
- B. Use Amazon Textract to extract the text from the reports. Use Amazon SageMaker to identify the PHI from the extracted text.
- C. Use Amazon Textract to extract the text from the reports. Use Amazon Comprehend Medical to identify the PHI from the extracted text. ✓
- D. Use Amazon Rekognition to extract the text from the reports. Use Amazon Comprehend Medical to identify the PHI from the extracted text.
Correct Answer: C. Use Amazon Textract to extract the text from the reports. Use Amazon Comprehend Medical to identify the PHI from the extracted text.
Explanation
Option C is correct. Amazon Textract extracts text from scanned documents, including PDF and JPEG files, and Amazon Comprehend Medical uses machine learning to identify medical information, including PHI, in that text. Both are managed services that the existing Lambda function can call through the AWS SDK, so the hospital does not need to train models, manage infrastructure, or maintain custom extraction code. That is what keeps the operational overhead low.

Option A uses existing Python libraries to extract the text and identify the PHI. This can work, but the hospital would have to package, tune, and maintain both the text-extraction and the PHI-detection logic itself, which is more overhead than calling managed services.

Option B uses Amazon Textract for extraction and Amazon SageMaker for PHI identification. This can also work, but SageMaker requires the hospital to build or select a model and then deploy and operate it. Amazon Comprehend Medical already provides PHI detection as a managed API designed for medical text.

Option D uses Amazon Rekognition to extract the text. Rekognition can detect text in images, but it is not designed for document analysis, and its image operations accept JPEG and PNG input rather than PDF. It is therefore a poor fit for extracting the text of the reports, and any PHI detection that follows depends on that extraction being accurate.
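
The approach in option C can be sketched as a small Lambda handler. This is a minimal illustration, not a production implementation. It assumes the API Gateway proxy integration delivers the report as a base64-encoded body, that the function's IAM role allows `textract:DetectDocumentText` and `comprehendmedical:DetectPHI`, and that each report is a single page (multi-page PDFs would need Textract's asynchronous API with the file stored in Amazon S3). The helper names `extract_text` and `find_phi` and the `min_score` threshold are hypothetical choices made for this example.

```python
import base64
import json
from typing import Any, Dict, List


def extract_text(textract: Any, document_bytes: bytes) -> str:
    """Return the text lines that Amazon Textract detects in one document."""
    response = textract.detect_document_text(Document={"Bytes": document_bytes})
    # Textract returns PAGE, LINE, and WORD blocks; LINE blocks are enough here.
    lines = [
        block["Text"]
        for block in response.get("Blocks", [])
        if block.get("BlockType") == "LINE"
    ]
    return "\n".join(lines)


def find_phi(
    comprehend_medical: Any, text: str, min_score: float = 0.5
) -> List[Dict[str, Any]]:
    """Return PHI entities found by Amazon Comprehend Medical.

    min_score is an illustrative cutoff (an assumption), not a recommended value.
    """
    if not text.strip():
        return []  # nothing to analyze, so skip the API call
    response = comprehend_medical.detect_phi(Text=text)
    return [
        {"type": entity["Type"], "text": entity["Text"], "score": entity["Score"]}
        for entity in response.get("Entities", [])
        if entity["Score"] >= min_score
    ]


def lambda_handler(event, context, textract=None, comprehend_medical=None):
    """Handle an API Gateway proxy event whose body is the uploaded report."""
    if textract is None or comprehend_medical is None:
        # Imported lazily so the helpers above can be exercised without AWS access.
        import boto3

        textract = textract or boto3.client("textract")
        comprehend_medical = comprehend_medical or boto3.client("comprehendmedical")

    body = event.get("body") or ""
    if event.get("isBase64Encoded"):
        document_bytes = base64.b64decode(body)
    else:
        document_bytes = body.encode("utf-8")

    text = extract_text(textract, document_bytes)
    entities = find_phi(comprehend_medical, text)
    return {"statusCode": 200, "body": json.dumps({"phi_entities": entities})}
```

Passing the clients in as optional arguments is a design choice for this sketch: Lambda still invokes the handler with only `event` and `context`, while a local test can supply stand-in clients. Long reports may also need to be split before the `detect_phi` call, because Comprehend Medical limits the size of the text in a single request.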