Q42 — AWS AIF-C01 Ch.2
An AI developer has built a deep learning model to classify material types in images. Now, the developer wants to evaluate the model’s performance. Which metric helps the AI developer evaluate the model’s performance?
- A. Confusion matrix ✓
- B. Correlation matrix
- C. R² score
- D. Mean squared error (MSE)
Correct Answer: A. Confusion matrix
Explanation
To evaluate a classification model, choose a metric that reflects how well it assigns inputs to classes. A confusion matrix tabulates predicted classes against actual classes, so it shows the correct and incorrect classifications for each class and which classes the model confuses with one another. The other options do not fit the task: a correlation matrix describes relationships between variables, while R² score and MSE are regression metrics that measure fit and error for continuous predictions. None of them directly reflects classification performance, so the confusion matrix is the appropriate choice.
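As an illustration of what a confusion matrix contains, here is a minimal sketch in plain Python. The class names (`metal`, `plastic`, `wood`), the sample labels, and the helper functions `confusion_matrix` and `per_class_recall` are hypothetical and chosen only for this example; they are not part of the exam question.

```python
from collections import Counter

def confusion_matrix(y_true, y_pred, labels):
    """Return a matrix where rows are actual classes and columns are predicted classes."""
    counts = Counter(zip(y_true, y_pred))
    return [[counts[(actual, predicted)] for predicted in labels] for actual in labels]

def per_class_recall(matrix):
    """Fraction of each actual class that was predicted correctly (diagonal / row total)."""
    return [row[i] / sum(row) if sum(row) else 0.0 for i, row in enumerate(matrix)]

# Hypothetical labels for an image classifier of material types (illustrative data only).
labels = ["metal", "plastic", "wood"]
y_true = ["metal", "metal", "plastic", "plastic", "wood", "wood", "wood"]
y_pred = ["metal", "plastic", "plastic", "plastic", "wood", "metal", "wood"]

matrix = confusion_matrix(y_true, y_pred, labels)
recall = per_class_recall(matrix)

# Overall accuracy is the sum of the diagonal divided by the number of samples.
accuracy = sum(matrix[i][i] for i in range(len(labels))) / len(y_true)

for label, row in zip(labels, matrix):
    print(label, row)
```

The off-diagonal cells show which classes are confused with each other (here, one `metal` image predicted as `plastic` and one `wood` image predicted as `metal`), which a single accuracy number would hide.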