Q82 — AWS AIF-C01 Ch.2

Question 82 of 100 | ← Chapter 2

A company needs to identify a generative AI model capable of interpreting image content. Which type of model satisfies these requirements?

A. Large Language Model (LLM)
B. Diffusion Model
C. Multimodal Model ✓
D. Natural Language Processing (NLP) Model

Correct Answer: C. Multimodal Model

Explanation

This question tests understanding of AI model capabilities. Multimodal models are specifically designed to process and integrate multiple data types — such as images and text — enabling them to interpret visual content and generate corresponding textual descriptions. Large Language Models (LLMs) operate exclusively on text sequences and lack native image understanding. Diffusion models excel at generating images from text prompts but are not inherently designed for image interpretation. NLP models focus solely on linguistic tasks. Therefore, only multimodal models satisfy the requirement to interpret image content.