Q95 — AWS AIF-C01 Ch.1
Question 95 of 100 | ← Chapter 1
An AI practitioner wants to use a foundation model (FM) to design a search application. The search application must support queries containing both text and images. Which type of feature matching (FM) should the AI practitioner use to support this search application?
- A. Multimodal embedding model ✓
- B. Text embedding model
- C. Multimodal generative model
- D. Image generation model
Correct Answer: A. Multimodal embedding model
Explanation
To handle queries containing both text and images, a model capable of processing both modalities simultaneously is required. Multimodal embedding models are specifically designed for such tasks—they encode both text and images into numerical vectors or tensors and use neural networks to train parameters for prediction and matching across modalities. Thus, multimodal embedding models are the ideal choice for feature matching in this search application.