Q62 — AWS AIF-C01 Ch.2

Question 62 of 100 | ← Chapter 2

A company is developing a customer service chatbot and wants to enhance its response capability by learning from past interactions and online resources.

Correct Answer: B. Reinforcement learning with rewards for positive customer feedback.

Explanation

Reinforcement learning with rewards for positive customer feedback (B) enables the chatbot to iteratively improve responses based on real-world interaction outcomes. By treating user satisfaction as a reward signal, the model learns which responses lead to successful interactions—directly supporting autonomous, experience-driven improvement. While supervised approaches (A, D) rely on static or curated labels, and unsupervised clustering (C) lacks goal-directed learning, reinforcement learning best fulfills the requirement of self-improvement through historical interactions and dynamic feedback.