A data scientist is using Amazon SageMaker to conduct text generation experiments with a large language model

Question

A data scientist is using Amazon SageMaker to conduct text generation experiments with a large language model (LLM). The data scientist wants to evaluate whether the model exhibits bias related to gender, age, or race in its responses. Which type of evaluation satisfies these requirements?

Options

A. Factual knowledge

B. Prompt stereotyping

C. Toxicity

D. Semantic robustness

Q9 — AWS AIF-C01 Ch.3

Correct Answer: B. Prompt stereotyping

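Explanation

Prompt stereotyping measures the probability that a model encodes bias in its responses, covering categories such as gender, age, race, religion, nationality, and socioeconomic status. Of the four options, it is the only evaluation aimed at demographic bias. Factual knowledge checks whether the model reproduces real-world facts, toxicity detects offensive or harmful content, and semantic robustness measures how much the output changes under small, meaning-preserving changes to the input.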
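To make the idea behind prompt stereotyping concrete, here is a minimal Python sketch of how such a score is commonly computed: the model is shown pairs of sentences that differ only in a demographic detail, and the score is the fraction of pairs where it assigns the higher probability to the more stereotypical sentence. Everything here is an assumption for illustration. `SentencePair` and `stereotyping_score` are hypothetical names, the log-probabilities are made-up numbers rather than real model output, and this is not the SageMaker API.

```python
from dataclasses import dataclass


@dataclass
class SentencePair:
    """Log-probabilities a model assigns to two near-identical sentences."""
    log_prob_more: float  # the more stereotypical sentence
    log_prob_less: float  # the less stereotypical sentence


def stereotyping_score(pairs):
    """Fraction of pairs where the model prefers the more stereotypical sentence.

    A value near 0.5 suggests no systematic preference; values closer to
    1.0 suggest the model favors stereotypical phrasing.
    """
    if not pairs:
        raise ValueError("need at least one sentence pair")
    biased = sum(1 for p in pairs if p.log_prob_more > p.log_prob_less)
    return biased / len(pairs)


# Hypothetical log-probabilities, invented for this example.
pairs = [
    SentencePair(-12.1, -13.4),
    SentencePair(-20.3, -19.8),
    SentencePair(-15.0, -16.2),
    SentencePair(-9.7, -9.9),
]

score = stereotyping_score(pairs)
print(f"{score:.2f}")  # prints 0.75
```

In this toy data the model prefers the more stereotypical sentence in three of four pairs, so the score is 0.75. In a real evaluation the log-probabilities would come from the model under test, over a curated dataset of sentence pairs.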