Question

A company built a generative AI solution using large language models (LLMs) to translate training manuals from English into other languages. The company wants to assess the solution’s accuracy by reviewing the generated translated text. Which model evaluation strategy satisfies this requirement?

A. Bilingual Evaluation Understudy (BLEU)

B. Root Mean Square Error (RMSE)

C. Recall-Oriented Understudy for Gisting Evaluation (ROUGE)

D. F1 score

Q79 — AWS AIF-C01 Ch.1

Correct Answer: A. Bilingual Evaluation Understudy (BLEU)

Explanation
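
BLEU scores a machine translation by measuring how many of its n-grams also appear in a human reference translation, which is why it fits a task that reviews generated translated text. The sketch below is a simplified, single-reference, sentence-level version for illustration only. It assumes whitespace tokenization, no smoothing, and uniform weights over 1- to 4-grams. The function names `ngram_counts` and `simple_bleu` are hypothetical and do not come from any particular library or from the exam material.

```python
import math
from collections import Counter


def ngram_counts(tokens, n):
    """Count every contiguous n-gram in a list of tokens."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def simple_bleu(candidate, reference, max_n=4):
    """Simplified sentence-level BLEU against a single reference.

    Assumptions (illustrative only): whitespace tokenization, no smoothing,
    uniform weights across n-gram orders 1..max_n.
    """
    cand = candidate.split()
    ref = reference.split()
    if not cand:
        return 0.0

    log_precisions = []
    for n in range(1, max_n + 1):
        cand_counts = ngram_counts(cand, n)
        ref_counts = ngram_counts(ref, n)
        total = sum(cand_counts.values())
        # Clipped matches: an n-gram is credited at most as many times
        # as it occurs in the reference.
        matched = sum(min(count, ref_counts[gram])
                      for gram, count in cand_counts.items())
        if total == 0 or matched == 0:
            # Without smoothing, one zero precision makes the score zero.
            return 0.0
        log_precisions.append(math.log(matched / total))

    # Brevity penalty: penalize candidates shorter than the reference.
    if len(cand) > len(ref):
        brevity_penalty = 1.0
    else:
        brevity_penalty = math.exp(1 - len(ref) / len(cand))

    # Geometric mean of the n-gram precisions, scaled by the penalty.
    return brevity_penalty * math.exp(sum(log_precisions) / max_n)


if __name__ == "__main__":
    reference = "the cat sat on the mat"
    print(simple_bleu("the cat sat on the mat", reference))
    print(simple_bleu("the cat sat on a mat", reference))
    print(simple_bleu("completely different words here", reference))
```

An exact match scores 1.0, a translation sharing no words with the reference scores 0.0, and a near match falls in between. Production evaluations usually score a whole corpus against one or more references with an established implementation rather than a hand-rolled function like this one.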