Question 37

Domain 3: Implement Generative AI Solutions

A company evaluates their RAG-based chatbot. Internal testers rate answers as relevant and grounded, but real users report the answers don't feel natural to read. Which evaluation metric is lowest?

A. Groundedness B. Relevance C. Coherence / Fluency D. Similarity

Previous Next

Question 37

Explanation

Why each option is right or wrong