Question 37

Domain 3: Implement Generative AI Solutions

A company evaluates its RAG-based chatbot. Internal testers rate the answers as relevant and grounded, but real users report that the answers don't feel natural to read. Which evaluation metric is most likely to score lowest?