Question 30

Domain 2: Core Machine Learning, AI, and Transformer Foundations

What is Multi-Query Attention (MQA) and how does it differ from standard multi-head attention? (Select TWO)