Question 40

Domain 2: Core Machine Learning, AI, and Transformer Foundations

**Reported in multiple exam experiences:** What is the purpose of masked attention in transformer decoders?

A. To reduce computational complexity B. To prevent attention to future tokens during training C. To compress attention weights D. To handle variable sequence lengths

Previous See Results

Question 40

Explanation

Why each option is right or wrong