Question 31

Domain 3: Implement Generative AI Solutions

Your company needs an Azure OpenAI deployment that guarantees maximum of 2-second response times for a high-volume customer-facing application processing 500 requests per minute. Which deployment type should you use?

A. Standard (S1) — pay-per-call B. Global Standard — automatic routing C. Provisioned Throughput Unit (PTU) — reserved capacity D. Serverless API — consumption-based

Previous Next

Question 31

Explanation

Why each option is right or wrong