Question 33
Domain 1: Plan and Manage an Azure AI SolutionA pharmaceutical company processes drug trial documentation, lab reports, audio recordings from team meetings, and medical images as part of their research workflow. They need a unified pipeline that extracts entities, summaries, and tables across all these modalities. Which service provides native multi-modal document processing?
Correct answer: C
Explanation
Azure Content Understanding in Foundry Tools is built for native multi-modal document processing, so it can handle text, audio, images, and structured content in one pipeline. It supports extracting entities, summaries, and tables from varied inputs, matching the need for a unified workflow across trial documents, lab reports, recordings, and medical images.
Why each option is right or wrong
A. Azure AI Search with multiple skillsets
B. Azure Document Intelligence with composed models
C. Azure Content Understanding in Foundry Tools
Azure Content Understanding in Foundry Tools is the only option designed for native multi-modal ingestion, so it can process text, audio, images, and structured content through one pipeline rather than requiring separate OCR, speech, and document services. In this scenario, that matters because the workflow spans trial documents, lab reports, meeting recordings, and medical images, and the service can return entities, summaries, and tables from those mixed inputs in a single pass.
D. Azure OpenAI with GPT-4o vision