This post is a part of the AB-731: AI Transformation Leader Exam Prep Hub.
This topic falls under these sections:
Identify benefits, capabilities, and opportunities for Microsoft’s AI apps and services (35–40%)
--> Identify benefits and capabilities of Foundry Tools
--> Match an AI model to a business need
Note that there are 10 practice questions (with answers) at the end of each section to help you solidify your knowledge of the material. Also, there are 4 practice tests with 30 questions each available from the hub's main page below the exam topics section.
Introduction
One of the responsibilities of an AI Transformation Leader is understanding which AI models are most appropriate for specific business scenarios. Leaders do not necessarily build models themselves, but they must be able to align business requirements with the capabilities of available AI models and services.
Within Microsoft Foundry Tools (Azure AI Foundry), organizations can access multiple model families and choose the right model based on cost, speed, accuracy, multimodal capabilities, reasoning requirements, and business objectives.
Why Model Selection Matters
Choosing the wrong AI model can lead to:
- Increased costs
- Poor response quality
- Slow performance
- Hallucinations or inaccuracies
- Limited scalability
- Unsatisfactory user experiences
Choosing the right model helps organizations:
- Improve business outcomes
- Reduce development effort
- Optimize costs
- Increase productivity
- Deliver better customer experiences
Factors to Consider When Selecting an AI Model
AI Transformation Leaders should evaluate:
Business Objective
Determine:
- What problem needs to be solved?
- Who are the users?
- What outcomes are expected?
Examples:
| Objective | Possible Need |
|---|---|
| Customer support | Conversational AI |
| Document summarization | Text generation |
| Product recommendations | Prediction models |
| Image analysis | Vision models |
| Process automation | Agents and workflows |
Accuracy Requirements
Some workloads require:
- High precision
- Strong reasoning
- Low hallucination rates
Examples:
- Legal analysis
- Financial reporting
- Healthcare documentation
These scenarios often benefit from larger and more capable models.
Response Speed
Certain use cases prioritize fast responses.
Examples:
- Chatbots
- Website assistants
- Interactive applications
Smaller models often provide faster responses with lower cost.
Cost Considerations
Larger models generally:
- Cost more
- Consume more compute resources
Smaller models may provide sufficient quality for routine tasks.
Organizations should balance:
- Performance
- Cost
- Business value
Data Types
Different models support different inputs:
| Input Type | Appropriate Model |
|---|---|
| Text | Language models |
| Images | Vision models |
| Audio | Speech models |
| Mixed content | Multimodal models |
Categories of AI Models
Large Language Models (LLMs)
LLMs specialize in:
- Text generation
- Summarization
- Question answering
- Content creation
- Translation
Typical business scenarios:
- Customer service
- Knowledge assistants
- Drafting emails
- Meeting summaries
Examples available through Microsoft Foundry include OpenAI models such as GPT family models.
Reasoning Models
Reasoning models are designed for:
- Complex analysis
- Multi-step thinking
- Data interpretation
- Problem solving
Business scenarios include:
- Strategic planning
- Financial analysis
- Research tasks
- Advanced reporting
These models may trade speed for deeper reasoning capabilities.
Small Language Models (SLMs)
Small language models provide:
- Lower cost
- Faster responses
- Efficient deployment
Best suited for:
- Routine tasks
- Lightweight assistants
- High-volume workloads
Organizations may not always need the largest available model.
Vision Models
Vision models analyze:
- Images
- Documents
- Photographs
- Visual content
Common scenarios:
- Manufacturing quality inspections
- OCR and document processing
- Retail product recognition
- Healthcare imaging support
Azure AI Vision supports many of these capabilities.
Speech Models
Speech models support:
- Speech-to-text
- Text-to-speech
- Translation
Business uses include:
- Call centers
- Accessibility solutions
- Meeting transcription
Embedding Models
Embedding models convert content into vectors for similarity search.
These models are commonly used with:
- Azure AI Search
- Retrieval-Augmented Generation (RAG)
- Knowledge retrieval systems
Business scenarios:
- Enterprise search
- Internal knowledge assistants
- Document retrieval
Multimodal Models
Multimodal models work with:
- Text
- Images
- Documents
Examples include:
- Uploading an image and asking questions about it.
- Analyzing diagrams and generating summaries.
These models are useful when business data exists in multiple formats.
Matching Models to Business Needs
Scenario 1: Employee Knowledge Assistant
Requirement:
- Answer questions from internal documents.
Recommended approach:
- Large language model + Azure AI Search + embeddings.
Reason:
- The model generates responses while search provides grounding.
Scenario 2: Invoice Processing
Requirement:
- Extract information from receipts.
Recommended approach:
- Vision model with OCR capabilities.
Reason:
- Image understanding is more important than text generation.
Scenario 3: High-Volume Chatbot
Requirement:
- Fast and inexpensive customer interactions.
Recommended approach:
- Smaller language model.
Reason:
- Lower latency and reduced cost.
Scenario 4: Strategic Financial Analysis
Requirement:
- Multi-step reasoning and insights.
Recommended approach:
- Advanced reasoning model.
Reason:
- Complex decision-making requires stronger analytical capabilities.
Scenario 5: Product Image Recognition
Requirement:
- Identify products from photographs.
Recommended approach:
- Vision models.
Reason:
- Visual understanding is required.
Scenario 6: Enterprise RAG Solution
Requirement:
- Reduce hallucinations and use organizational knowledge.
Recommended approach:
- LLM + Azure AI Search + embedding model.
Reason:
- Search retrieves data and the LLM generates grounded answers.
Model Selection in Microsoft Foundry
Microsoft Foundry enables organizations to:
Access Multiple Models
Leaders can compare models from:
- Microsoft
- OpenAI
- Third-party providers
Evaluate Performance
Organizations can assess:
- Accuracy
- Relevance
- Groundedness
- Safety
Experiment Before Deployment
Teams can:
- Test prompts
- Compare outputs
- Optimize costs
Scale Solutions
Foundry provides:
- Governance
- Monitoring
- Responsible AI controls
Trade-Offs in Model Selection
| Priority | Preferred Choice |
|---|---|
| Highest reasoning quality | Large reasoning model |
| Lowest cost | Small language model |
| Fast responses | Small language model |
| Image analysis | Vision model |
| Knowledge retrieval | Embedding model + AI Search |
| Multiple content types | Multimodal model |
| Complex document understanding | Large language model |
Common Exam Concepts
Remember:
- No single model is best for every scenario.
- Model selection should align with business requirements.
- Larger models provide greater capability but higher cost.
- Smaller models improve speed and efficiency.
- Vision models process images.
- Embedding models support retrieval and RAG.
- Multimodal models work with multiple data types.
- Microsoft Foundry allows organizations to compare and evaluate models.
Practice Exam Questions
Question 1
A company needs an AI solution that extracts text from scanned receipts and invoices. Which type of model best fits this requirement?
A. Embedding model
B. Speech model
C. Vision model
D. Reasoning model
Answer: C
Explanation
Vision models support OCR and image analysis.
- A is incorrect because embeddings are used for similarity search.
- C is incorrect because speech models process audio.
- D is incorrect because reasoning models focus on complex analysis.
Question 2
Which factor should primarily drive AI model selection?
A. The newest model available
B. Vendor popularity
C. Business requirements and desired outcomes
D. Maximum parameter count
Answer: C
Explanation
Business objectives should determine model selection.
- A and B do not guarantee suitability.
- D focuses only on model size rather than business value.
Question 3
An organization needs a low-cost chatbot that handles thousands of routine customer questions daily. Which option is most appropriate?
A. Image-generation model
B. Vision model
C. Speech model
D. Small language model
Answer: D
Explanation
Small language models provide fast and economical responses.
- B and C process different data types.
- D creates images rather than conversations.
Question 4
Which type of model is commonly used to support Retrieval-Augmented Generation (RAG)?
A. Speech model
B. Video model
C. Image-generation model
D. Embedding model
Answer: D
Explanation
Embedding models convert content into vectors used for retrieval.
- The other model types are unrelated to similarity search.
Question 5
A legal department needs highly accurate analysis of lengthy contracts with complex reasoning. Which model is most appropriate?
A. Lightweight chatbot model
B. Reasoning model
C. Speech model
D. Vision model
Answer: B
Explanation
Reasoning models are optimized for complex, multi-step analysis.
- A prioritizes speed over depth.
- C and D address other modalities.
Question 6
Which statement about larger AI models is true?
A. They always cost less to operate.
B. They eliminate the need for governance.
C. They generally provide greater capability but may increase cost.
D. They are only used for image analysis.
Answer: C
Explanation
Larger models often deliver stronger performance but require more resources.
- A is false because costs usually increase.
- B is false because governance remains essential.
- D is incorrect because large models are used across many workloads.
Question 7
A retailer wants customers to upload photographs and ask questions about products shown in the image. Which model type best supports this requirement?
A. Embedding model
B. Speech model
C. Multimodal model
D. Time-series model
Answer: C
Explanation
Multimodal models can process both images and text together.
- A supports retrieval.
- B processes audio.
- D is unrelated.
Question 8
Which Microsoft platform enables organizations to compare and evaluate multiple AI models?
A. Microsoft Defender for Endpoint
B. Microsoft Foundry
C. Microsoft Intune
D. Microsoft Purview
Answer: B
Explanation
Microsoft Foundry provides model catalogs, evaluations, and experimentation tools.
- The other services address security and governance functions.
Question 9
A company wants an AI assistant that answers employee questions using internal documents while minimizing hallucinations. Which approach is best?
A. Standalone image model
B. Speech model only
C. Large language model without data grounding
D. Large language model combined with Azure AI Search
Answer: D
Explanation
Grounding responses with Azure AI Search improves accuracy and trustworthiness.
- A and B do not address document retrieval.
- C increases the risk of hallucinations.
Question 10
Which model type primarily handles speech-to-text conversion?
A. Speech model
B. Embedding model
C. Vision model
D. Reasoning model
Answer: A
Explanation
Speech models are designed for audio processing.
- Embedding, vision, and reasoning models serve different purposes.
Go to the AB-731 Exam Prep Hub main page
