Author: thedatacommunity

Choose appropriate deployment options (AI-103 Exam Prep)

This post is a part of the AI-103: Develop AI Apps and Agents on Azure Exam Prep Hub. 
This topic falls under these sections:
Plan and manage an Azure AI solution (25–30%)
--> Set up AI solutions in Foundry
--> Choose appropriate deployment options


Note that there are 10 practice questions (with answers and explanations) at the end of each section to help you solidify your knowledge of the material. Also, there are 2 practice tests with 60 questions each available from the hub's main page below the exam topics section.

Introduction

One of the most important responsibilities for Azure AI developers is selecting the correct deployment option for AI applications and agent-based solutions.

Modern AI systems can be deployed in many different ways depending on:

  • Scalability requirements
  • Cost constraints
  • Security requirements
  • Latency expectations
  • Geographic distribution
  • Operational complexity
  • AI workload patterns
  • Enterprise governance needs

The AI-103: Develop AI Apps and Agents on Azure certification exam tests your understanding of how to choose appropriate deployment options for:

  • Generative AI applications
  • AI agents
  • APIs
  • RAG systems
  • Vector search solutions
  • Multimodal applications
  • Enterprise AI systems

For the AI-103 exam, you should understand:

  • Azure deployment models
  • Hosting options
  • Serverless deployments
  • Containerized deployments
  • Kubernetes deployments
  • Regional deployments
  • High availability strategies
  • Scaling approaches
  • CI/CD deployment pipelines
  • Model deployment considerations
  • Infrastructure tradeoffs

What Is a Deployment Option?

A deployment option refers to the method used to host and run an AI application or service.

Deployment choices affect:

  • Performance
  • Reliability
  • Cost
  • Security
  • Scalability
  • Maintainability

Choosing the wrong deployment strategy can:

  • Increase costs
  • Reduce performance
  • Complicate maintenance
  • Create scaling problems

Common Azure AI Deployment Components

AI solutions commonly include:

  • AI models
  • APIs
  • Search systems
  • Databases
  • Agent orchestration
  • Storage systems
  • Monitoring tools
  • Security services

Each component may use different deployment approaches.


Azure OpenAI Deployment Options

Azure OpenAI allows developers to deploy:

  • GPT models
  • Embedding models
  • Multimodal models
  • Fine-tuned models

Deployment considerations include:

  • Region availability
  • Throughput requirements
  • Latency requirements
  • Cost optimization
  • Capacity planning

Standard Deployments

What Are Standard Deployments?

Standard deployments provide shared model hosting infrastructure.

Advantages:

  • Lower operational complexity
  • Managed infrastructure
  • Easier setup

Disadvantages:

  • Shared capacity
  • Potential throughput limitations

Provisioned Throughput Deployments

What Is Provisioned Throughput?

Provisioned throughput reserves dedicated processing capacity.

Advantages:

  • Predictable performance
  • Dedicated throughput
  • Lower latency consistency

Disadvantages:

  • Higher cost
  • Capacity planning required

When to Use Provisioned Throughput

Use provisioned throughput when:

  • Workloads are high volume
  • Predictable latency is critical
  • Enterprise SLAs are required
  • Large-scale copilots are deployed

Serverless Deployment Options

What Is Serverless?

Serverless computing automatically manages infrastructure.

Developers focus on code instead of servers.


Azure Functions

Azure Functions provides event-driven serverless compute.

Common AI use cases:

  • Tool calling
  • Workflow execution
  • API processing
  • Lightweight orchestration
  • Event-triggered AI actions

Advantages of Azure Functions

  • Automatic scaling
  • Pay-per-use pricing
  • Rapid deployment
  • Minimal infrastructure management

Limitations of Azure Functions

  • Execution duration limits
  • Cold starts
  • Less suitable for large persistent workloads

When to Use Azure Functions

Use Azure Functions when:

  • Workloads are event-driven
  • Execution is lightweight
  • Cost optimization is important
  • Rapid scaling is required

Azure Container Apps

Azure Container Apps provides serverless container hosting.

Useful for:

  • AI middleware
  • APIs
  • Agent orchestration
  • Background workers
  • Lightweight microservices

Advantages of Container Apps

  • Simplified container deployment
  • Autoscaling support
  • Event-driven scaling
  • Lower operational overhead than Kubernetes

Kubernetes Deployments

Azure Kubernetes Service (AKS)

AKS provides enterprise-grade container orchestration.

Common AI uses:

  • Multi-agent systems
  • Large-scale AI platforms
  • Distributed AI services
  • Complex orchestration
  • High-volume APIs

Advantages of AKS

  • High scalability
  • Advanced orchestration
  • Fine-grained control
  • Container portability
  • Enterprise-grade deployments

Limitations of AKS

  • Higher operational complexity
  • More infrastructure management
  • Requires Kubernetes expertise

When to Use AKS

Use AKS when:

  • Large-scale deployments exist
  • Multiple microservices interact
  • High traffic is expected
  • Advanced orchestration is needed

Platform-as-a-Service (PaaS) Deployments

Azure App Service

Azure App Service hosts:

  • Web apps
  • APIs
  • AI front ends
  • Lightweight enterprise applications

Advantages of App Service

  • Managed platform
  • Easy deployment
  • Autoscaling support
  • Simplified maintenance

When to Use App Service

Use App Service when:

  • Hosting AI web applications
  • Managing APIs
  • Rapid development is needed
  • Full Kubernetes orchestration is unnecessary

Edge and Hybrid Deployments

Some AI workloads require local or hybrid deployments.

Reasons include:

  • Low latency
  • Regulatory requirements
  • Limited connectivity
  • On-premises data processing

Azure Arc

Azure Arc extends Azure management to:

  • On-premises systems
  • Multi-cloud environments
  • Edge deployments

Useful for hybrid AI environments.


Deployment Considerations for AI Agents

AI agents often require multiple deployment layers.

Examples include:

  • LLM hosting
  • Retrieval systems
  • Tool execution services
  • Workflow orchestration
  • Persistent memory systems

Multi-Service Architectures

AI agents commonly use:

  • Azure OpenAI
  • Azure AI Search
  • Azure Functions
  • Cosmos DB
  • APIs
  • Orchestration workflows

Different components may use different deployment options.


Geographic Deployment Considerations

AI systems may require global deployment strategies.


Regional Deployments

Deploying resources in a specific region helps:

  • Reduce latency
  • Meet compliance requirements
  • Improve user experience

Multi-Region Deployments

Multi-region deployments improve:

  • Availability
  • Disaster recovery
  • Global performance

Availability Zones

Availability Zones provide redundancy across isolated datacenters.

Benefits include:

  • Higher uptime
  • Fault tolerance
  • Improved resilience

High Availability Design

Enterprise AI applications often require:

  • Redundant infrastructure
  • Automatic failover
  • Load balancing
  • Disaster recovery

Load Balancing

Azure Load Balancer and Azure Application Gateway distribute traffic across services.

Benefits:

  • Scalability
  • High availability
  • Traffic optimization

Autoscaling

Autoscaling dynamically adjusts infrastructure based on demand.

Supported by:

  • AKS
  • Azure Functions
  • App Service
  • Container Apps

Deployment Security Considerations

Security is a major AI-103 exam topic.


Microsoft Entra ID

Microsoft Entra ID supports:

  • Authentication
  • Authorization
  • Identity management
  • RBAC

Azure Key Vault

Azure Key Vault securely stores:

  • Secrets
  • API keys
  • Certificates
  • Connection strings

Private Endpoints

Private Endpoints provide secure private connectivity between Azure services.

Useful for:

  • Enterprise AI systems
  • Sensitive data workloads
  • Compliance-driven deployments

CI/CD for AI Deployments

What Is CI/CD?

CI/CD stands for:

  • Continuous Integration
  • Continuous Deployment

CI/CD automates:

  • Testing
  • Deployment
  • Validation
  • Release management

Azure DevOps

Azure DevOps supports:

  • Build pipelines
  • Release pipelines
  • Source control
  • Automated deployments

GitHub Actions

GitHub Actions supports:

  • Workflow automation
  • CI/CD pipelines
  • Deployment automation

Commonly used for AI application deployments.


Blue-Green Deployments

Blue-green deployments reduce downtime during releases.

How it works:

  • One environment remains active
  • A second environment receives updates
  • Traffic shifts after validation

Benefits:

  • Safer releases
  • Reduced downtime
  • Easier rollback

Canary Deployments

Canary deployments release updates gradually to a small percentage of users.

Benefits:

  • Reduced deployment risk
  • Easier issue detection
  • Safer experimentation

Monitoring Deployment Health

AI deployments should monitor:

  • Latency
  • Throughput
  • Token usage
  • Errors
  • Model failures
  • Tool call failures
  • Retrieval quality

Azure Monitor

Azure Monitor provides:

  • Metrics
  • Logging
  • Alerts
  • Diagnostics

Application Insights

Application Insights supports:

  • Telemetry
  • Request tracing
  • Dependency tracking
  • Error diagnostics

Cost Optimization Considerations

AI deployments can become expensive.


Common Cost Drivers

  • Token consumption
  • GPU usage
  • High-scale orchestration
  • Search indexing
  • Storage
  • Data transfer

Cost Optimization Strategies

Use Smaller Models When Appropriate

Smaller models reduce:

  • Compute costs
  • Token usage
  • Latency

Use Serverless When Appropriate

Serverless deployments reduce idle infrastructure costs.


Use Autoscaling

Autoscaling prevents overprovisioning.


Common AI-103 Deployment Scenarios

Scenario 1: Enterprise AI Chatbot

Requirements:

  • High availability
  • Secure authentication
  • Enterprise search

Recommended Deployment:

  • Azure OpenAI
  • App Service
  • Azure AI Search
  • Entra ID

Scenario 2: Large-Scale AI Agent Platform

Requirements:

  • Multiple AI agents
  • Heavy orchestration
  • High concurrency

Recommended Deployment:

  • AKS
  • Azure Functions
  • Cosmos DB
  • Prompt Flow

Scenario 3: Lightweight AI API

Requirements:

  • Rapid deployment
  • Cost optimization
  • Moderate scale

Recommended Deployment:

  • Azure Functions
  • Container Apps

Scenario 4: Global AI Application

Requirements:

  • Global users
  • Low latency
  • Disaster recovery

Recommended Deployment:

  • Multi-region deployment
  • Availability Zones
  • Load balancing

Common AI-103 Exam Tips

Understand Deployment Tradeoffs

Know when to use:

  • App Service vs AKS
  • Functions vs Containers
  • Standard vs Provisioned Throughput

Know High Availability Concepts

Understand:

  • Availability Zones
  • Multi-region deployments
  • Load balancing
  • Failover strategies

Learn Security Best Practices

Know how to use:

  • Entra ID
  • RBAC
  • Key Vault
  • Private Endpoints

Understand Agent Deployment Needs

AI agents commonly require:

  • Tool orchestration
  • Retrieval systems
  • Persistent memory
  • API integrations

Summary

Choosing the correct deployment option is critical for successful AI applications and agent-based systems.

For the AI-103 exam, you should understand:

  • Azure deployment models
  • Serverless deployment options
  • Kubernetes deployments
  • PaaS hosting options
  • Multi-region architectures
  • High availability design
  • Security considerations
  • CI/CD pipelines
  • Scaling strategies
  • AI deployment tradeoffs

Strong deployment architecture skills help ensure AI systems are:

  • Reliable
  • Scalable
  • Secure
  • Cost-effective
  • Maintainable

Practice Exam Questions

Question 1

Which Azure service is BEST suited for enterprise-scale container orchestration for AI applications?

A. Azure App Service
B. Azure Kubernetes Service (AKS)
C. Azure DNS
D. Azure Backup

Answer

B. Azure Kubernetes Service (AKS)

Explanation

AKS provides enterprise-grade container orchestration and scalability.


Question 2

Which deployment option provides dedicated throughput capacity for Azure OpenAI models?

A. Shared deployment
B. Provisioned throughput deployment
C. Consumption deployment
D. Basic deployment

Answer

B. Provisioned throughput deployment

Explanation

Provisioned throughput reserves dedicated model processing capacity.


Question 3

Which Azure service is MOST appropriate for lightweight event-driven AI workflows?

A. Azure Functions
B. Azure Firewall
C. Azure Backup
D. Azure CDN

Answer

A. Azure Functions

Explanation

Azure Functions supports serverless event-driven execution.


Question 4

What is the primary benefit of Availability Zones?

A. Lower token usage
B. Increased embedding size
C. Improved fault tolerance
D. Reduced API authentication

Answer

C. Improved fault tolerance

Explanation

Availability Zones provide redundancy across isolated datacenters.


Question 5

Which Azure service is commonly used to host AI web applications and APIs with minimal infrastructure management?

A. Azure App Service
B. Azure Load Balancer
C. Azure DNS
D. Azure Monitor

Answer

A. Azure App Service

Explanation

Azure App Service is a managed PaaS platform for hosting web applications and APIs.


Question 6

Which deployment strategy gradually releases updates to a subset of users first?

A. Blue-green deployment
B. Canary deployment
C. Full rollback deployment
D. Batch deployment

Answer

B. Canary deployment

Explanation

Canary deployments release updates incrementally to reduce risk.


Question 7

Which Azure service securely stores API keys and secrets for AI applications?

A. Azure Key Vault
B. Azure CDN
C. Azure Firewall
D. Azure Backup

Answer

A. Azure Key Vault

Explanation

Azure Key Vault securely manages secrets and credentials.


Question 8

Which Azure deployment option is MOST appropriate for serverless container hosting?

A. Azure Container Apps
B. Azure Backup
C. Azure DNS
D. Azure Files

Answer

A. Azure Container Apps

Explanation

Azure Container Apps provides simplified serverless container deployment.


Question 9

Which deployment architecture improves global application availability and disaster recovery?

A. Single-region deployment
B. Multi-region deployment
C. Local-only deployment
D. Single-container deployment

Answer

B. Multi-region deployment

Explanation

Multi-region deployments improve resilience and geographic performance.


Question 10

Which Azure monitoring service provides application telemetry and request diagnostics?

A. Application Insights
B. Azure CDN
C. Azure DNS
D. Azure Policy

Answer

A. Application Insights

Explanation

Application Insights provides monitoring and telemetry for applications.


Go to the AI-103 Exam Prep Hub main page

Design Azure infrastructure for AI Apps and agent-based solutions (AI-103 Exam Prep)

This post is a part of the AI-103: Develop AI Apps and Agents on Azure Exam Prep Hub. 
This topic falls under these sections:
Plan and manage an Azure AI solution (25–30%)
--> Set up AI solutions in Foundry
--> Design Azure infrastructure for AI Apps and agent-based solutions


Note that there are 10 practice questions (with answers and explanations) at the end of each section to help you solidify your knowledge of the material. Also, there are 2 practice tests with 60 questions each available from the hub's main page below the exam topics section.

Introduction

Designing infrastructure for AI applications and agent-based systems is one of the most important responsibilities for Azure AI developers.

Modern AI solutions are not simply standalone models. They are distributed cloud systems that combine:

  • AI services
  • APIs
  • Databases
  • Search systems
  • Storage
  • Networking
  • Security controls
  • Monitoring systems
  • Agent orchestration components

The AI-103: Develop AI Apps and Agents on Azure certification exam tests your ability to design Azure infrastructure that supports:

  • Generative AI applications
  • AI agents
  • Retrieval-Augmented Generation (RAG)
  • Vector search
  • Multimodal AI systems
  • Scalable AI architectures
  • Secure enterprise AI deployments

For the AI-103 exam, you should understand:

  • Core Azure infrastructure services
  • AI architecture patterns
  • Scalability and performance design
  • Networking and security
  • Identity and access management
  • Storage and databases
  • Monitoring and observability
  • Cost optimization
  • High availability and disaster recovery
  • Infrastructure choices for AI agents

Core Components of AI Infrastructure

AI applications commonly require multiple infrastructure layers.

Typical components include:

  1. AI model services
  2. Compute resources
  3. Storage systems
  4. Search and retrieval systems
  5. Networking components
  6. Security services
  7. Monitoring systems
  8. Workflow orchestration
  9. API management
  10. Identity management

Azure AI Services Layer

Azure OpenAI

Azure OpenAI provides:

  • Large Language Models (LLMs)
  • Embedding models
  • Multimodal models
  • Conversational AI capabilities

Azure OpenAI is commonly used for:

  • AI copilots
  • Chatbots
  • AI agents
  • Summarization
  • Content generation
  • Tool calling

Azure AI Search

Azure AI Search supports:

  • Vector search
  • Semantic search
  • Hybrid search
  • Enterprise retrieval
  • RAG architectures

It is commonly used for:

  • Knowledge grounding
  • Enterprise search
  • AI assistant retrieval

Azure AI Vision

Azure AI Vision provides:

  • OCR
  • Image analysis
  • Object detection
  • Caption generation
  • Visual understanding

Azure AI Document Intelligence

Azure AI Document Intelligence supports:

  • Invoice extraction
  • Form processing
  • Layout analysis
  • OCR workflows
  • Structured document extraction

Compute Infrastructure for AI Applications

Azure App Service

Azure App Service is commonly used to host:

  • Web applications
  • AI front ends
  • APIs
  • Lightweight AI services

Advantages:

  • Managed platform
  • Easy scaling
  • Simplified deployment

Azure Kubernetes Service (AKS)

AKS provides container orchestration for:

  • Large-scale AI applications
  • Microservices
  • Agent orchestration systems
  • Distributed AI workloads

Advantages:

  • High scalability
  • Container management
  • Advanced orchestration
  • Enterprise-grade deployments

When to Use AKS

Use AKS when:

  • Complex orchestration is required
  • Multiple services interact
  • High scalability is needed
  • Microservice architectures are used

Azure Functions

Azure Functions provides serverless compute.

Common AI use cases:

  • Tool execution
  • Event-driven workflows
  • API integrations
  • Lightweight processing
  • Agent tool calling

Advantages:

  • Pay-per-use pricing
  • Automatic scaling
  • Fast development

Azure Container Apps

Azure Container Apps provides simplified container hosting.

Useful for:

  • API services
  • AI middleware
  • Lightweight agent services
  • Event-driven AI components

Choosing the Correct Compute Service

Use Azure App Service When:

  • Hosting simple AI web apps
  • Managing APIs
  • Rapid deployment is needed

Use AKS When:

  • Large-scale orchestration is required
  • Complex microservices exist
  • Advanced scalability is necessary

Use Azure Functions When:

  • Event-driven execution is needed
  • Tool calling is required
  • Lightweight compute is sufficient

Use Azure Container Apps When:

  • Container simplicity is preferred
  • Serverless containers are desired

Storage Infrastructure

AI systems often require multiple storage solutions.


Azure Blob Storage

Azure Blob Storage supports:

  • Document storage
  • Training data
  • Images
  • Videos
  • Logs
  • AI datasets

Common AI uses:

  • RAG document storage
  • Knowledge repositories
  • Media storage

Azure Cosmos DB

Azure Cosmos DB provides:

  • Globally distributed NoSQL storage
  • Low-latency access
  • High scalability

Common AI uses:

  • Agent memory
  • Session storage
  • User profiles
  • Conversation history

Azure SQL Database

Azure SQL Database supports:

  • Structured enterprise data
  • Relational workloads
  • Transactional systems

Common AI uses:

  • Enterprise integration
  • Business systems
  • Structured metadata

Vector Storage

Vector-enabled storage supports:

  • Embedding storage
  • Similarity search
  • Semantic retrieval

Common services include:

  • Azure AI Search
  • Azure Cosmos DB
  • Azure SQL Database

Networking Infrastructure

AI solutions require secure and scalable networking.


Virtual Networks (VNets)

VNets provide:

  • Network isolation
  • Secure communication
  • Private connectivity

Use VNets when:

  • Enterprise security is required
  • Private networking is necessary
  • Sensitive data is involved

Private Endpoints

Private Endpoints allow Azure services to be accessed privately through VNets.

Benefits:

  • Improved security
  • Reduced public exposure
  • Enterprise compliance support

API Management

Azure API Management helps:

  • Secure APIs
  • Throttle requests
  • Monitor API usage
  • Apply policies
  • Manage agent APIs

This is important for:

  • AI agents
  • Tool integrations
  • Enterprise API governance

Load Balancing

Azure Load Balancer and Application Gateway help:

  • Distribute traffic
  • Improve availability
  • Scale AI applications

Identity and Security

Security is a major AI-103 exam topic.


Microsoft Entra ID

Microsoft Entra ID provides:

  • Authentication
  • Authorization
  • Identity management
  • Role-based access control (RBAC)

AI applications use Entra ID for:

  • User authentication
  • API access control
  • Secure enterprise integration

Role-Based Access Control (RBAC)

RBAC ensures users and services only access authorized resources.

Examples:

  • Restricting AI model access
  • Controlling storage access
  • Securing search indexes

Azure Key Vault

Azure Key Vault stores:

  • Secrets
  • API keys
  • Certificates
  • Connection strings

Never hardcode secrets in AI applications.


Azure AI Content Safety

Azure AI Content Safety helps:

  • Detect harmful content
  • Filter unsafe outputs
  • Support responsible AI practices

Monitoring and Observability

AI systems require monitoring for:

  • Reliability
  • Performance
  • Cost
  • Failures
  • Hallucinations
  • API latency

Azure Monitor

Azure Monitor collects:

  • Metrics
  • Logs
  • Alerts
  • Performance data

Application Insights

Application Insights supports:

  • Application telemetry
  • Request tracing
  • Error tracking
  • Dependency monitoring

Useful for:

  • AI apps
  • APIs
  • Agent workflows

Logging AI Systems

AI systems should log:

  • Prompts
  • Responses
  • Errors
  • Tool calls
  • Latency
  • Retrieval quality

Logging helps:

  • Troubleshooting
  • Auditing
  • Evaluation
  • Compliance

Scalability Design

AI applications may experience:

  • High traffic
  • Large token volumes
  • Heavy retrieval workloads
  • Concurrent agent operations

Infrastructure must scale effectively.


Horizontal Scaling

Horizontal scaling adds more instances.

Examples:

  • Additional API servers
  • More containers
  • More worker nodes

Vertical Scaling

Vertical scaling increases resource capacity.

Examples:

  • More CPU
  • More memory
  • Larger VM sizes

Autoscaling

Autoscaling dynamically adjusts resources based on demand.

Common services supporting autoscaling:

  • AKS
  • Azure Functions
  • App Service
  • Container Apps

High Availability and Disaster Recovery

Enterprise AI systems require resilience.


Availability Zones

Availability Zones improve fault tolerance.

Benefits:

  • Redundancy
  • Improved uptime
  • Reduced outage risk

Geo-Redundancy

Geo-redundancy replicates data across regions.

Useful for:

  • Disaster recovery
  • Business continuity
  • Global applications

Backup and Recovery

AI systems should back up:

  • Knowledge indexes
  • Databases
  • Configuration data
  • Logs
  • Agent memory

Infrastructure for AI Agents

AI agents often require additional infrastructure components.


Agent Orchestration

AI agents may require orchestration services such as:

  • Prompt Flow
  • Azure Functions
  • Logic Apps
  • AKS workflows

Retrieval Infrastructure

Agent systems commonly use:

  • Azure AI Search
  • Embeddings
  • Vector indexes
  • RAG pipelines

Persistent Memory Infrastructure

Persistent memory may use:

  • Azure Cosmos DB
  • Azure SQL Database
  • Blob Storage

Tool Integration Infrastructure

Agents often integrate with:

  • REST APIs
  • Databases
  • External SaaS systems
  • Enterprise workflows

Common AI-103 Architecture Scenarios

Scenario 1: Enterprise AI Copilot

Requirements:

  • Conversational AI
  • Enterprise search
  • Secure authentication
  • Document retrieval

Recommended Infrastructure:

  • Azure OpenAI
  • Azure AI Search
  • Entra ID
  • Blob Storage
  • App Service

Scenario 2: Large-Scale Multi-Agent System

Requirements:

  • Multiple AI agents
  • High scalability
  • Distributed orchestration

Recommended Infrastructure:

  • AKS
  • Azure Functions
  • Prompt Flow
  • Cosmos DB

Scenario 3: AI Invoice Processing Solution

Requirements:

  • OCR
  • Document extraction
  • Workflow automation

Recommended Infrastructure:

  • Azure AI Document Intelligence
  • Blob Storage
  • Logic Apps
  • Azure Functions

Scenario 4: Global AI Chat Platform

Requirements:

  • Global availability
  • High concurrency
  • Disaster recovery

Recommended Infrastructure:

  • Geo-redundant storage
  • Availability Zones
  • Load balancing
  • Autoscaling

Cost Optimization Considerations

AI infrastructure can become expensive.


Common Cost Drivers

  • Token usage
  • Vector storage
  • GPU workloads
  • Data transfer
  • Search indexing
  • High-scale orchestration

Cost Optimization Strategies

Use Smaller Models When Appropriate

Smaller models reduce:

  • Compute usage
  • Token costs
  • Latency

Use Autoscaling

Autoscaling reduces idle resource costs.


Optimize Retrieval Pipelines

Efficient chunking and indexing reduce:

  • Search costs
  • Storage requirements
  • Retrieval latency

Common AI-103 Exam Tips

Understand Infrastructure Tradeoffs

Know when to use:

  • AKS vs App Service
  • Functions vs Containers
  • Cosmos DB vs SQL Database

Learn Security Best Practices

Know how to use:

  • Entra ID
  • RBAC
  • Key Vault
  • Private Endpoints

Understand RAG Infrastructure

RAG commonly uses:

  • Azure OpenAI
  • Azure AI Search
  • Embeddings
  • Storage systems

Know Agent Infrastructure Patterns

AI agents commonly require:

  • Workflow orchestration
  • Tool integration
  • Persistent memory
  • Retrieval systems

Summary

Designing Azure infrastructure for AI applications requires balancing:

  • Scalability
  • Security
  • Performance
  • Cost
  • Reliability
  • Maintainability

For the AI-103 exam, you should understand:

  • Azure AI service architecture
  • Compute options
  • Storage design
  • Networking and security
  • Monitoring and observability
  • High availability
  • Agent infrastructure patterns
  • RAG infrastructure
  • Infrastructure scaling strategies

Strong infrastructure design skills are essential for deploying production-grade AI apps and agent-based systems on Azure.


Practice Exam Questions

Question 1

Which Azure service is MOST appropriate for enterprise vector search and RAG retrieval?

A. Azure AI Search
B. Azure Backup
C. Azure CDN
D. Azure DNS

Answer

A. Azure AI Search

Explanation

Azure AI Search supports vector search, semantic search, and retrieval for RAG systems.


Question 2

Which Azure compute service is BEST suited for large-scale containerized AI microservices?

A. Azure App Service
B. Azure Kubernetes Service (AKS)
C. Azure Files
D. Azure CDN

Answer

B. Azure Kubernetes Service (AKS)

Explanation

AKS provides advanced container orchestration and scalability.


Question 3

Which Azure service is MOST appropriate for storing API keys and secrets securely?

A. Azure Key Vault
B. Azure Monitor
C. Azure DNS
D. Azure Load Balancer

Answer

A. Azure Key Vault

Explanation

Azure Key Vault securely stores secrets, certificates, and keys.


Question 4

Which Azure service provides serverless execution for lightweight AI workflows and tool calling?

A. Azure Functions
B. Azure Backup
C. Azure CDN
D. Azure Firewall

Answer

A. Azure Functions

Explanation

Azure Functions supports event-driven serverless compute.


Question 5

What is the primary purpose of Availability Zones?

A. Reduce token usage
B. Improve fault tolerance and uptime
C. Replace backups
D. Encrypt embeddings

Answer

B. Improve fault tolerance and uptime

Explanation

Availability Zones provide redundancy across isolated datacenter locations.


Question 6

Which Azure service is MOST commonly used for globally distributed NoSQL storage in AI applications?

A. Azure Cosmos DB
B. Azure DNS
C. Azure Files
D. Azure CDN

Answer

A. Azure Cosmos DB

Explanation

Azure Cosmos DB provides scalable globally distributed NoSQL storage.


Question 7

Which Azure networking feature enables private access to Azure services from a VNet?

A. Private Endpoint
B. Public IP
C. Load Balancer
D. Traffic Manager

Answer

A. Private Endpoint

Explanation

Private Endpoints provide secure private connectivity.


Question 8

Which Azure monitoring service provides application telemetry and request tracing?

A. Application Insights
B. Azure CDN
C. Azure Policy
D. Azure ExpressRoute

Answer

A. Application Insights

Explanation

Application Insights provides telemetry and diagnostics for applications.


Question 9

Which Azure identity service provides authentication and RBAC support for AI applications?

A. Microsoft Entra ID
B. Azure CDN
C. Azure Firewall
D. Azure Front Door

Answer

A. Microsoft Entra ID

Explanation

Microsoft Entra ID provides identity and access management.


Question 10

Which scaling strategy adds additional instances to support increased AI workload demand?

A. Vertical scaling
B. Horizontal scaling
C. Encryption scaling
D. Semantic scaling

Answer

B. Horizontal scaling

Explanation

Horizontal scaling adds more instances to distribute workloads.


Go to the AI-103 Exam Prep Hub main page

Choose appropriate memory, tool, and knowledge integration services for agent solutions (AI-103 Exam Prep)

This post is a part of the AI-103: Develop AI Apps and Agents on Azure Exam Prep Hub. 
This topic falls under these sections:
Plan and manage an Azure AI solution (25–30%)
--> Choose the appropriate Foundry services for generative AI and agents
--> Choose appropriate memory, tool, and knowledge integration services for agent solutions


Note that there are 10 practice questions (with answers and explanations) at the end of each section to help you solidify your knowledge of the material. Also, there are 2 practice tests with 60 questions each available from the hub's main page below the exam topics section.

Introduction

Modern AI agents are far more advanced than traditional chatbots.

AI agents can:

  • Reason through problems
  • Plan tasks
  • Access tools
  • Retrieve knowledge
  • Maintain conversational memory
  • Execute workflows
  • Interact with enterprise systems
  • Coordinate multi-step operations

The AI-103: Develop AI Apps and Agents on Azure certification exam places significant emphasis on understanding how to design and implement these agent capabilities using Azure AI Foundry and related Azure services.

One of the most important skills tested on the exam is the ability to choose appropriate:

  • Memory systems
  • Tool integration services
  • Knowledge integration services
  • Retrieval architectures
  • Agent orchestration tools

For the AI-103 exam, you should understand:

  • Different types of agent memory
  • Tool calling and function calling
  • Retrieval-Augmented Generation (RAG)
  • Knowledge grounding
  • Azure AI Search integration
  • Agent orchestration workflows
  • External API integration
  • Vector search and embeddings
  • Enterprise knowledge integration
  • Security and governance considerations

What Are AI Agents?

AI agents are AI-powered systems capable of:

  • Interpreting goals
  • Planning actions
  • Using tools
  • Retrieving information
  • Maintaining context
  • Completing tasks autonomously or semi-autonomously

Unlike traditional chatbots, AI agents can:

  • Interact with APIs
  • Execute workflows
  • Use memory
  • Retrieve enterprise knowledge
  • Chain actions together
  • Adapt dynamically to user requests

Components of an AI Agent Architecture

Modern AI agent solutions commonly include:

  1. Large Language Models (LLMs)
  2. Memory systems
  3. Retrieval systems
  4. Knowledge integration
  5. Tool and function calling
  6. Workflow orchestration
  7. Security and governance controls

Azure AI Foundry and Agent Solutions

Azure AI Foundry provides services and tools that help developers:

  • Build AI agents
  • Integrate tools
  • Connect enterprise knowledge
  • Implement RAG
  • Orchestrate workflows
  • Evaluate agent behavior
  • Monitor AI systems

Core services often include:

  • Azure OpenAI
  • Azure AI Search
  • Prompt Flow
  • Azure AI Content Safety
  • Azure Functions
  • Azure Logic Apps
  • Azure Cosmos DB
  • Azure SQL Database

Memory in AI Agents

What Is Agent Memory?

Memory enables AI agents to retain and use information over time.

Memory allows agents to:

  • Maintain conversational context
  • Remember user preferences
  • Track workflow state
  • Store historical interactions
  • Support long-running tasks

Without memory, every interaction becomes isolated.


Types of Agent Memory

The AI-103 exam may test multiple memory types.


Short-Term Memory

What Is Short-Term Memory?

Short-term memory stores temporary conversational context.

Examples:

  • Current chat history
  • Active task context
  • Immediate instructions

Characteristics of Short-Term Memory

  • Session-based
  • Temporary
  • Fast access
  • Often stored in prompts or session state

When to Use Short-Term Memory

Use short-term memory for:

  • Conversational continuity
  • Current workflow tracking
  • Multi-turn conversations

Long-Term Memory

What Is Long-Term Memory?

Long-term memory stores persistent information across sessions.

Examples:

  • User preferences
  • Historical interactions
  • Persistent profiles
  • Prior decisions

Characteristics of Long-Term Memory

  • Persistent storage
  • Cross-session continuity
  • Larger storage capacity
  • Supports personalization

Azure Services for Long-Term Memory

Common services include:

  • Azure Cosmos DB
  • Azure SQL Database
  • Azure Storage
  • Vector databases

When to Use Long-Term Memory

Use long-term memory when:

  • Personalization is required
  • User preferences must persist
  • Historical context matters
  • Long-running workflows exist

Semantic Memory

What Is Semantic Memory?

Semantic memory stores knowledge in embeddings or vectorized formats.

This enables:

  • Semantic retrieval
  • Knowledge recall
  • Contextual understanding
  • Similarity matching

Semantic Memory in AI Agents

Semantic memory often uses:

  • Embedding models
  • Vector search
  • Azure AI Search

This allows agents to retrieve relevant information dynamically.


Episodic Memory

What Is Episodic Memory?

Episodic memory stores records of past interactions and events.

Examples:

  • Past conversations
  • Completed workflows
  • User activity history

This helps agents maintain continuity across interactions.


Choosing the Correct Memory Type

Use Short-Term Memory When:

  • Managing active conversations
  • Maintaining immediate context
  • Supporting temporary tasks

Use Long-Term Memory When:

  • Storing persistent user information
  • Personalizing experiences
  • Maintaining history across sessions

Use Semantic Memory When:

  • Retrieving knowledge semantically
  • Supporting RAG
  • Performing contextual retrieval

Use Episodic Memory When:

  • Tracking prior interactions
  • Supporting historical continuity

Knowledge Integration

What Is Knowledge Integration?

Knowledge integration connects AI agents to external information sources.

Examples:

  • Enterprise documents
  • Databases
  • Knowledge bases
  • APIs
  • Websites
  • Internal systems

Knowledge integration helps agents:

  • Provide grounded answers
  • Access current information
  • Reduce hallucinations
  • Support enterprise use cases

Retrieval-Augmented Generation (RAG)

What Is RAG?

RAG combines:

  • Retrieval systems
  • Search indexes
  • Embeddings
  • LLMs

RAG enables agents to retrieve external information before generating responses.


Azure AI Search for Knowledge Integration

Azure AI Search is a core service for:

  • Vector search
  • Semantic search
  • Hybrid search
  • Enterprise retrieval
  • Knowledge grounding

It enables agents to:

  • Search enterprise documents
  • Retrieve semantically relevant content
  • Access indexed knowledge

Hybrid Search

Hybrid search combines:

  • Keyword search
  • Semantic ranking
  • Vector search

Hybrid search is often the preferred approach for enterprise AI agents.


Embeddings and Knowledge Retrieval

Embedding models convert content into vector representations.

Embeddings support:

  • Semantic similarity
  • Vector retrieval
  • Knowledge recall
  • RAG pipelines

Azure OpenAI embedding models are commonly used.


Knowledge Sources for AI Agents

AI agents may integrate with:

  • Azure Blob Storage
  • SharePoint
  • Databases
  • REST APIs
  • Enterprise document repositories
  • CRM systems
  • ERP systems

Tool Integration

What Is Tool Integration?

Tool integration enables AI agents to interact with external systems.

Examples include:

  • APIs
  • Databases
  • Email systems
  • Calendars
  • Search services
  • Workflow systems

Tool integration allows agents to perform actions instead of only generating text.


Tool Calling and Function Calling

LLMs can invoke:

  • Tools
  • Functions
  • APIs

Examples:

  • Retrieve weather data
  • Send emails
  • Query databases
  • Create support tickets
  • Execute workflows

Azure Services for Tool Integration

Common services include:

  • Azure Functions
  • Azure Logic Apps
  • REST APIs
  • Azure API Management

Azure Functions

Azure Functions provides serverless compute for:

  • API integrations
  • Business logic
  • Event-driven workflows
  • Tool execution

AI agents often call Azure Functions to execute tasks.


Azure Logic Apps

Azure Logic Apps supports:

  • Workflow automation
  • Enterprise integrations
  • Connector-based orchestration

Logic Apps are useful when:

  • Multiple systems must interact
  • Low-code orchestration is preferred
  • Enterprise automation is needed

Azure API Management

Azure API Management helps:

  • Secure APIs
  • Manage API access
  • Monitor API usage
  • Apply governance policies

Useful for enterprise AI agent integrations.


Prompt Flow

Prompt Flow is a Foundry tool for:

  • Building AI workflows
  • Orchestrating prompts
  • Chaining tools
  • Managing agent pipelines
  • Evaluating workflows

Prompt Flow is a major AI-103 exam topic.


Multi-Agent Systems

Some AI architectures use multiple specialized agents.

Examples:

  • Research agent
  • Scheduling agent
  • Data retrieval agent
  • Customer service agent

Multi-agent systems may improve:

  • Scalability
  • Specialization
  • Workflow separation

Orchestration Services

Agent orchestration coordinates:

  • Memory
  • Retrieval
  • Tool execution
  • Workflow management

Common orchestration tools include:

  • Prompt Flow
  • Azure Functions
  • Logic Apps
  • Custom orchestration frameworks

Security and Governance

AI agent systems require:

  • Authentication
  • Authorization
  • Data protection
  • Content filtering
  • Responsible AI controls

Azure AI Content Safety

Azure AI Content Safety helps:

  • Detect harmful content
  • Prevent unsafe outputs
  • Support responsible AI deployments

Role-Based Access Control (RBAC)

RBAC ensures agents only access authorized resources.

This is especially important for:

  • Enterprise knowledge systems
  • Confidential data
  • Regulated environments

Monitoring and Observability

AI agent systems should monitor:

  • Tool usage
  • Latency
  • Errors
  • Retrieval quality
  • Hallucinations
  • Token usage

Monitoring improves:

  • Reliability
  • Performance
  • Troubleshooting

Common AI-103 Scenarios

Scenario 1: Enterprise Copilot

Requirements:

  • Access enterprise documents
  • Remember user preferences
  • Retrieve current information
  • Support conversational interactions

Recommended Services:

  • Azure OpenAI
  • Azure AI Search
  • Embedding models
  • Long-term memory storage

Scenario 2: AI Travel Assistant

Requirements:

  • Access calendars
  • Book hotels
  • Query APIs
  • Manage workflows

Recommended Services:

  • Azure OpenAI
  • Tool/function calling
  • Azure Functions
  • Prompt Flow

Scenario 3: Customer Support Agent

Requirements:

  • Retrieve support documents
  • Track prior interactions
  • Escalate tickets

Recommended Services:

  • Azure AI Search
  • Episodic memory
  • Azure Functions
  • CRM integration

Scenario 4: Personalized Learning Assistant

Requirements:

  • Remember learning preferences
  • Track progress
  • Recommend materials

Recommended Services:

  • Long-term memory
  • Semantic retrieval
  • Azure Cosmos DB

Common AI-103 Exam Tips

Understand Memory Types

Know the differences between:

  • Short-term memory
  • Long-term memory
  • Semantic memory
  • Episodic memory

Know When to Use RAG

Use RAG when:

  • External knowledge is required
  • Current data is needed
  • Hallucination reduction matters

Learn Tool Calling Concepts

Agents use:

  • Function calling
  • APIs
  • Workflows
  • Tool orchestration

This is commonly tested.


Understand Azure Service Roles

Azure AI Search

Used for:

  • Retrieval
  • Vector search
  • Grounding

Azure Functions

Used for:

  • Executing logic
  • Tool integration

Prompt Flow

Used for:

  • Workflow orchestration
  • Agent pipelines

Azure Cosmos DB

Used for:

  • Persistent memory
  • Long-term storage

Summary

AI agents require more than just language models.

Successful agent solutions combine:

  • Memory systems
  • Retrieval systems
  • Knowledge grounding
  • Tool integration
  • Workflow orchestration
  • Security controls

For the AI-103 exam, you should understand:

  • Different memory architectures
  • Tool and function calling
  • RAG workflows
  • Azure AI Search integration
  • Knowledge retrieval strategies
  • Prompt Flow orchestration
  • Persistent memory services
  • Enterprise AI integration patterns

Understanding how these services work together is critical for building scalable and intelligent AI agent solutions.


Practice Exam Questions

Question 1

Which type of memory is MOST appropriate for maintaining conversational context during a single chat session?

A. Long-term memory
B. Semantic memory
C. Short-term memory
D. Episodic memory

Answer

C. Short-term memory

Explanation

Short-term memory maintains active conversational context within a session.


Question 2

Which Azure service is MOST commonly used for semantic retrieval and grounding in AI agents?

A. Azure AI Search
B. Azure Backup
C. Azure DNS
D. Azure Firewall

Answer

A. Azure AI Search

Explanation

Azure AI Search provides vector search and semantic retrieval capabilities.


Question 3

What is the primary purpose of Retrieval-Augmented Generation (RAG)?

A. Replace embeddings
B. Reduce retrieval latency only
C. Ground responses using retrieved information
D. Eliminate vector search

Answer

C. Ground responses using retrieved information

Explanation

RAG retrieves external information to improve groundedness and reduce hallucinations.


Question 4

Which Azure service is MOST appropriate for serverless tool execution within AI agents?

A. Azure Functions
B. Azure CDN
C. Azure Backup
D. Azure Policy

Answer

A. Azure Functions

Explanation

Azure Functions supports serverless execution of business logic and APIs.


Question 5

Which memory type stores knowledge using embeddings and vector representations?

A. Short-term memory
B. Semantic memory
C. Transactional memory
D. Procedural memory

Answer

B. Semantic memory

Explanation

Semantic memory stores information in vectorized forms for retrieval.


Question 6

Which Foundry tool is primarily used for orchestrating AI workflows and agent pipelines?

A. Azure Backup
B. Prompt Flow
C. Azure DNS
D. Azure Storage Explorer

Answer

B. Prompt Flow

Explanation

Prompt Flow supports workflow orchestration and prompt chaining.


Question 7

What is the primary advantage of long-term memory in AI agents?

A. Faster GPU performance
B. Persistent cross-session personalization
C. Lower token usage only
D. Reduced API calls

Answer

B. Persistent cross-session personalization

Explanation

Long-term memory enables persistent storage of preferences and history.


Question 8

Which Azure service is MOST appropriate for low-code workflow automation in enterprise agent systems?

A. Azure Logic Apps
B. Azure DNS
C. Azure Monitor
D. Azure DevTest Labs

Answer

A. Azure Logic Apps

Explanation

Azure Logic Apps provides low-code workflow orchestration and integrations.


Question 9

Which capability allows AI agents to invoke APIs and external systems dynamically?

A. OCR
B. Function calling
C. Metadata filtering
D. Image segmentation

Answer

B. Function calling

Explanation

Function calling enables AI models to interact with external tools and services.


Question 10

Which Azure service is MOST appropriate for persistent scalable storage of AI agent memory?

A. Azure Cosmos DB
B. Azure CDN
C. Azure Firewall
D. Azure ExpressRoute

Answer

A. Azure Cosmos DB

Explanation

Azure Cosmos DB is commonly used for scalable persistent memory storage.


Go to the AI-103 Exam Prep Hub main page

Choose an appropriate method for retrieval and indexing (AI-103 Exam Prep)

This post is a part of the AI-103: Develop AI Apps and Agents on Azure Exam Prep Hub. 
This topic falls under these sections:
Plan and manage an Azure AI solution (25–30%)
--> Choose the appropriate Foundry services for generative AI and agents
--> Choose an appropriate method for retrieval and indexing


Note that there are 10 practice questions (with answers and explanations) at the end of each section to help you solidify your knowledge of the material. Also, there are 2 practice tests with 60 questions each available from the hub's main page below the exam topics section.

Introduction

One of the most important concepts in modern AI applications is the ability to retrieve the correct information efficiently and accurately.

The AI-103: Develop AI Apps and Agents on Azure certification exam heavily tests knowledge related to:

  • Retrieval methods
  • Indexing strategies
  • Vector search
  • Semantic search
  • Retrieval-Augmented Generation (RAG)
  • Hybrid search
  • Embeddings
  • Knowledge grounding

Modern AI systems are often only as effective as their retrieval systems.

Even highly advanced Large Language Models (LLMs) can:

  • Hallucinate
  • Provide outdated information
  • Miss relevant context

Retrieval and indexing systems solve these problems by providing grounded, relevant, and searchable information to AI applications.

For the AI-103 exam, you should understand:

  • Different retrieval methods
  • Different indexing approaches
  • When to use vector search
  • When keyword search is appropriate
  • When hybrid search is preferred
  • How embeddings support retrieval
  • How Azure AI Search supports enterprise AI systems
  • How RAG architectures work

What Is Retrieval?

Retrieval is the process of locating and returning relevant information from a data source.

Examples include:

  • Searching documents
  • Finding relevant knowledge articles
  • Retrieving product descriptions
  • Returning similar documents
  • Finding semantically related content

Retrieval is essential for:

  • AI copilots
  • Enterprise chatbots
  • Knowledge assistants
  • Search applications
  • Recommendation systems
  • AI agents

What Is Indexing?

Indexing is the process of organizing data to make retrieval efficient.

An index acts like a searchable map of content.

Without indexing:

  • Searches are slower
  • Retrieval is inefficient
  • AI systems scale poorly

Indexes may include:

  • Keywords
  • Metadata
  • Embeddings
  • Semantic relationships
  • Document structure

Why Retrieval and Indexing Matter in AI

Modern generative AI applications often use Retrieval-Augmented Generation (RAG).

RAG combines:

  • Retrieval systems
  • Search indexes
  • Embeddings
  • LLMs

This allows AI systems to:

  • Access current information
  • Use enterprise knowledge
  • Reduce hallucinations
  • Provide grounded answers
  • Improve accuracy

Azure Services for Retrieval and Indexing

The primary Azure service for retrieval and indexing is:

  • Azure AI Search

Additional supporting services include:

  • Azure OpenAI
  • Embedding models
  • Azure Cosmos DB
  • Azure SQL Database
  • Azure Blob Storage

Azure AI Search

Azure AI Search is Microsoft’s enterprise search platform.

It supports:

  • Full-text search
  • Semantic search
  • Vector search
  • Hybrid search
  • AI enrichment
  • Indexing pipelines

Azure AI Search is a core AI-103 exam topic.


Retrieval Methods

There are several major retrieval methods you must understand for AI-103.


Keyword Search

What Is Keyword Search?

Keyword search retrieves documents based on exact word matches.

Example:

Searching for:

“cloud security”

Returns documents containing those exact terms.


Advantages of Keyword Search

  • Fast
  • Simple
  • Efficient for exact matches
  • Mature technology
  • Works well for structured terminology

Limitations of Keyword Search

Keyword search struggles with:

  • Synonyms
  • Contextual meaning
  • Natural language understanding
  • Conceptual similarity

Example:

A search for:

“car”

May not return documents containing:

“vehicle”


When to Use Keyword Search

Use keyword search when:

  • Exact term matching is important
  • Queries are highly structured
  • Performance and simplicity matter
  • Semantic understanding is unnecessary

Semantic Search

What Is Semantic Search?

Semantic search understands meaning and context rather than relying only on exact words.

It uses AI to interpret:

  • Intent
  • Context
  • Relationships between concepts

Example of Semantic Search

A query for:

“How do I secure cloud infrastructure?”

May retrieve documents about:

  • Azure security
  • Network protection
  • Cloud compliance

Even if the exact words differ.


Advantages of Semantic Search

  • Better contextual understanding
  • Improved relevance
  • More natural interactions
  • Better user experience

Limitations of Semantic Search

  • More computationally expensive
  • May increase latency
  • Requires more advanced indexing

When to Use Semantic Search

Use semantic search when:

  • Natural language queries are common
  • Relevance is important
  • Users may not know exact terminology
  • Context matters

Vector Search

What Is Vector Search?

Vector search retrieves information using embeddings.

Embeddings are numerical vector representations of content.

Documents with similar meaning have vectors that are mathematically close.


How Vector Search Works

  1. Documents are converted into embeddings
  2. Embeddings are stored in a vector index
  3. User queries are converted into embeddings
  4. Similarity algorithms identify related vectors
  5. Relevant documents are returned

Advantages of Vector Search

  • Excellent semantic similarity matching
  • Supports RAG architectures
  • Finds conceptually related content
  • Works well with natural language queries

Limitations of Vector Search

  • Higher storage requirements
  • More computational overhead
  • Requires embedding generation
  • More complex implementation

When to Use Vector Search

Use vector search when:

  • Building RAG systems
  • Implementing AI copilots
  • Performing semantic retrieval
  • Supporting conversational AI
  • Searching unstructured content

Hybrid Search

What Is Hybrid Search?

Hybrid search combines:

  • Keyword search
  • Semantic search
  • Vector search

This approach often produces the best retrieval quality.


Why Hybrid Search Matters

Hybrid search combines the strengths of multiple retrieval approaches.

Benefits include:

  • Exact keyword matching
  • Semantic understanding
  • Contextual similarity
  • Improved ranking quality

When to Use Hybrid Search

Use hybrid search when:

  • High retrieval quality is required
  • Enterprise search is needed
  • AI copilots require strong grounding
  • Search relevance is critical

Hybrid search is commonly used in production RAG systems.


Embeddings

What Are Embeddings?

Embeddings are numerical representations of data.

Embedding models transform:

  • Text
  • Images
  • Documents

Into vectors.

Embeddings capture semantic meaning.


Embedding Models

Azure OpenAI provides embedding models used for:

  • Vector search
  • Similarity matching
  • RAG systems
  • Recommendation systems

Chunking Strategies

What Is Chunking?

Chunking is the process of breaking large documents into smaller sections before indexing.

Chunking improves retrieval quality because:

  • Smaller chunks are easier to match
  • Context becomes more precise
  • Retrieval relevance improves

Common Chunking Methods

Fixed-Size Chunking

Documents are split into equal-sized chunks.

Advantages:

  • Simple
  • Easy to implement

Disadvantages:

  • May split important context

Semantic Chunking

Documents are split based on meaning or structure.

Advantages:

  • Better contextual integrity
  • Improved retrieval quality

Disadvantages:

  • More complex

Overlapping Chunks

Adjacent chunks share some content.

Advantages:

  • Preserves context continuity
  • Improves retrieval accuracy

Disadvantages:

  • Increased storage usage

Choosing a Chunking Strategy

Use Fixed-Size Chunking When:

  • Simplicity is important
  • Documents are uniform
  • Rapid implementation is needed

Use Semantic Chunking When:

  • Context preservation matters
  • Documents contain sections/topics
  • Retrieval quality is critical

Use Overlapping Chunks When:

  • Context continuity is important
  • Long-form content is indexed

Metadata Filtering

Indexes may include metadata such as:

  • Author
  • Date
  • Department
  • Category
  • Security level

Metadata filtering improves:

  • Precision
  • Security
  • Retrieval efficiency

Example Metadata Filtering Scenario

An enterprise chatbot retrieves only documents:

  • From HR
  • Created within the last year
  • Approved for employee access

Metadata filters help enforce these constraints.


Retrieval-Augmented Generation (RAG)

What Is RAG?

Retrieval-Augmented Generation combines retrieval systems with LLMs.

The workflow:

  1. User submits a query
  2. Query becomes an embedding
  3. Vector search retrieves relevant documents
  4. Retrieved content is added to the prompt
  5. LLM generates grounded response

Benefits of RAG

RAG helps:

  • Reduce hallucinations
  • Use current enterprise data
  • Avoid retraining models
  • Improve factual accuracy
  • Support enterprise AI assistants

Choosing Retrieval Methods for RAG

Keyword Search

Best for:

  • Exact terminology
  • Compliance searches
  • Structured queries

Vector Search

Best for:

  • Semantic similarity
  • Natural language queries
  • Conversational AI

Hybrid Search

Best for:

  • Enterprise copilots
  • High-quality retrieval
  • Production RAG systems

Indexing Pipelines

What Is an Indexing Pipeline?

An indexing pipeline automates:

  • Data ingestion
  • Document parsing
  • Chunking
  • Embedding generation
  • Metadata extraction
  • Index updates

AI Enrichment

Azure AI Search supports AI enrichment during indexing.

AI enrichment may include:

  • OCR
  • Entity extraction
  • Key phrase extraction
  • Language detection
  • Image analysis

Incremental Indexing

Incremental indexing updates only changed documents.

Benefits:

  • Faster indexing
  • Lower compute costs
  • Better scalability

Full Reindexing

Full reindexing rebuilds the entire index.

Use when:

  • Schema changes occur
  • Embedding models change
  • Large structural updates are required

Choosing an Indexing Strategy

Use Incremental Indexing When:

  • Data changes frequently
  • Efficiency matters
  • Large datasets exist

Use Full Reindexing When:

  • Major schema updates occur
  • Embedding strategy changes
  • Large-scale restructuring is required

Security and Access Control

Retrieval systems often include:

  • Role-based access control
  • Document-level security
  • Metadata-based filtering

This ensures users retrieve only authorized content.


Common AI-103 Scenarios

Scenario 1: Enterprise Knowledge Assistant

Requirements:

  • Conversational search
  • Semantic retrieval
  • Enterprise grounding

Recommended Approach:

  • Azure AI Search
  • Embeddings
  • Hybrid search
  • RAG

Scenario 2: Compliance Document Search

Requirements:

  • Exact terminology
  • Legal references
  • Precision retrieval

Recommended Approach:

  • Keyword search
  • Metadata filtering

Scenario 3: AI Copilot

Requirements:

  • Natural language queries
  • Contextual retrieval
  • Strong relevance

Recommended Approach:

  • Hybrid search
  • Vector search
  • Embeddings

Scenario 4: Product Recommendation System

Requirements:

  • Similarity matching
  • Semantic relationships

Recommended Approach:

  • Embeddings
  • Vector search

Common AI-103 Exam Tips

Understand Retrieval Tradeoffs

Keyword Search

  • Fast
  • Exact matching
  • Weak semantic understanding

Semantic Search

  • Better contextual understanding
  • More advanced relevance

Vector Search

  • Best for semantic similarity
  • Requires embeddings

Hybrid Search

  • Often best overall retrieval quality

Know the Relationship Between Embeddings and Vector Search

Embeddings enable vector search.

Without embeddings, vector search cannot function.


Understand RAG Architectures

RAG combines:

  • Retrieval
  • Indexing
  • Vector search
  • LLMs

This is one of the MOST important AI-103 topics.


Learn Chunking Concepts

Chunking affects:

  • Retrieval quality
  • Context preservation
  • Index efficiency

Chunking questions commonly appear in scenario-based exam questions.


Summary

Retrieval and indexing are foundational components of modern AI systems.

For the AI-103 exam, you should understand:

  • Keyword search
  • Semantic search
  • Vector search
  • Hybrid search
  • Embeddings
  • Chunking strategies
  • Metadata filtering
  • Indexing pipelines
  • Incremental indexing
  • RAG architectures
  • Azure AI Search capabilities

Choosing the correct retrieval and indexing approach directly affects:

  • AI accuracy
  • Groundedness
  • Scalability
  • Cost
  • Performance
  • User experience

Strong retrieval systems are essential for enterprise AI copilots, chatbots, and AI agents.


Practice Exam Questions

Question 1

Which retrieval method relies primarily on exact word matching?

A. Vector search
B. Semantic search
C. Keyword search
D. Hybrid search

Answer

C. Keyword search

Explanation

Keyword search retrieves content using exact lexical matches.


Question 2

Which retrieval method uses embeddings to identify semantically similar content?

A. Keyword search
B. Vector search
C. Lexical search
D. Metadata search

Answer

B. Vector search

Explanation

Vector search uses embeddings to perform similarity matching.


Question 3

What is the primary benefit of Retrieval-Augmented Generation (RAG)?

A. Eliminates embeddings
B. Improves groundedness using retrieved information
C. Removes the need for indexing
D. Replaces semantic search

Answer

B. Improves groundedness using retrieved information

Explanation

RAG improves factual accuracy by grounding responses with retrieved data.


Question 4

Which Azure service is MOST commonly used for enterprise vector search?

A. Azure AI Search
B. Azure DNS
C. Azure Backup
D. Azure Load Balancer

Answer

A. Azure AI Search

Explanation

Azure AI Search provides vector indexing and retrieval capabilities.


Question 5

What is the purpose of chunking during indexing?

A. Encrypt documents
B. Break documents into smaller searchable sections
C. Compress embeddings
D. Eliminate metadata

Answer

B. Break documents into smaller searchable sections

Explanation

Chunking improves retrieval quality and contextual matching.


Question 6

Which search method combines vector search, semantic ranking, and keyword matching?

A. Binary search
B. Metadata search
C. Hybrid search
D. OCR search

Answer

C. Hybrid search

Explanation

Hybrid search combines multiple retrieval methods.


Question 7

What is the primary purpose of embeddings?

A. Encrypt data
B. Create semantic vector representations
C. Compress images
D. Improve OCR quality

Answer

B. Create semantic vector representations

Explanation

Embeddings convert content into vectors representing semantic meaning.


Question 8

Which chunking strategy helps preserve context continuity between adjacent chunks?

A. Fixed chunking
B. Metadata chunking
C. Overlapping chunks
D. Compression chunking

Answer

C. Overlapping chunks

Explanation

Overlapping chunks preserve continuity across document sections.


Question 9

When is incremental indexing MOST appropriate?

A. When rebuilding the entire schema
B. When documents change frequently
C. When changing embedding models
D. When deleting the index

Answer

B. When documents change frequently

Explanation

Incremental indexing updates only modified documents.


Question 10

Which retrieval approach is MOST appropriate for enterprise AI copilots requiring high-quality relevance?

A. Keyword search only
B. Hybrid search
C. Metadata filtering only
D. OCR search

Answer

B. Hybrid search

Explanation

Hybrid search combines multiple retrieval methods for improved relevance.


Go to the AI-103 Exam Prep Hub main page

Choose the appropriate Foundry Services for generative tasks, Grounding, Vector Search, Agent Workflows, or Multimodal Processing (AI-103 Exam Prep)

This post is a part of the AI-103: Develop AI Apps and Agents on Azure Exam Prep Hub. 
This topic falls under these sections:
Plan and manage an Azure AI solution (25–30%)
--> Choose the appropriate Foundry services for generative AI and agents
--> Choose the Appropriate Foundry Services for generative tasks, Grounding, Vector Search, Agent Workflows, or Multimodal Processing


Note that there are 10 practice questions (with answers and explanations) at the end of each section to help you solidify your knowledge of the material. Also, there are 2 practice tests with 60 questions each available from the hub's main page below the exam topics section.

Introduction

One of the core responsibilities of an Azure AI developer is selecting the correct Azure AI Foundry services and supporting Azure technologies for specific AI workloads.

The AI-103 certification exam places significant emphasis on understanding how Azure AI Foundry services support:

  • Generative AI tasks
  • Grounding and Retrieval-Augmented Generation (RAG)
  • Vector search
  • AI agent workflows
  • Multimodal processing

Modern AI solutions are composed of multiple services working together rather than a single AI model.

For example:

  • A chatbot may require an LLM, vector search, embeddings, grounding, and agent orchestration.
  • A document assistant may require multimodal processing, OCR, embeddings, and RAG.
  • An AI agent may require tool calling, memory, orchestration, and workflow management.

Understanding which Foundry services to use in each scenario is critical both for the AI-103 exam and for real-world Azure AI development.


What Is Azure AI Foundry?

Azure AI Foundry is Microsoft’s unified AI development platform for:

  • Building AI applications
  • Developing AI agents
  • Managing models
  • Orchestrating workflows
  • Evaluating AI systems
  • Implementing responsible AI controls

Azure AI Foundry provides:

  • Model access
  • Prompt engineering tools
  • Agent frameworks
  • Retrieval and grounding tools
  • Evaluation systems
  • Safety controls
  • Deployment and monitoring capabilities

It integrates with many Azure AI services including:

  • Azure OpenAI
  • Azure AI Search
  • Azure AI Vision
  • Azure AI Language
  • Azure AI Document Intelligence
  • Azure AI Content Safety

Understanding the Core Service Categories

For the AI-103 exam, you should understand how Foundry services align to these major AI solution categories:

  1. Generative AI services
  2. Grounding and RAG services
  3. Vector search services
  4. Agent workflow services
  5. Multimodal processing services
  6. Evaluation and safety services

Generative AI Services

What Are Generative AI Services?

Generative AI services enable applications to:

  • Generate text
  • Summarize content
  • Create conversations
  • Produce code
  • Generate structured outputs
  • Perform reasoning tasks
  • Support AI copilots and assistants

The primary Foundry-related service for generative AI is:

  • Azure OpenAI Service

Azure OpenAI Service

Azure OpenAI provides access to advanced foundation models such as:

  • GPT models
  • GPT-4-class reasoning models
  • Multimodal GPT models
  • Embedding models
  • Audio-capable models

Azure OpenAI is commonly used for:

  • Chatbots
  • AI copilots
  • Content generation
  • AI agents
  • Coding assistants
  • Summarization
  • Question answering

When to Use Azure OpenAI

Use Azure OpenAI when the solution requires:

  • Natural language generation
  • Conversational AI
  • Complex reasoning
  • Function/tool calling
  • AI agents
  • Summarization
  • Code generation
  • Long-context processing

Example Generative AI Scenario

Scenario

A company wants to create an AI assistant that:

  • Answers employee questions
  • Summarizes internal documents
  • Generates emails
  • Uses enterprise data

Recommended Services:

  • Azure OpenAI
  • Azure AI Search
  • Embedding models
  • RAG architecture

Reason:

Azure OpenAI provides the conversational and reasoning capabilities.


Grounding and Retrieval-Augmented Generation (RAG)

What Is Grounding?

Grounding refers to providing AI models with reliable external data sources so responses are based on factual and current information.

Without grounding, LLMs may:

  • Hallucinate
  • Provide outdated information
  • Generate inaccurate answers

Grounding improves:

  • Accuracy
  • Relevance
  • Reliability
  • Enterprise trustworthiness

What Is Retrieval-Augmented Generation (RAG)?

RAG combines:

  • Retrieval systems
  • Embedding models
  • Vector search
  • Generative AI models

The workflow typically includes:

  1. Convert documents into embeddings
  2. Store vectors in a vector index
  3. Convert user query into embeddings
  4. Retrieve relevant content
  5. Inject retrieved content into the LLM prompt
  6. Generate grounded response

Azure Services Used for RAG

Common Azure services used for grounding and RAG include:

  • Azure AI Search
  • Azure OpenAI
  • Embedding models
  • Azure Storage
  • Azure Cosmos DB (optional)
  • Azure SQL Database with vector support

Azure AI Search

Azure AI Search is a core service for:

  • Vector search
  • Hybrid search
  • Semantic search
  • Enterprise retrieval
  • RAG pipelines

It enables applications to:

  • Index documents
  • Perform semantic retrieval
  • Store vector embeddings
  • Execute hybrid search queries

Types of Search in Azure AI Search

Keyword Search

Traditional lexical matching.

Example:

  • Exact term searches

Semantic Search

Understands contextual meaning.

Example:

  • Searching for “car” may also retrieve “vehicle.”

Vector Search

Uses embeddings to retrieve semantically similar content.

Example:

  • Finding conceptually similar documents even without exact keywords.

Hybrid Search

Combines:

  • Keyword search
  • Semantic ranking
  • Vector search

Hybrid search often produces the best retrieval quality.


When to Use Azure AI Search

Use Azure AI Search when applications require:

  • RAG
  • Semantic retrieval
  • Vector similarity search
  • Enterprise document retrieval
  • Knowledge-base search
  • Hybrid search scenarios

Example Grounding Scenario

Scenario

A healthcare chatbot must answer questions using the latest internal policy documents.

Recommended Services:

  • Azure OpenAI
  • Azure AI Search
  • Embedding models

Reason:

RAG enables grounded responses using current enterprise documents.


Vector Search Services

What Is Vector Search?

Vector search retrieves information based on semantic similarity rather than exact text matching.

Documents and queries are converted into numerical vectors called embeddings.

Similar meanings produce similar vectors.


Embedding Models

Embedding models transform content into vector representations.

These embeddings support:

  • Similarity matching
  • Semantic retrieval
  • Recommendation systems
  • RAG pipelines

Azure Services Supporting Vector Search

Azure AI Search

Primary enterprise vector search platform.


Azure Cosmos DB

Can support vector indexing and similarity search.

Useful for:

  • Globally distributed systems
  • High-scale AI applications

Azure SQL Database

Supports vector operations in modern AI workloads.

Useful for:

  • Structured enterprise systems
  • Integrated relational and AI workloads

Choosing the Correct Vector Search Service

Use Azure AI Search When:

  • Building enterprise RAG systems
  • Implementing hybrid search
  • Using semantic ranking
  • Creating AI copilots

Use Azure Cosmos DB When:

  • Global distribution is required
  • Massive scale is needed
  • NoSQL flexibility is important

Use Azure SQL Database When:

  • AI functionality must integrate with relational data
  • Existing SQL systems already exist

Agent Workflow Services

What Are AI Agents?

AI agents are AI systems capable of:

  • Reasoning
  • Planning
  • Tool usage
  • Multi-step execution
  • Task automation
  • Dynamic decision-making

Unlike basic chatbots, agents can:

  • Take actions
  • Call APIs
  • Use memory
  • Execute workflows
  • Interact with systems

Azure AI Foundry Agent Capabilities

Azure AI Foundry supports agent development with:

  • Tool calling
  • Function calling
  • Prompt orchestration
  • Workflow execution
  • Agent memory
  • Retrieval integration

Prompt Flow

Prompt Flow is a key Foundry tool for building:

  • AI workflows
  • Prompt chains
  • Tool orchestration
  • Agent pipelines
  • Multi-step AI systems

Prompt Flow helps developers:

  • Test prompts
  • Connect services
  • Evaluate outputs
  • Build reusable workflows

Tool Calling and Function Calling

LLMs can interact with external systems using:

  • Tool calling
  • Function calling

Examples:

  • Query databases
  • Call REST APIs
  • Retrieve documents
  • Send emails
  • Trigger workflows

This is a critical AI-103 topic.


Agent Workflow Scenario

Scenario

An AI travel assistant must:

  • Search flights
  • Check hotel pricing
  • Access calendars
  • Generate itineraries

Recommended Services:

  • Azure OpenAI
  • Prompt Flow
  • Agent orchestration tools
  • Tool/function calling

Reason:

This solution requires multi-step agent workflows.


Multimodal Processing Services

What Is Multimodal Processing?

Multimodal AI systems process multiple types of input such as:

  • Text
  • Images
  • Audio
  • Video
  • Documents

These systems combine multiple modalities to improve understanding.


Azure Services for Multimodal Processing

Common services include:

  • Azure OpenAI multimodal models
  • Azure AI Vision
  • Azure AI Document Intelligence
  • Azure AI Speech

Azure AI Vision

Azure AI Vision supports:

  • Image analysis
  • Object detection
  • OCR
  • Face analysis
  • Caption generation
  • Scene understanding

Use Azure AI Vision when applications require:

  • Image processing
  • Computer vision
  • OCR tasks
  • Visual analysis

Azure AI Document Intelligence

Azure AI Document Intelligence extracts structured information from documents such as:

  • Invoices
  • Receipts
  • Contracts
  • Forms
  • IDs

Capabilities include:

  • OCR
  • Key-value extraction
  • Layout analysis
  • Table extraction
  • Custom models

Azure AI Speech

Azure AI Speech supports:

  • Speech-to-text
  • Text-to-speech
  • Translation
  • Voice assistants
  • Real-time transcription

Choosing the Correct Multimodal Service

Use Azure AI Vision When:

  • Analyzing images
  • Detecting objects
  • Extracting text from images

Use Azure AI Document Intelligence When:

  • Extracting structured document data
  • Processing forms and invoices
  • Understanding layouts and tables

Use Azure AI Speech When:

  • Processing voice input
  • Building voice assistants
  • Performing speech transcription

Use Azure OpenAI Multimodal Models When:

  • Combining conversational reasoning with image understanding
  • Performing multimodal interactions
  • Building advanced AI assistants

Safety and Responsible AI Services

AI solutions require safety and governance.

Azure AI Foundry includes services such as:

  • Azure AI Content Safety
  • Content filtering
  • Prompt injection detection
  • Harm detection

These services help:

  • Detect unsafe content
  • Prevent abuse
  • Improve compliance
  • Support responsible AI development

Evaluation and Monitoring Services

Azure AI Foundry provides evaluation tools for:

  • Groundedness
  • Relevance
  • Accuracy
  • Latency
  • Cost
  • Toxicity
  • Hallucination detection

Evaluation is important because AI quality can vary significantly.


Choosing the Correct Foundry Service

The AI-103 exam frequently tests scenario-based service selection.


Scenario 1: Enterprise Knowledge Chatbot

Requirements:

  • Conversational AI
  • Enterprise document grounding
  • Semantic retrieval

Recommended Services:

  • Azure OpenAI
  • Azure AI Search
  • Embedding models

Scenario 2: Invoice Processing System

Requirements:

  • OCR
  • Table extraction
  • Structured document understanding

Recommended Services:

  • Azure AI Document Intelligence

Scenario 3: AI Agent with Workflow Automation

Requirements:

  • Tool usage
  • API calls
  • Multi-step execution

Recommended Services:

  • Azure OpenAI
  • Prompt Flow
  • Agent orchestration tools

Scenario 4: Image Analysis Application

Requirements:

  • Object detection
  • Image captioning
  • OCR

Recommended Services:

  • Azure AI Vision

Scenario 5: Semantic Product Search

Requirements:

  • Similarity search
  • Semantic retrieval
  • Vector indexing

Recommended Services:

  • Azure AI Search
  • Embedding models

Common AI-103 Exam Tips

Understand Service Roles

Know which services specialize in:

  • Generative AI
  • Retrieval
  • Search
  • Vision
  • Speech
  • Documents
  • Agent workflows

Know Common Service Pairings

Azure OpenAI + Azure AI Search

Used for:

  • RAG systems
  • Enterprise chatbots
  • Knowledge assistants

Azure OpenAI + Prompt Flow

Used for:

  • AI agents
  • Multi-step workflows
  • Tool orchestration

Azure AI Vision + Azure OpenAI

Used for:

  • Multimodal assistants
  • Visual question answering

Remember Hybrid Search

Hybrid search combines:

  • Vector search
  • Keyword search
  • Semantic ranking

This is commonly tested on AI-103.


Know When Specialized Services Are Better

Example:

  • Azure AI Document Intelligence is better for invoice extraction than using only a general-purpose LLM.

Summary

Selecting the appropriate Azure AI Foundry services is essential for building scalable, accurate, and cost-effective AI applications.

For the AI-103 exam, you should understand:

  • Which services support generative AI
  • How grounding and RAG work
  • When to use vector search
  • How AI agents are orchestrated
  • Which services support multimodal processing
  • How Azure AI Search integrates into enterprise AI systems
  • How Prompt Flow supports AI workflows
  • The role of specialized services like Vision and Document Intelligence

Strong service-selection skills are critical for both certification success and real-world Azure AI solution development.


Practice Exam Questions

Question 1

Which Azure service is MOST commonly used to provide generative AI chat capabilities?

A. Azure AI Search
B. Azure OpenAI
C. Azure AI Vision
D. Azure Monitor

Answer

B. Azure OpenAI

Explanation

Azure OpenAI provides access to GPT-based generative AI models.


Question 2

What is the primary purpose of Retrieval-Augmented Generation (RAG)?

A. Reduce GPU usage
B. Improve groundedness using retrieved data
C. Replace embeddings
D. Eliminate vector search

Answer

B. Improve groundedness using retrieved data

Explanation

RAG retrieves relevant information to ground LLM responses.


Question 3

Which Azure service is MOST appropriate for vector search and semantic retrieval?

A. Azure AI Search
B. Azure Backup
C. Azure DNS
D. Azure Automation

Answer

A. Azure AI Search

Explanation

Azure AI Search provides vector indexing and semantic retrieval capabilities.


Question 4

Which Foundry tool is designed for building multi-step AI workflows and prompt orchestration?

A. Azure Policy
B. Prompt Flow
C. Azure Backup
D. Azure DevOps

Answer

B. Prompt Flow

Explanation

Prompt Flow supports orchestration of prompts, tools, and workflows.


Question 5

A solution must extract tables and key-value pairs from invoices. Which service is MOST appropriate?

A. Azure AI Vision
B. Azure AI Document Intelligence
C. Azure Monitor
D. Azure AI Search

Answer

B. Azure AI Document Intelligence

Explanation

Document Intelligence specializes in structured document extraction.


Question 6

Which capability allows an LLM to interact with APIs and external systems?

A. OCR
B. Function calling
C. Vectorization
D. Semantic ranking

Answer

B. Function calling

Explanation

Function calling enables AI models to invoke external tools and APIs.


Question 7

Which Azure service is MOST appropriate for image analysis and object detection?

A. Azure AI Vision
B. Azure AI Search
C. Azure Cosmos DB
D. Azure SQL Database

Answer

A. Azure AI Vision

Explanation

Azure AI Vision provides computer vision capabilities.


Question 8

What is the main purpose of embeddings in AI applications?

A. Image generation
B. Semantic vector representation
C. Text-to-speech conversion
D. Function orchestration

Answer

B. Semantic vector representation

Explanation

Embeddings convert content into vectors for semantic similarity operations.


Question 9

Which search method combines vector search, keyword search, and semantic ranking?

A. Lexical search
B. OCR search
C. Hybrid search
D. Binary search

Answer

C. Hybrid search

Explanation

Hybrid search combines multiple retrieval methods for improved results.


Question 10

Which Azure AI service is MOST appropriate for speech-to-text transcription?

A. Azure AI Speech
B. Azure AI Search
C. Azure AI Vision
D. Azure Policy

Answer

A. Azure AI Speech

Explanation

Azure AI Speech provides speech recognition and transcription capabilities.


Go to the AI-103 Exam Prep Hub main page

Choose an appropriate model for each task, including large language models (LLMs), small language models, multimodal models, and Foundry Tools (AI-103 Exam Prep)

This post is a part of the AI-103: Develop AI Apps and Agents on Azure Exam Prep Hub. 
This topic falls under these sections:
Plan and manage an Azure AI solution (25–30%)
--> Choose the appropriate Foundry services for generative AI and agents
--> Choose an appropriate model for each task, including large language models (LLMs), small language models, multimodal models, and Foundry Tools


Note that there are 10 practice questions (with answers and explanations) at the end of each section to help you solidify your knowledge of the material. Also, there are 2 practice tests with 60 questions each available from the hub's main page below the exam topics section.

Introduction

One of the most important skills for the AI-103: Develop AI Apps and Agents on Azure certification exam is understanding how to choose the correct AI model and supporting Azure AI Foundry tools for a given business or technical scenario.

Modern AI development is no longer about simply selecting “an AI model.” Instead, developers must evaluate:

  • The type of task being performed
  • Cost constraints
  • Latency requirements
  • Accuracy expectations
  • Reasoning complexity
  • Context window needs
  • Multimodal capabilities
  • Deployment environment
  • Security and governance requirements
  • Agent orchestration requirements

Azure AI Foundry provides access to multiple categories of models and tools that help developers build generative AI applications and AI agents efficiently.

For the AI-103 exam, you should understand:

  • When to use Large Language Models (LLMs)
  • When Small Language Models (SLMs) are preferable
  • When multimodal models are required
  • How Azure AI Foundry tools support model selection and orchestration
  • Tradeoffs between performance, cost, speed, and capability
  • Common real-world scenarios for each model category

Azure AI Foundry Overview

Azure AI Foundry is Microsoft’s unified platform for building, evaluating, deploying, and managing AI applications and agents.

Azure AI Foundry provides:

  • Access to foundation models
  • Agent development capabilities
  • Prompt engineering tools
  • Evaluation tools
  • Safety and content filtering
  • Retrieval-augmented generation (RAG) support
  • Fine-tuning capabilities
  • Monitoring and observability
  • Integration with Azure AI services

Azure AI Foundry enables developers to:

  • Compare multiple models
  • Test prompts
  • Evaluate outputs
  • Build AI agents
  • Connect enterprise data
  • Deploy scalable AI applications

For the AI-103 exam, understanding the relationship between model capabilities and Azure AI Foundry tools is extremely important.


Understanding Model Categories

The exam focuses heavily on selecting the correct model type for specific tasks.

The major categories include:

  1. Large Language Models (LLMs)
  2. Small Language Models (SLMs)
  3. Multimodal Models
  4. Embedding Models
  5. Specialized Models

Each category serves different purposes.


Large Language Models (LLMs)

What Are Large Language Models?

Large Language Models are advanced AI models trained on massive datasets containing text, code, and other information.

LLMs are designed for:

  • Natural language understanding
  • Natural language generation
  • Complex reasoning
  • Summarization
  • Coding assistance
  • Question answering
  • Conversational AI
  • Agent workflows
  • Content creation

Examples include:

  • GPT-4 family models
  • GPT-4o models
  • GPT-4 Turbo
  • Phi large models
  • Other frontier foundation models available in Azure AI Foundry

Characteristics of LLMs

Strengths

LLMs are excellent at:

Complex Reasoning

Examples:

  • Multi-step problem solving
  • Data interpretation
  • Logical analysis
  • Decision support

Advanced Content Generation

Examples:

  • Marketing content
  • Technical documentation
  • Email drafting
  • Knowledge-base generation

Conversational Experiences

Examples:

  • AI chatbots
  • AI copilots
  • Virtual assistants
  • Interactive tutoring systems

Agentic Workflows

LLMs are commonly used as the “reasoning engine” behind AI agents.

They can:

  • Plan tasks
  • Determine next actions
  • Call tools
  • Use memory
  • Chain workflows
  • Interact with APIs

Limitations of LLMs

Although powerful, LLMs have tradeoffs.

Higher Cost

LLMs generally:

  • Require more compute
  • Cost more per token
  • Increase infrastructure expenses

Increased Latency

Larger models may:

  • Respond more slowly
  • Increase application response times
  • Affect real-time user experiences

Resource Requirements

LLMs require:

  • More GPU resources
  • More memory
  • Larger deployments

Overkill for Simple Tasks

Using GPT-4-level reasoning for basic classification or short summarization tasks may be unnecessary and expensive.


When to Use LLMs

Choose an LLM when tasks require:

  • Advanced reasoning
  • Long-context understanding
  • High-quality content generation
  • Complex conversational behavior
  • Tool calling and agent orchestration
  • Coding assistance
  • Sophisticated summarization
  • Enterprise copilots

Example LLM Scenarios

Scenario 1: Enterprise AI Copilot

A company wants an AI assistant that:

  • Reads internal documentation
  • Answers employee questions
  • Generates summaries
  • Explains policies
  • Uses tools and APIs

Best choice:

  • Large Language Model with RAG integration

Reason:

  • Requires reasoning and conversational understanding.

Scenario 2: AI Coding Assistant

A development team needs:

  • Code generation
  • Debugging suggestions
  • Refactoring support
  • Documentation generation

Best choice:

  • Advanced LLM

Reason:

  • Coding tasks require complex contextual reasoning.

Small Language Models (SLMs)

What Are Small Language Models?

Small Language Models are more lightweight AI models optimized for:

  • Faster responses
  • Lower costs
  • Lower resource consumption
  • Edge deployments
  • Narrower tasks

Examples include:

  • Smaller Phi models
  • Compact transformer-based models
  • Task-specific lightweight models

Characteristics of SLMs

Strengths

Lower Cost

SLMs:

  • Consume fewer resources
  • Cost less to run
  • Reduce token usage costs

Faster Inference

SLMs typically:

  • Respond more quickly
  • Improve responsiveness
  • Support near real-time interactions

Edge and Mobile Suitability

SLMs may run:

  • On edge devices
  • On mobile hardware
  • In constrained environments

Efficient for Narrow Tasks

SLMs work well for:

  • Classification
  • Basic summarization
  • Intent detection
  • Simple chat interactions
  • Lightweight automation

Limitations of SLMs

Reduced Reasoning Ability

Compared to LLMs, SLMs may struggle with:

  • Complex logic
  • Long context handling
  • Multi-step reasoning
  • Sophisticated conversations

Lower Output Quality

Outputs may:

  • Be less nuanced
  • Contain reduced detail
  • Provide weaker contextual understanding

When to Use SLMs

Choose an SLM when:

  • Speed is critical
  • Cost optimization matters
  • Tasks are relatively simple
  • Edge deployment is needed
  • High throughput is required
  • Lightweight AI experiences are sufficient

Example SLM Scenarios

Scenario 1: Customer Intent Classification

An application classifies support tickets into categories such as:

  • Billing
  • Technical support
  • Returns
  • Sales

Best choice:

  • Small Language Model

Reason:

  • Classification is relatively simple and does not require advanced reasoning.

Scenario 2: Edge Device Assistant

A manufacturing company deploys an AI assistant on factory equipment with limited compute.

Best choice:

  • Small Language Model

Reason:

  • Edge environments benefit from lightweight models.

Multimodal Models

What Are Multimodal Models?

Multimodal models can process multiple data types simultaneously.

Examples include:

  • Text
  • Images
  • Audio
  • Video
  • Documents

These models combine information across modalities to produce richer outputs.


Capabilities of Multimodal Models

Multimodal models can:

  • Analyze images and answer questions about them
  • Generate captions from images
  • Extract information from documents
  • Process speech and text together
  • Understand charts and diagrams
  • Support visual reasoning

Common Multimodal Tasks

Image Understanding

Examples:

  • Object detection
  • Scene analysis
  • Image captioning
  • Visual question answering

Document Intelligence

Examples:

  • Invoice extraction
  • Receipt processing
  • Form analysis
  • OCR workflows

Audio + Text Experiences

Examples:

  • Voice assistants
  • Meeting summarization
  • Speech transcription
  • Audio analysis

When to Use Multimodal Models

Choose multimodal models when applications involve:

  • Images and text together
  • Document processing
  • Speech interactions
  • Visual understanding
  • Cross-modal reasoning

Example Multimodal Scenarios

Scenario 1: Invoice Processing

A company needs to:

  • Read invoices
  • Extract totals
  • Identify vendors
  • Validate line items

Best choice:

  • Multimodal document processing model

Reason:

  • The solution must interpret both layout and text.

Scenario 2: Retail Image Assistant

Users upload photos of products and ask questions about them.

Best choice:

  • Multimodal model

Reason:

  • Requires simultaneous image and text understanding.

Embedding Models

What Are Embedding Models?

Embedding models convert text or other content into vector representations.

These vectors capture semantic meaning.

Embedding models are essential for:

  • Semantic search
  • Retrieval-Augmented Generation (RAG)
  • Similarity matching
  • Recommendation systems
  • Knowledge retrieval

Retrieval-Augmented Generation (RAG)

RAG combines:

  • Embedding models
  • Vector databases
  • LLMs

Workflow:

  1. Convert documents into embeddings
  2. Store embeddings in a vector index
  3. Convert user query into embeddings
  4. Retrieve relevant content
  5. Send retrieved data to the LLM

RAG improves:

  • Accuracy
  • Freshness of information
  • Enterprise grounding
  • Hallucination reduction

Specialized Models

Some tasks are better handled by specialized AI models instead of general-purpose LLMs.

Examples:

  • Translation models
  • Speech models
  • OCR models
  • Vision models
  • Classification models

Why Specialized Models Matter

Specialized models may provide:

  • Better accuracy
  • Lower cost
  • Faster performance
  • Simpler deployment

Example:

Using a dedicated OCR service is often more efficient than asking an LLM to read text from images.


Model Selection Factors

The AI-103 exam heavily tests your ability to select the correct model based on requirements.


Factor 1: Task Complexity

Use LLMs For:

  • Advanced reasoning
  • Multi-step workflows
  • Complex conversations

Use SLMs For:

  • Simple classification
  • Lightweight interactions
  • Fast automation

Factor 2: Cost

LLMs

  • Higher operational cost
  • More expensive inference

SLMs

  • Lower operational cost
  • Better for high-volume workloads

Factor 3: Latency

Low-Latency Requirements

Prefer:

  • SLMs
  • Lightweight models

Complex Processing

Prefer:

  • LLMs

Even if response time increases.


Factor 4: Context Window

Some tasks require processing:

  • Long documents
  • Large conversations
  • Extensive histories

Choose models with larger context windows for:

  • Legal analysis
  • Knowledge assistants
  • Long-form summarization

Factor 5: Multimodal Requirements

If the application involves:

  • Images
  • Audio
  • Video
  • Documents

Choose multimodal-capable models.


Factor 6: Deployment Environment

Cloud-Hosted Applications

May use:

  • Large frontier models
  • GPU-intensive deployments

Edge or Mobile Deployments

Prefer:

  • Small models
  • Quantized models
  • Lightweight inference

Azure AI Foundry Tools

Azure AI Foundry includes numerous tools that support model selection and AI application development.


Model Catalog

The Model Catalog allows developers to:

  • Browse available models
  • Compare capabilities
  • Review benchmarks
  • Deploy models
  • Evaluate pricing

The catalog includes:

  • Microsoft-hosted models
  • Open-source models
  • Partner models
  • Frontier models

Prompt Flow

Prompt Flow helps developers:

  • Build AI workflows
  • Chain prompts together
  • Integrate tools
  • Evaluate prompts
  • Test model behavior

Prompt Flow is useful for:

  • Agent orchestration
  • RAG pipelines
  • Multi-step AI workflows

AI Agent Development Tools

Azure AI Foundry supports AI agents that can:

  • Use tools
  • Access data
  • Maintain memory
  • Perform actions
  • Execute workflows

Agent frameworks may include:

  • Tool calling
  • Function calling
  • Retrieval integration
  • Multi-agent orchestration

Evaluation Tools

Evaluation tools help developers assess:

  • Accuracy
  • Groundedness
  • Safety
  • Relevance
  • Latency
  • Cost

Evaluation is critical because model quality varies by task.


Content Safety Tools

Azure AI Foundry includes safety features such as:

  • Content filtering
  • Harm detection
  • Prompt injection detection
  • Responsible AI controls

These tools help ensure safe AI deployments.


Fine-Tuning Tools

Fine-tuning allows developers to customize models using:

  • Domain-specific data
  • Proprietary terminology
  • Specialized workflows

Fine-tuning may improve:

  • Accuracy
  • Consistency
  • Industry-specific responses

However, fine-tuning also:

  • Increases cost
  • Requires data preparation
  • Adds operational complexity

Choosing Between Prompt Engineering, RAG, and Fine-Tuning

This is a very important AI-103 exam topic.


Prompt Engineering

Use when:

  • You need quick customization
  • Tasks are general-purpose
  • No private data integration is needed

Advantages:

  • Fast
  • Cheap
  • Easy to maintain

RAG

Use when:

  • You need current or proprietary data
  • You want grounding in enterprise content
  • You need dynamic knowledge retrieval

Advantages:

  • Reduces hallucinations
  • Keeps knowledge current
  • Avoids retraining

Fine-Tuning

Use when:

  • Consistent specialized outputs are required
  • Domain language is highly unique
  • Behavioral customization is necessary

Advantages:

  • Tailored responses
  • Better domain alignment

Real-World Model Selection Examples

Example 1: FAQ Chatbot

Requirements:

  • Low cost
  • Fast responses
  • Basic conversational support

Best Choice:

  • Small Language Model + RAG

Example 2: Legal Document Assistant

Requirements:

  • Long-context understanding
  • Detailed summarization
  • Advanced reasoning

Best Choice:

  • Large Language Model with large context window

Example 3: Mobile AI App

Requirements:

  • Offline capability
  • Fast performance
  • Low resource usage

Best Choice:

  • Small Language Model

Example 4: Image-Based Customer Support

Requirements:

  • Analyze uploaded photos
  • Understand text and images
  • Generate responses

Best Choice:

  • Multimodal model

Key AI-103 Exam Tips

Understand Tradeoffs

You should know:

  • Bigger models are not always better
  • Simpler tasks may not require advanced LLMs
  • Cost and latency matter
  • Specialized models may outperform general models

Know Common Pairings

LLM + RAG

Used for:

  • Enterprise chatbots
  • Knowledge assistants
  • AI copilots

Embeddings + Vector Search

Used for:

  • Semantic search
  • Knowledge retrieval
  • Similarity matching

Multimodal Models

Used for:

  • Vision AI
  • Document processing
  • Audio interactions

Learn the Azure AI Foundry Ecosystem

Know the purpose of:

  • Model Catalog
  • Prompt Flow
  • Evaluation tools
  • Agent tools
  • Safety systems
  • Fine-tuning workflows

Summary

Selecting the correct AI model is one of the most important responsibilities for an Azure AI developer.

For the AI-103 exam, you should understand:

  • The differences between LLMs and SLMs
  • When multimodal models are required
  • How embedding models support RAG
  • When specialized models outperform general-purpose models
  • The tradeoffs between cost, speed, and reasoning capability
  • How Azure AI Foundry tools support AI development and orchestration

In real-world AI systems, choosing the correct model can dramatically improve:

  • Performance
  • User experience
  • Scalability
  • Operational cost
  • Reliability
  • Maintainability

A strong understanding of model selection is essential for designing effective Azure AI applications and AI agents.


Practice Exam Questions

Question 1

A company is building an enterprise AI assistant that must answer complex employee questions using internal documentation and perform multi-step reasoning. Which model type is MOST appropriate?

A. Small Language Model (SLM)
B. Embedding model only
C. Large Language Model (LLM)
D. OCR model

Answer

C. Large Language Model (LLM)

Explanation

Complex reasoning and conversational understanding are best handled by LLMs.


Question 2

Which model type is generally BEST for low-cost, low-latency classification tasks?

A. Large multimodal model
B. Small Language Model (SLM)
C. GPT-4-class reasoning model
D. Vision foundation model

Answer

B. Small Language Model (SLM)

Explanation

SLMs are optimized for lightweight and cost-efficient tasks.


Question 3

A solution must process uploaded invoices and extract totals, vendor names, and line items. Which model type is MOST appropriate?

A. Embedding model
B. Small Language Model
C. Multimodal model
D. Translation model

Answer

C. Multimodal model

Explanation

Invoice extraction requires understanding both layout and text.


Question 4

What is the primary purpose of embedding models?

A. Image generation
B. Semantic vector representation
C. Audio transcription
D. Tool orchestration

Answer

B. Semantic vector representation

Explanation

Embedding models convert content into vectors for semantic search and retrieval.


Question 5

Which Azure AI Foundry tool helps developers chain prompts, integrate tools, and build AI workflows?

A. Azure Monitor
B. Prompt Flow
C. Azure Policy
D. Azure Functions

Answer

B. Prompt Flow

Explanation

Prompt Flow is designed for workflow orchestration and prompt pipelines.


Question 6

A mobile AI application must operate with minimal compute resources and very fast response times. Which model type is MOST appropriate?

A. Large Language Model
B. Small Language Model
C. Large multimodal model
D. High-context reasoning model

Answer

B. Small Language Model

Explanation

SLMs are optimized for lightweight and edge deployments.


Question 7

Which approach is BEST when an AI chatbot must use current enterprise data without retraining the model?

A. Fine-tuning only
B. Prompt engineering only
C. Retrieval-Augmented Generation (RAG)
D. Quantization

Answer

C. Retrieval-Augmented Generation (RAG)

Explanation

RAG retrieves current information dynamically without retraining.


Question 8

Which factor MOST strongly indicates that a multimodal model is required?

A. Need for vector embeddings
B. Need for faster response times
C. Need to process images and text together
D. Need for lower cost

Answer

C. Need to process images and text together

Explanation

Multimodal models handle multiple input modalities simultaneously.


Question 9

What is a major tradeoff of using larger language models?

A. Reduced reasoning capability
B. Lower context windows
C. Increased operational cost
D. Inability to support agents

Answer

C. Increased operational cost

Explanation

Larger models typically require more compute resources and cost more.


Question 10

Which Azure AI Foundry capability helps evaluate model quality, safety, and groundedness?

A. Azure Load Balancer
B. Evaluation tools
C. Azure Backup
D. Traffic Manager

Answer

B. Evaluation tools

Explanation

Evaluation tools assess output quality, safety, and performance metrics.


Go to the AI-103 Exam Prep Hub main page

Implement a solution that generates videos from text prompts and reference media (AI-103 Exam Prep)

This post is a part of the AI-103: Develop AI Apps and Agents on Azure Exam Prep Hub. 
This topic falls under these sections:
Implement computer vision solutions (10–15%)
--> Design and implement image- and video-generation solutions
--> Implement a solution that generates videos from text prompts and reference media


Note that there are 10 practice questions (with answers and explanations) at the end of each section to help you solidify your knowledge of the material. Also, there are 2 practice tests with 60 questions each available from the hub's main page below the exam topics section.

Introduction

Generative AI is rapidly expanding beyond text and images into video generation. Modern AI systems can now create short videos, animations, cinematic scenes, marketing clips, and visual simulations using:

  • Natural language prompts
  • Existing videos
  • Reference images
  • Style examples
  • Storyboards
  • Multi-modal inputs

For the AI-103 certification exam, you should understand how to design and implement solutions that generate videos from:

  • Text prompts
  • Reference media
  • Multi-modal instructions

You should also understand:

  • Video generation workflows
  • Multi-modal AI concepts
  • Prompt engineering for video
  • Video editing and transformation
  • Responsible AI considerations
  • Performance and scalability
  • Azure AI services used in video generation pipelines

This topic falls under:

“Design and implement image- and video-generation solutions”


What Is AI Video Generation?

AI video generation uses generative AI models to create or modify videos based on user instructions.

Inputs may include:

  • Text prompts
  • Images
  • Existing videos
  • Style references
  • Scene descriptions
  • Character references
  • Motion instructions

Outputs may include:

  • Animated clips
  • Cinematic scenes
  • Marketing videos
  • Product demonstrations
  • Simulated environments
  • AI-enhanced video edits

Text-to-Video Generation

What Is Text-to-Video Generation?

Text-to-video generation converts natural language descriptions into video sequences.

Example prompt:

A drone flying through a futuristic city at night with neon lights reflecting on wet streets

The model interprets:

  • Objects
  • Movement
  • Lighting
  • Scene transitions
  • Camera motion
  • Temporal consistency

and generates a video sequence.


How Video Generation Differs from Image Generation

Video generation is more complex because models must maintain:

  • Motion consistency
  • Temporal continuity
  • Object persistence
  • Lighting stability
  • Camera coherence

Instead of generating a single frame, the model generates a sequence of connected frames.


Temporal Consistency

What Is Temporal Consistency?

Temporal consistency ensures that:

  • Objects remain stable across frames
  • Characters retain appearance
  • Motion looks natural
  • Lighting stays coherent

Without temporal consistency:

  • Objects may flicker
  • Faces may distort
  • Backgrounds may shift unpredictably

Common Video Generation Use Cases

Marketing and Advertising

Generate:

  • Promotional videos
  • Social media content
  • Product showcases

Entertainment and Media

Create:

  • Animations
  • Storyboards
  • Visual effects
  • Cinematic previews

Education and Training

Generate:

  • Simulations
  • Tutorials
  • Visual explanations

Gaming

Create:

  • Cutscenes
  • Environmental animations
  • NPC interactions

Enterprise Applications

Generate:

  • Training videos
  • Virtual demonstrations
  • AI-powered presentations

Video Generation Models

Modern AI video systems commonly use:

  • Diffusion models
  • Transformer architectures
  • Multi-modal generative models

These models learn relationships between:

  • Text
  • Images
  • Motion
  • Time sequences

Diffusion Models for Video

Video diffusion models operate similarly to image diffusion models but add temporal processing.

The model:

  1. Starts with noisy frames
  2. Gradually refines them
  3. Maintains frame-to-frame consistency

Multi-Modal Video Generation

Video generation often combines:

  • Text prompts
  • Images
  • Motion guidance
  • Audio
  • Style references

This is called multi-modal generation.


Example Multi-Modal Workflow

Inputs:

  • Character image
  • Text prompt
  • Style reference

Output:

  • Animated video clip matching the character and style

Prompt Engineering for Video Generation

Why Prompt Engineering Matters

Prompt design strongly affects:

  • Scene quality
  • Motion realism
  • Camera movement
  • Style consistency
  • Subject accuracy

Effective Video Prompt Components

Strong prompts often include:

ComponentExample
Subject“A red sports car”
Action“driving through mountain roads”
Environment“during sunrise”
Camera movement“cinematic tracking shot”
Style“photorealistic”
Mood“dramatic atmosphere”

Example Prompt

A photorealistic cinematic tracking shot of a red sports car driving through mountain roads during sunrise, dramatic atmosphere, ultra detailed

Camera and Motion Instructions

Prompts can specify:

  • Zoom
  • Pan
  • Tilt
  • Tracking shots
  • Slow motion
  • Time-lapse

Example:

Slow-motion close-up shot of ocean waves crashing against rocks

Reference Media in Video Generation

Reference media guides the model using:

  • Existing videos
  • Images
  • Character designs
  • Motion examples
  • Style references

Benefits of Reference Media

Reference media helps maintain:

  • Character consistency
  • Brand identity
  • Visual continuity
  • Artistic style
  • Scene structure

Image-to-Video Generation

What Is Image-to-Video Generation?

Image-to-video generation animates a static image.

The system adds:

  • Motion
  • Camera movement
  • Environmental effects
  • Character animation

Example

Input:

  • Portrait image

Prompt:

The person smiles gently while wind moves through their hair

Output:

  • Animated portrait video

Video-to-Video Transformation

What Is Video-to-Video Transformation?

Video-to-video systems modify existing videos while preserving motion structure.

Examples:

  • Style conversion
  • Cartoon transformation
  • Lighting changes
  • Scene modifications

Storyboard-Based Generation

Some systems generate videos from storyboard sequences.

Inputs may include:

  • Scene descriptions
  • Frame sketches
  • Timing instructions

The orchestration system generates connected scenes.


Video Editing with AI

Generative AI can also:

  • Remove objects
  • Replace backgrounds
  • Extend scenes
  • Improve quality
  • Add effects
  • Upscale video resolution

Inpainting for Video

Video inpainting edits selected regions across multiple frames.

Use cases:

  • Removing unwanted objects
  • Editing environments
  • Replacing logos
  • Correcting defects

Outpainting for Video

Video outpainting expands scenes beyond original frame boundaries.

Examples:

  • Widening landscapes
  • Expanding cinematic shots
  • Creating panoramic sequences

Responsible AI Considerations

Video generation introduces major Responsible AI concerns.


Deepfake Risks

AI-generated videos can impersonate real people.

Potential misuse includes:

  • Misinformation
  • Fraud
  • Identity impersonation

Harmful Content

Generated videos may contain:

  • Violence
  • Hate content
  • Explicit material
  • Unsafe scenarios

Copyright and Ownership

Generated videos may resemble:

  • Copyrighted characters
  • Artistic styles
  • Existing content

Organizations must ensure legal compliance.


Bias and Fairness

Generative systems may unintentionally reinforce:

  • Stereotypes
  • Representation bias
  • Cultural inaccuracies

Azure AI Content Safety

Microsoft provides:
Azure AI Content Safety

to help evaluate:

  • Unsafe prompts
  • Harmful generated outputs
  • Policy violations

Watermarking and Provenance

AI-generated videos may include:

  • Watermarks
  • Metadata
  • Provenance tracking

These help identify synthetic media.


Video Generation Workflow

A typical workflow may include:

  1. User submits prompt
  2. Input validation occurs
  3. Reference media processed
  4. Prompt enhancement
  5. Video model generates frames
  6. Temporal consistency checks occur
  7. Safety filtering runs
  8. Final rendering occurs
  9. Video stored or streamed

Performance Considerations

Video generation is computationally expensive.

Factors affecting performance include:

  • Video length
  • Resolution
  • Frame rate
  • Model complexity
  • Hardware acceleration

GPU Acceleration

Video generation heavily relies on GPUs for:

  • Parallel frame generation
  • Matrix operations
  • Rendering acceleration

Latency Challenges

Video generation typically requires more time than image generation because:

  • Many frames must be generated
  • Temporal relationships must be preserved
  • Rendering workloads are larger

Optimization Techniques

Generate Lower Resolution Drafts

Preview before full rendering.


Frame Interpolation

Generate fewer frames and interpolate intermediate motion.


Batch Rendering

Process multiple frames simultaneously.


Progressive Rendering

Return low-quality previews while high-quality rendering continues.


Azure Services for Video Generation Solutions

Azure OpenAI Service

Azure OpenAI Service

Supports:

  • Multi-modal AI workflows
  • Prompt-based generation
  • Integration with generative AI applications

Azure AI Foundry

Azure AI Foundry

Supports:

  • AI workflow orchestration
  • Prompt flows
  • Model evaluation
  • Multi-modal pipelines

Azure AI Vision

Azure AI Vision

Can support:

  • Scene analysis
  • Object recognition
  • Video understanding workflows

Azure Blob Storage

Azure Blob Storage

Frequently used for:

  • Storing generated videos
  • Media asset management
  • Content delivery integration

Azure Functions

Azure Functions

Often used for:

  • Video processing workflows
  • Trigger-based orchestration
  • Rendering automation

Integrating Video Generation into Applications

Applications may integrate AI video generation into:

  • Marketing platforms
  • Creative tools
  • Mobile apps
  • Enterprise copilots
  • Learning systems
  • Media production workflows

Example Enterprise Architecture

An enterprise training platform might:

  1. Accept a text lesson
  2. Generate storyboard prompts
  3. Create AI-generated training videos
  4. Apply narration and subtitles
  5. Run safety validation
  6. Store final videos in Blob Storage

Observability for Video Generation

Production systems should monitor:

  • Rendering latency
  • GPU utilization
  • Failed generations
  • Storage usage
  • Safety violations
  • Cost metrics

Human-in-the-Loop Review

Organizations often require manual review for:

  • Public-facing media
  • Brand-sensitive content
  • Regulated industries
  • High-risk synthetic media

Best Practices for Video Generation Solutions

Use Detailed Prompts

Detailed instructions improve video quality.


Use Reference Media Carefully

Ensure proper licensing and compliance.


Implement Content Safety

Validate prompts and generated outputs.


Monitor Computational Costs

Video generation can be expensive.


Optimize for Performance

Balance quality with rendering time.


Track Provenance

Identify synthetic content appropriately.


Use Human Review for Sensitive Content

Particularly important for public or regulated use cases.


Real-World Example

A travel company may implement a video generation solution that:

  1. Accepts destination photos
  2. Accepts prompt:
Create a cinematic tropical vacation advertisement with drone footage, sunset lighting, and relaxing atmosphere
  1. Generates short promotional videos
  2. Applies safety and brand validation
  3. Stores approved videos in Azure Blob Storage

This demonstrates:

  • Text-to-video generation
  • Reference media usage
  • Workflow orchestration
  • Responsible AI controls

Exam Tips for AI-103

For the AI-103 exam, remember these important concepts:

  • Text-to-video generation creates videos from natural language prompts.
  • Video generation requires temporal consistency across frames.
  • Reference media helps preserve style and continuity.
  • Multi-modal generation combines text, images, and motion guidance.
  • Prompt engineering strongly affects video quality.
  • Image-to-video generation animates static images.
  • Video-to-video transformation modifies existing videos.
  • Responsible AI concerns include deepfakes and harmful content.
  • Azure AI Content Safety helps moderate unsafe content.
  • GPU acceleration is critical for video generation performance.
  • Azure Blob Storage is commonly used for storing generated media.

Practice Exam Questions

Question 1

What is the primary purpose of text-to-video generation?

A. Compressing video files
B. Creating videos from natural language prompts
C. Encrypting media assets
D. Detecting malware in video streams

Answer

B. Creating videos from natural language prompts

Explanation

Text-to-video systems generate video sequences from prompt-based instructions.


Question 2

Why is temporal consistency important in AI video generation?

A. It reduces storage costs
B. It encrypts generated videos
C. It removes all latency
D. It ensures stable and coherent motion across frames

Answer

D. It ensures stable and coherent motion across frames

Explanation

Temporal consistency prevents flickering and maintains object continuity.


Question 3

What is image-to-video generation?

A. Converting videos into audio
B. Compressing images into ZIP files
C. Animating a static image into a video sequence
D. Translating subtitles automatically

Answer

C. Animating a static image into a video sequence

Explanation

Image-to-video generation adds movement and animation to still images.


Question 4

What is a common use of reference media in video generation?

A. Reducing network bandwidth
B. Maintaining visual consistency and style
C. Encrypting prompts
D. Eliminating GPU requirements

Answer

B. Maintaining visual consistency and style

Explanation

Reference media helps preserve branding, character appearance, and artistic direction.


Question 5

Which type of model is commonly used in AI video generation?

A. Diffusion models
B. Spreadsheet models
C. DNS models
D. Relational models

Answer

A. Diffusion models

Explanation

Diffusion-based architectures are widely used for generative media tasks.


Question 6

What is video inpainting?

A. Increasing frame rates automatically
B. Editing selected regions across video frames
C. Compressing video metadata
D. Removing subtitles

Answer

B. Editing selected regions across video frames

Explanation

Video inpainting modifies targeted portions of videos across multiple frames.


Question 7

Which Azure service helps detect harmful generated content?

A. Azure CDN
B. Azure Virtual WAN
C. Azure DNS
D. Azure AI Content Safety

Answer

D. Azure AI Content Safety

Explanation

Azure AI Content Safety evaluates prompts and outputs for unsafe or policy-violating content.


Question 8

Why are GPUs commonly used in video generation?

A. GPUs eliminate the need for prompts
B. GPUs improve parallel processing for rendering and generation
C. GPUs automatically moderate unsafe content
D. GPUs reduce internet latency

Answer

B. GPUs improve parallel processing for rendering and generation

Explanation

Video generation requires intensive computation that GPUs handle efficiently.


Question 9

Which Azure storage service is commonly used for storing generated videos?

A. Azure Blob Storage
B. Azure Queue Storage
C. Azure DNS
D. Azure Firewall

Answer

A. Azure Blob Storage

Explanation

Azure Blob Storage is commonly used for storing large media files.


Question 10

What is a major Responsible AI concern associated with AI-generated videos?

A. Deepfake misuse
B. Reduced CPU temperatures
C. Faster SQL queries
D. Lower image resolution

Answer

A. Deepfake misuse

Explanation

AI-generated videos can potentially be used for impersonation or misinformation.


Go to the AI-103 Exam Prep Hub main page

Exam Prep Hub for AI-901: Azure AI Fundamentals

Welcome to the AI-901: Azure AI Fundamentals Exam Prep Hub!

Welcome to the one-stop hub with information for preparing for the AI-901: Azure AI Fundamentals certification exam. The content for this exam helps you to demonstrate that “you have conceptual knowledge of AI solutions in Azure and the foundational technical skills to work with them”. You will also need “knowledge of Python coding syntax and programming techniques, and you should be familiar with Azure resources”.
Upon successful completion of the exam, you earn the Microsoft Certified: Azure AI Fundamentals certification.

This hub provides information directly here (topic-by-topic as outlined in the official study guide), links to a number of external resources, tips for preparing for the exam, practice tests, and section questions to help you prepare. Bookmark this page and use it as a guide to ensure that you are fully covering all relevant topics for the AI-901 exam and making use of as many of the resources available as possible.


Audience profile (from Microsoft’s site)



As a candidate for this Microsoft Certification, you’re at the beginning of your career in AI solution development. These Microsoft certifications offer opportunities to demonstrate your understanding of machine learning, AI concepts, and Azure services, whether you are starting your career or advancing your skills in AI solution development. Both certifications are designed for candidates from technical and non-technical backgrounds—prior experience in data science or software engineering is not required, though familiarity with basic cloud concepts and client-server applications will be helpful.
For the AI-901, you should have foundational knowledge of AI workloads and understand the basic principles of AI and machine learning. And also, you should have foundational technical skills for working with AI solutions in Azure, conceptual knowledge of Azure-based AI solutions, and familiarity with Python coding syntax and programming techniques, as well as Azure resources.
You may be eligible for ACE college credit if you pass this certification. See ACE college credit for certification exams for details.


Skills at a glance (as specified in the official study guide)

  • Identify AI concepts and responsibilities (40–45%)
  • Implement AI solutions by using Microsoft Foundry (55–60%)

Topic-by-Topic Exam Content

[click a topic link to access the content and practice questions for that topic]

Identify AI concepts and capabilities (40–45%)

Describe principles of responsible AI

Identify AI model components and configurations

Identify AI workloads

Implement AI solutions by using Microsoft Foundry (55–60%)

Implement generative AI apps and agents by using Foundry

Implement AI solutions for text and speech by using Foundry

Implement AI solutions with computer vision and image-generation capabilities by using Foundry

Implement AI solutions for information extraction by using Foundry


AI-901 Practice Exams


Important AI-901 Resources


Good luck to you on your data journey!

AI-901: Microsoft Azure AI Fundamentals – Practice Exam #4 (30 Questions)


Question 1

Which AI workload is BEST suited for forecasting future inventory demand?

A. Regression
B. OCR
C. Object detection
D. Speech synthesis


Correct Answer

A. Regression


Explanation

Regression predicts continuous numeric values, such as future inventory levels or sales demand.


Question 2

A company wants to automatically categorize support tickets into departments such as Billing, Technical Support, and Sales.

Which machine learning technique should be used?

A. Classification
B. Clustering
C. Regression
D. Forecasting


Correct Answer

A. Classification


Question 3

Which Responsible AI principle emphasizes designing systems that can be used by people with disabilities?

A. Inclusiveness
B. Reliability
C. Accountability
D. Transparency


Correct Answer

A. Inclusiveness


Question 4

What is tokenization in generative AI?

A. Breaking text into smaller pieces for processing
B. Encrypting storage devices
C. Compressing images
D. Improving network bandwidth


Correct Answer

A. Breaking text into smaller pieces for processing


Question 5

HOTSPOT / MATCHING

Match each AI capability with the appropriate use case.

Use CaseCapability
Creating realistic AI-generated artwork?
Identifying the language of a sentence?
Extracting text from a passport image?

Options:

  • OCR
  • Language detection
  • Image generation

Correct Answers

Use CaseCapability
Creating realistic AI-generated artworkImage generation
Identifying the language of a sentenceLanguage detection
Extracting text from a passport imageOCR

Question 6

Which Azure AI capability allows applications to answer questions conversationally?

A. Generative AI
B. Regression
C. Clustering
D. Forecasting


Correct Answer

A. Generative AI


Question 7

A retailer wants to detect when shelves in stores are empty by analyzing camera images.

Which AI capability should be used?

A. Object detection
B. Speech recognition
C. Translation
D. Regression


Correct Answer

A. Object detection


Question 8

MULTIPLE ANSWER

Which are common examples of generative AI applications?

Select ALL that apply.

A. Chatbots
B. AI image creation
C. Automated code generation
D. Spreadsheet printing
E. Content summarization


Correct Answers

A. Chatbots
B. AI image creation
C. Automated code generation
E. Content summarization


Question 9

What is the purpose of an API key in Azure AI services?

A. To authenticate access to the service
B. To increase monitor brightness
C. To compress image files
D. To replace endpoints


Correct Answer

A. To authenticate access to the service


Question 10

Which AI capability identifies names, locations, or dates in text?

A. Entity extraction
B. Object detection
C. OCR
D. Speech synthesis


Correct Answer

A. Entity extraction


Question 11

FILL IN THE BLANK

__________ AI can generate new content such as text, images, and code.


Correct Answer

Generative

or

Generative AI


Question 12

Which statement about cloud-based AI services is TRUE?

A. They reduce the need to manage physical infrastructure
B. They eliminate all cybersecurity risks
C. They only work offline
D. They cannot scale automatically


Correct Answer

A. They reduce the need to manage physical infrastructure


Question 13

You need an AI solution that can convert customer emails into concise summaries.

Which capability is MOST appropriate?

A. Text summarization
B. Regression
C. Object detection
D. Forecasting


Correct Answer

A. Text summarization


Question 14

Which machine learning technique is used when training data already contains labeled outcomes?

A. Supervised learning
B. Unsupervised learning
C. Reinforcement learning
D. OCR


Correct Answer

A. Supervised learning


Question 15

MULTIPLE ANSWER

Which capabilities are commonly associated with Azure AI Vision services?

Select ALL that apply.

A. OCR
B. Image captioning
C. Object detection
D. Sentiment analysis
E. Facial analysis


Correct Answers

A. OCR
B. Image captioning
C. Object detection
E. Facial analysis


Question 16

Which Responsible AI principle focuses on ensuring AI systems operate dependably?

A. Reliability and safety
B. Transparency
C. Inclusiveness
D. Fairness


Correct Answer

A. Reliability and safety


Question 17

You want an AI assistant to always respond in a formal tone.

What is the BEST way to accomplish this?

A. Use a system prompt
B. Use OCR
C. Increase screen resolution
D. Disable APIs


Correct Answer

A. Use a system prompt


Question 18

HOTSPOT / MATCHING

Match the AI workload to the corresponding example.

WorkloadExample
Classification?
Speech synthesis?
Clustering?

Options:

  • Grouping similar customers
  • Converting text into spoken audio
  • Determining whether a loan is approved

Correct Answers

WorkloadExample
ClassificationDetermining whether a loan is approved
Speech synthesisConverting text into spoken audio
ClusteringGrouping similar customers

Question 19

Which capability allows AI systems to create captions that describe images?

A. Image captioning
B. OCR
C. Translation
D. Regression


Correct Answer

A. Image captioning


Question 20

Which statement about Responsible AI is TRUE?

A. AI systems should be monitored and governed responsibly
B. AI systems never require human oversight
C. Responsible AI eliminates all risks
D. Responsible AI only applies to generative AI


Correct Answer

A. AI systems should be monitored and governed responsibly


Question 21

A transportation company wants to analyze dashcam footage to detect traffic signs and pedestrians.

Which workload is MOST appropriate?

A. Computer vision
B. Regression
C. Forecasting
D. Translation


Correct Answer

A. Computer vision


Question 22

MULTIPLE ANSWER

Which are common capabilities of natural language processing solutions?

Select ALL that apply.

A. Translation
B. Sentiment analysis
C. Entity recognition
D. Object detection
E. Summarization


Correct Answers

A. Translation
B. Sentiment analysis
C. Entity recognition
E. Summarization


Question 23

Which Responsible AI principle emphasizes understanding and explaining AI decisions?

A. Transparency
B. Reliability
C. Privacy
D. Inclusiveness


Correct Answer

A. Transparency


Question 24

FILL IN THE BLANK

__________ recognition converts spoken words into text.


Correct Answer

Speech

or

Speech recognition


Question 25

A business wants AI-generated voice responses for a customer-service phone system.

Which capability should they use?

A. Speech synthesis
B. OCR
C. Classification
D. Clustering


Correct Answer

A. Speech synthesis


Question 26

Which AI capability would BEST help identify duplicate customer records?

A. Entity matching
B. Object detection
C. Speech synthesis
D. Forecasting


Correct Answer

A. Entity matching


Question 27

MULTIPLE ANSWER

Which are benefits of using pretrained generative AI models?

Select ALL that apply.

A. Faster implementation
B. Reduced training costs
C. Access to advanced capabilities
D. Guaranteed perfect outputs
E. Simplified development


Correct Answers

A. Faster implementation
B. Reduced training costs
C. Access to advanced capabilities
E. Simplified development


Question 28

You are developing a lightweight application that sends prompts to an Azure AI model and receives responses.

What is the MOST likely communication method?

A. REST API calls
B. HDMI connections
C. USB transfers
D. Printer drivers


Correct Answer

A. REST API calls


Question 29

Which AI capability can identify the main topics discussed in a document?

A. Keyword extraction
B. Speech synthesis
C. Object detection
D. Regression


Correct Answer

A. Keyword extraction


Question 30

SCENARIO-BASED QUESTION

A media company wants an AI solution that:

  • Generates promotional images
  • Converts podcast audio into transcripts
  • Detects company logos in uploaded photos
  • Summarizes interview articles

Which AI capabilities are required?

A. Image generation, speech recognition, object detection, and text summarization
B. Regression and clustering only
C. Forecasting and classification only
D. OCR only


Correct Answer

A. Image generation, speech recognition, object detection, and text summarization


Explanation

The scenario requires multiple AI capabilities:

  • Image generation for promotional graphics
  • Speech recognition for podcast transcription
  • Object detection for logo identification
  • Text summarization for interview summaries

AI-901: Microsoft Azure AI Fundamentals – Practice Exam #3 (30 Questions)

Question 1

Which machine learning technique is commonly used to predict whether an email is spam?

A. Regression
B. Classification
C. Clustering
D. OCR


Correct Answer

B. Classification


Explanation

Classification predicts categories or labels such as spam/not spam, approved/denied, or fraud/not fraud.


Question 2

A company wants an AI system that can automatically generate product descriptions for an online store.

Which AI capability should be used?

A. Generative AI
B. Regression
C. OCR
D. Clustering


Correct Answer

A. Generative AI


Question 3

Which Responsible AI principle focuses on making AI systems accessible to people with varying abilities?

A. Fairness
B. Inclusiveness
C. Accountability
D. Transparency


Correct Answer

B. Inclusiveness


Question 4

Which Azure AI capability is MOST useful for converting scanned paper documents into searchable text?

A. OCR
B. Object detection
C. Speech synthesis
D. Regression


Correct Answer

A. OCR


Question 5

HOTSPOT / MATCHING

Match the AI scenario with the correct capability.

ScenarioCapability
Predicting next month’s sales?
Grouping similar customers?
Determining if a review is positive or negative?

Options:

  • Clustering
  • Regression
  • Sentiment analysis

Correct Answers

ScenarioCapability
Predicting next month’s salesRegression
Grouping similar customersClustering
Determining if a review is positive or negativeSentiment analysis

Question 6

What is the PRIMARY benefit of pretrained AI models?

A. They eliminate all errors
B. They reduce the need to build models from scratch
C. They require no internet connectivity
D. They replace APIs


Correct Answer

B. They reduce the need to build models from scratch


Question 7

Which AI capability identifies people, products, or objects within images?

A. Object detection
B. Sentiment analysis
C. Speech recognition
D. Translation


Correct Answer

A. Object detection


Question 8

MULTIPLE ANSWER

Which are examples of generative AI outputs?

Select ALL that apply.

A. AI-generated images
B. AI-written summaries
C. AI-created code
D. Printed paper forms
E. AI-generated marketing emails


Correct Answers

A. AI-generated images
B. AI-written summaries
C. AI-created code
E. AI-generated marketing emails


Question 9

What is temperature commonly used for in generative AI models?

A. To control response randomness and creativity
B. To measure server heat
C. To encrypt AI outputs
D. To improve monitor resolution


Correct Answer

A. To control response randomness and creativity


Question 10

A business wants an AI system that can translate customer support chats into multiple languages.

Which workload should they use?

A. Natural language processing
B. Object detection
C. Regression
D. Clustering


Correct Answer

A. Natural language processing


Question 11

FILL IN THE BLANK

__________ AI models can process multiple input types such as text, images, and audio.


Correct Answer

Multimodal


Question 12

Which statement about Azure AI Foundry is TRUE?

A. It can be used to deploy and manage AI models
B. It only supports spreadsheet applications
C. It replaces operating systems
D. It only works for computer vision models


Correct Answer

A. It can be used to deploy and manage AI models


Question 13

You need an AI application that can identify important phrases within documents.

Which capability should you use?

A. Keyword extraction
B. Regression
C. Clustering
D. Object detection


Correct Answer

A. Keyword extraction


Question 14

Which type of machine learning groups data based on similarities without predefined labels?

A. Regression
B. Classification
C. Clustering
D. OCR


Correct Answer

C. Clustering


Question 15

MULTIPLE ANSWER

Which are common uses of speech AI services?

Select ALL that apply.

A. Speech-to-text conversion
B. Text-to-speech conversion
C. Real-time translation
D. Predicting stock prices
E. Speaker recognition


Correct Answers

A. Speech-to-text conversion
B. Text-to-speech conversion
C. Real-time translation
E. Speaker recognition


Question 16

Which Responsible AI principle emphasizes understanding how AI systems operate?

A. Transparency
B. Reliability
C. Inclusiveness
D. Privacy


Correct Answer

A. Transparency


Question 17

You are creating a chatbot that should answer politely and professionally.

Which prompt type is BEST for defining the chatbot’s behavior?

A. System prompt
B. User prompt
C. Regression prompt
D. OCR prompt


Correct Answer

A. System prompt


Question 18

HOTSPOT / MATCHING

Match the capability to the appropriate scenario.

CapabilityScenario
OCR?
Speech synthesis?
Image classification?

Options:

  • Categorizing animal photos
  • Reading text from receipts
  • Generating spoken responses

Correct Answers

CapabilityScenario
OCRReading text from receipts
Speech synthesisGenerating spoken responses
Image classificationCategorizing animal photos

Question 19

Which AI workload would BEST identify the language used in a sentence?

A. Natural language processing
B. Regression
C. Forecasting
D. Clustering


Correct Answer

A. Natural language processing


Question 20

What is one limitation of generative AI models?

A. They may produce inaccurate information
B. They cannot process text
C. They only work offline
D. They cannot generate images


Correct Answer

A. They may produce inaccurate information


Question 21

A company wants to monitor customer opinions on social media posts.

Which AI capability should they use?

A. Sentiment analysis
B. Object detection
C. OCR
D. Regression


Correct Answer

A. Sentiment analysis


Question 22

MULTIPLE ANSWER

Which tasks are examples of computer vision workloads?

Select ALL that apply.

A. Detecting objects in images
B. Recognizing handwritten text
C. Identifying image categories
D. Translating speech
E. Facial analysis


Correct Answers

A. Detecting objects in images
B. Recognizing handwritten text
C. Identifying image categories
E. Facial analysis


Question 23

Which Responsible AI principle ensures organizations remain responsible for AI decisions and outcomes?

A. Accountability
B. Transparency
C. Inclusiveness
D. Fairness


Correct Answer

A. Accountability


Question 24

FILL IN THE BLANK

__________ analysis evaluates whether text expresses positive, negative, or neutral emotions.


Correct Answer

Sentiment

or

Sentiment analysis


Question 25

Which AI capability would BEST help summarize a recorded business meeting?

A. Speech recognition combined with text summarization
B. Object detection only
C. Regression only
D. Clustering only


Correct Answer

A. Speech recognition combined with text summarization


Question 26

A retailer wants an AI model that automatically creates advertising images from written prompts.

Which capability is MOST appropriate?

A. Image-generation models
B. Forecasting models
C. Regression models
D. OCR models


Correct Answer

A. Image-generation models


Question 27

MULTIPLE ANSWER

Which are advantages of lightweight AI applications?

Select ALL that apply.

A. Faster development
B. Reduced infrastructure complexity
C. Easier cloud integration
D. Guaranteed perfect AI accuracy
E. Simplified deployment


Correct Answers

A. Faster development
B. Reduced infrastructure complexity
C. Easier cloud integration
E. Simplified deployment


Question 28

You want an AI application to analyze uploaded images and answer questions about them.

Which type of model is MOST appropriate?

A. Multimodal model
B. Regression model
C. Clustering model
D. Forecasting model


Correct Answer

A. Multimodal model


Question 29

Which statement about APIs in AI solutions is TRUE?

A. APIs allow applications to exchange data with AI services
B. APIs eliminate the need for authentication
C. APIs physically store microphones
D. APIs replace cloud services


Correct Answer

A. APIs allow applications to exchange data with AI services


Question 30

SCENARIO-BASED QUESTION

A financial company wants an AI solution that:

  • Reads text from loan applications
  • Detects signatures in uploaded forms
  • Converts recorded customer calls into text
  • Generates automated email responses

Which AI capabilities are required?

A. OCR, object detection, speech recognition, and generative AI
B. Regression and clustering only
C. Forecasting and translation only
D. Speech synthesis only


Correct Answer

A. OCR, object detection, speech recognition, and generative AI


Explanation

The scenario requires multiple AI capabilities:

  • OCR for reading application text
  • Object detection for identifying signatures
  • Speech recognition for transcribing calls
  • Generative AI for automated email creation