This post is a part of the AI-103: Develop AI Apps and Agents on Azure Exam Prep Hub.
This topic falls under these sections:
Plan and manage an Azure AI solution (25–30%)
--> Set up AI solutions in Foundry
--> Choose appropriate deployment options
Note that there are 10 practice questions (with answers and explanations) at the end of each section to help you solidify your knowledge of the material. Also, there are 2 practice tests with 60 questions each available from the hub's main page below the exam topics section.
Introduction
One of the most important responsibilities for Azure AI developers is selecting the correct deployment option for AI applications and agent-based solutions.
Modern AI systems can be deployed in many different ways depending on:
- Scalability requirements
- Cost constraints
- Security requirements
- Latency expectations
- Geographic distribution
- Operational complexity
- AI workload patterns
- Enterprise governance needs
The AI-103: Develop AI Apps and Agents on Azure certification exam tests your understanding of how to choose appropriate deployment options for:
- Generative AI applications
- AI agents
- APIs
- RAG systems
- Vector search solutions
- Multimodal applications
- Enterprise AI systems
For the AI-103 exam, you should understand:
- Azure deployment models
- Hosting options
- Serverless deployments
- Containerized deployments
- Kubernetes deployments
- Regional deployments
- High availability strategies
- Scaling approaches
- CI/CD deployment pipelines
- Model deployment considerations
- Infrastructure tradeoffs
What Is a Deployment Option?
A deployment option refers to the method used to host and run an AI application or service.
Deployment choices affect:
- Performance
- Reliability
- Cost
- Security
- Scalability
- Maintainability
Choosing the wrong deployment strategy can:
- Increase costs
- Reduce performance
- Complicate maintenance
- Create scaling problems
Common Azure AI Deployment Components
AI solutions commonly include:
- AI models
- APIs
- Search systems
- Databases
- Agent orchestration
- Storage systems
- Monitoring tools
- Security services
Each component may use different deployment approaches.
Azure OpenAI Deployment Options
Azure OpenAI allows developers to deploy:
- GPT models
- Embedding models
- Multimodal models
- Fine-tuned models
Deployment considerations include:
- Region availability
- Throughput requirements
- Latency requirements
- Cost optimization
- Capacity planning
Standard Deployments
What Are Standard Deployments?
Standard deployments provide shared model hosting infrastructure.
Advantages:
- Lower operational complexity
- Managed infrastructure
- Easier setup
Disadvantages:
- Shared capacity
- Potential throughput limitations
Provisioned Throughput Deployments
What Is Provisioned Throughput?
Provisioned throughput reserves dedicated processing capacity.
Advantages:
- Predictable performance
- Dedicated throughput
- Lower latency consistency
Disadvantages:
- Higher cost
- Capacity planning required
When to Use Provisioned Throughput
Use provisioned throughput when:
- Workloads are high volume
- Predictable latency is critical
- Enterprise SLAs are required
- Large-scale copilots are deployed
Serverless Deployment Options
What Is Serverless?
Serverless computing automatically manages infrastructure.
Developers focus on code instead of servers.
Azure Functions
Azure Functions provides event-driven serverless compute.
Common AI use cases:
- Tool calling
- Workflow execution
- API processing
- Lightweight orchestration
- Event-triggered AI actions
Advantages of Azure Functions
- Automatic scaling
- Pay-per-use pricing
- Rapid deployment
- Minimal infrastructure management
Limitations of Azure Functions
- Execution duration limits
- Cold starts
- Less suitable for large persistent workloads
When to Use Azure Functions
Use Azure Functions when:
- Workloads are event-driven
- Execution is lightweight
- Cost optimization is important
- Rapid scaling is required
Azure Container Apps
Azure Container Apps provides serverless container hosting.
Useful for:
- AI middleware
- APIs
- Agent orchestration
- Background workers
- Lightweight microservices
Advantages of Container Apps
- Simplified container deployment
- Autoscaling support
- Event-driven scaling
- Lower operational overhead than Kubernetes
Kubernetes Deployments
Azure Kubernetes Service (AKS)
AKS provides enterprise-grade container orchestration.
Common AI uses:
- Multi-agent systems
- Large-scale AI platforms
- Distributed AI services
- Complex orchestration
- High-volume APIs
Advantages of AKS
- High scalability
- Advanced orchestration
- Fine-grained control
- Container portability
- Enterprise-grade deployments
Limitations of AKS
- Higher operational complexity
- More infrastructure management
- Requires Kubernetes expertise
When to Use AKS
Use AKS when:
- Large-scale deployments exist
- Multiple microservices interact
- High traffic is expected
- Advanced orchestration is needed
Platform-as-a-Service (PaaS) Deployments
Azure App Service
Azure App Service hosts:
- Web apps
- APIs
- AI front ends
- Lightweight enterprise applications
Advantages of App Service
- Managed platform
- Easy deployment
- Autoscaling support
- Simplified maintenance
When to Use App Service
Use App Service when:
- Hosting AI web applications
- Managing APIs
- Rapid development is needed
- Full Kubernetes orchestration is unnecessary
Edge and Hybrid Deployments
Some AI workloads require local or hybrid deployments.
Reasons include:
- Low latency
- Regulatory requirements
- Limited connectivity
- On-premises data processing
Azure Arc
Azure Arc extends Azure management to:
- On-premises systems
- Multi-cloud environments
- Edge deployments
Useful for hybrid AI environments.
Deployment Considerations for AI Agents
AI agents often require multiple deployment layers.
Examples include:
- LLM hosting
- Retrieval systems
- Tool execution services
- Workflow orchestration
- Persistent memory systems
Multi-Service Architectures
AI agents commonly use:
- Azure OpenAI
- Azure AI Search
- Azure Functions
- Cosmos DB
- APIs
- Orchestration workflows
Different components may use different deployment options.
Geographic Deployment Considerations
AI systems may require global deployment strategies.
Regional Deployments
Deploying resources in a specific region helps:
- Reduce latency
- Meet compliance requirements
- Improve user experience
Multi-Region Deployments
Multi-region deployments improve:
- Availability
- Disaster recovery
- Global performance
Availability Zones
Availability Zones provide redundancy across isolated datacenters.
Benefits include:
- Higher uptime
- Fault tolerance
- Improved resilience
High Availability Design
Enterprise AI applications often require:
- Redundant infrastructure
- Automatic failover
- Load balancing
- Disaster recovery
Load Balancing
Azure Load Balancer and Azure Application Gateway distribute traffic across services.
Benefits:
- Scalability
- High availability
- Traffic optimization
Autoscaling
Autoscaling dynamically adjusts infrastructure based on demand.
Supported by:
- AKS
- Azure Functions
- App Service
- Container Apps
Deployment Security Considerations
Security is a major AI-103 exam topic.
Microsoft Entra ID
Microsoft Entra ID supports:
- Authentication
- Authorization
- Identity management
- RBAC
Azure Key Vault
Azure Key Vault securely stores:
- Secrets
- API keys
- Certificates
- Connection strings
Private Endpoints
Private Endpoints provide secure private connectivity between Azure services.
Useful for:
- Enterprise AI systems
- Sensitive data workloads
- Compliance-driven deployments
CI/CD for AI Deployments
What Is CI/CD?
CI/CD stands for:
- Continuous Integration
- Continuous Deployment
CI/CD automates:
- Testing
- Deployment
- Validation
- Release management
Azure DevOps
Azure DevOps supports:
- Build pipelines
- Release pipelines
- Source control
- Automated deployments
GitHub Actions
GitHub Actions supports:
- Workflow automation
- CI/CD pipelines
- Deployment automation
Commonly used for AI application deployments.
Blue-Green Deployments
Blue-green deployments reduce downtime during releases.
How it works:
- One environment remains active
- A second environment receives updates
- Traffic shifts after validation
Benefits:
- Safer releases
- Reduced downtime
- Easier rollback
Canary Deployments
Canary deployments release updates gradually to a small percentage of users.
Benefits:
- Reduced deployment risk
- Easier issue detection
- Safer experimentation
Monitoring Deployment Health
AI deployments should monitor:
- Latency
- Throughput
- Token usage
- Errors
- Model failures
- Tool call failures
- Retrieval quality
Azure Monitor
Azure Monitor provides:
- Metrics
- Logging
- Alerts
- Diagnostics
Application Insights
Application Insights supports:
- Telemetry
- Request tracing
- Dependency tracking
- Error diagnostics
Cost Optimization Considerations
AI deployments can become expensive.
Common Cost Drivers
- Token consumption
- GPU usage
- High-scale orchestration
- Search indexing
- Storage
- Data transfer
Cost Optimization Strategies
Use Smaller Models When Appropriate
Smaller models reduce:
- Compute costs
- Token usage
- Latency
Use Serverless When Appropriate
Serverless deployments reduce idle infrastructure costs.
Use Autoscaling
Autoscaling prevents overprovisioning.
Common AI-103 Deployment Scenarios
Scenario 1: Enterprise AI Chatbot
Requirements:
- High availability
- Secure authentication
- Enterprise search
Recommended Deployment:
- Azure OpenAI
- App Service
- Azure AI Search
- Entra ID
Scenario 2: Large-Scale AI Agent Platform
Requirements:
- Multiple AI agents
- Heavy orchestration
- High concurrency
Recommended Deployment:
- AKS
- Azure Functions
- Cosmos DB
- Prompt Flow
Scenario 3: Lightweight AI API
Requirements:
- Rapid deployment
- Cost optimization
- Moderate scale
Recommended Deployment:
- Azure Functions
- Container Apps
Scenario 4: Global AI Application
Requirements:
- Global users
- Low latency
- Disaster recovery
Recommended Deployment:
- Multi-region deployment
- Availability Zones
- Load balancing
Common AI-103 Exam Tips
Understand Deployment Tradeoffs
Know when to use:
- App Service vs AKS
- Functions vs Containers
- Standard vs Provisioned Throughput
Know High Availability Concepts
Understand:
- Availability Zones
- Multi-region deployments
- Load balancing
- Failover strategies
Learn Security Best Practices
Know how to use:
- Entra ID
- RBAC
- Key Vault
- Private Endpoints
Understand Agent Deployment Needs
AI agents commonly require:
- Tool orchestration
- Retrieval systems
- Persistent memory
- API integrations
Summary
Choosing the correct deployment option is critical for successful AI applications and agent-based systems.
For the AI-103 exam, you should understand:
- Azure deployment models
- Serverless deployment options
- Kubernetes deployments
- PaaS hosting options
- Multi-region architectures
- High availability design
- Security considerations
- CI/CD pipelines
- Scaling strategies
- AI deployment tradeoffs
Strong deployment architecture skills help ensure AI systems are:
- Reliable
- Scalable
- Secure
- Cost-effective
- Maintainable
Practice Exam Questions
Question 1
Which Azure service is BEST suited for enterprise-scale container orchestration for AI applications?
A. Azure App Service
B. Azure Kubernetes Service (AKS)
C. Azure DNS
D. Azure Backup
Answer
B. Azure Kubernetes Service (AKS)
Explanation
AKS provides enterprise-grade container orchestration and scalability.
Question 2
Which deployment option provides dedicated throughput capacity for Azure OpenAI models?
A. Shared deployment
B. Provisioned throughput deployment
C. Consumption deployment
D. Basic deployment
Answer
B. Provisioned throughput deployment
Explanation
Provisioned throughput reserves dedicated model processing capacity.
Question 3
Which Azure service is MOST appropriate for lightweight event-driven AI workflows?
A. Azure Functions
B. Azure Firewall
C. Azure Backup
D. Azure CDN
Answer
A. Azure Functions
Explanation
Azure Functions supports serverless event-driven execution.
Question 4
What is the primary benefit of Availability Zones?
A. Lower token usage
B. Increased embedding size
C. Improved fault tolerance
D. Reduced API authentication
Answer
C. Improved fault tolerance
Explanation
Availability Zones provide redundancy across isolated datacenters.
Question 5
Which Azure service is commonly used to host AI web applications and APIs with minimal infrastructure management?
A. Azure App Service
B. Azure Load Balancer
C. Azure DNS
D. Azure Monitor
Answer
A. Azure App Service
Explanation
Azure App Service is a managed PaaS platform for hosting web applications and APIs.
Question 6
Which deployment strategy gradually releases updates to a subset of users first?
A. Blue-green deployment
B. Canary deployment
C. Full rollback deployment
D. Batch deployment
Answer
B. Canary deployment
Explanation
Canary deployments release updates incrementally to reduce risk.
Question 7
Which Azure service securely stores API keys and secrets for AI applications?
A. Azure Key Vault
B. Azure CDN
C. Azure Firewall
D. Azure Backup
Answer
A. Azure Key Vault
Explanation
Azure Key Vault securely manages secrets and credentials.
Question 8
Which Azure deployment option is MOST appropriate for serverless container hosting?
A. Azure Container Apps
B. Azure Backup
C. Azure DNS
D. Azure Files
Answer
A. Azure Container Apps
Explanation
Azure Container Apps provides simplified serverless container deployment.
Question 9
Which deployment architecture improves global application availability and disaster recovery?
A. Single-region deployment
B. Multi-region deployment
C. Local-only deployment
D. Single-container deployment
Answer
B. Multi-region deployment
Explanation
Multi-region deployments improve resilience and geographic performance.
Question 10
Which Azure monitoring service provides application telemetry and request diagnostics?
A. Application Insights
B. Azure CDN
C. Azure DNS
D. Azure Policy
Answer
A. Application Insights
Explanation
Application Insights provides monitoring and telemetry for applications.
Go to the AI-103 Exam Prep Hub main page
