This post is a part of the AI-901: Microsoft Azure AI Fundamentals Exam Prep Hub.
This topic falls under these sections:
Implement AI solutions by using Microsoft Foundry (55–60%)
--> Implement generative AI apps and agents by using Foundry
--> Deploy a model and interact with it in the Foundry Portal
Note that there are 10 practice questions (with answers and explanations) for each section to help you solidify your knowledge of the material. Also, there are 2 practice tests with 60 questions each available on the hub below the exam topics section.
Microsoft Azure AI Foundry provides a centralized environment for building, testing, deploying, and managing generative AI models and AI-powered applications. For the AI-901 certification exam, candidates should understand the basic process of deploying AI models and interacting with them through the Foundry portal.
This topic focuses on how developers and AI practitioners use Azure AI Foundry to deploy generative AI models, test prompts, configure model settings, and interact with deployed AI endpoints.
This topic falls under the “Implement generative AI apps and agents by using Foundry” section of the AI-901 exam objectives.
What Is Azure AI Foundry?
Azure AI Foundry is Microsoft’s platform for building and managing AI applications and agents.
Azure AI Foundry provides tools to:
- Explore AI models
- Deploy models
- Test prompts
- Configure AI behavior
- Evaluate responses
- Monitor AI applications
- Manage AI resources
It supports generative AI development using Azure-hosted AI services and models.
What Does “Deploying a Model” Mean?
Deploying a model means making the AI model available for use.
A deployed model can:
- Receive prompts
- Process requests
- Generate responses
- Be accessed through applications or APIs
Deployment creates an endpoint that applications can use to interact with the model.
What Is a Model Endpoint?
An endpoint is a network-accessible interface that allows applications or users to communicate with a deployed AI model.
Applications send requests to the endpoint and receive AI-generated responses.
Common Deployment Scenarios
Organizations deploy models for many purposes, including:
- Chatbots
- AI assistants
- Document summarization
- Content generation
- Customer support systems
- Code generation
- Data extraction
Azure AI Foundry Workflow
A simplified workflow in Azure AI Foundry typically includes:
- Create or access an Azure AI resource
- Open Azure AI Foundry portal
- Select a model
- Configure deployment settings
- Deploy the model
- Test prompts
- Interact with the model
- Integrate the endpoint into applications
Accessing the Foundry Portal
Users access Azure AI Foundry through a web-based portal.
The portal provides graphical tools for:
- Model selection
- Prompt testing
- Deployment management
- Performance monitoring
Exploring Available Models
Azure AI Foundry allows users to browse available models.
Examples may include:
- Large Language Models (LLMs)
- Image-generation models
- Embedding models
- Speech models
Models may vary by:
- Size
- Performance
- Cost
- Supported capabilities
Selecting a Model
Users choose models based on application requirements.
Factors may include:
- Accuracy
- Speed
- Cost
- Context window size
- Multimodal support
- Language support
Example Scenario
A company building a customer support chatbot may choose a conversational large language model.
Deploying a Model in Foundry
The deployment process usually involves:
- Selecting a model
- Naming the deployment
- Choosing deployment settings
- Allocating resources
- Creating the endpoint
Deployment Names
Deployments are typically assigned unique names.
Example
support-chat-model
Applications use deployment names when sending requests.
Model Configuration Options
During deployment, users may configure:
- Model version
- Scaling options
- Authentication settings
- Content filters
- Region
- Resource allocation
Content Filtering and Safety
Azure AI Foundry includes Responsible AI safety features.
Content filtering helps reduce:
- Harmful outputs
- Offensive content
- Unsafe responses
This is important for enterprise AI applications.
Interacting with a Deployed Model
After deployment, users can interact with the model directly within the Foundry portal.
This often includes:
- Entering prompts
- Viewing responses
- Adjusting settings
- Testing behavior
Playground Interfaces
Azure AI Foundry provides playground environments for experimentation.
Playgrounds allow users to:
- Test prompts
- Compare outputs
- Tune settings
- Evaluate model behavior
Prompt Testing
Users can experiment with:
- System prompts
- User prompts
- Formatting instructions
- Role prompting
Prompt testing helps improve AI response quality.
Example Prompt Interaction
User Prompt
“Summarize this customer feedback in three bullet points.”
Model Response
The model generates a summarized response.
Model Parameters
Foundry portals may allow adjustment of model parameters such as:
- Temperature
- Maximum tokens
- Top-p sampling
Temperature
Temperature controls response randomness.
| Low Temperature | High Temperature |
|---|---|
| More predictable | More creative |
| More focused | More varied |
Maximum Tokens
Maximum tokens limit response length.
Smaller limits create shorter responses.
System Prompts in Foundry
Users can configure system prompts to guide AI behavior.
Example System Prompt
“You are a professional technical support assistant. Keep responses concise and helpful.”
System prompts influence:
- Tone
- Style
- Safety
- Formatting
Evaluating Responses
Users should evaluate AI outputs for:
- Accuracy
- Relevance
- Safety
- Bias
- Hallucinations
AI-generated content should be reviewed carefully.
Hallucinations
Generative AI models can produce incorrect or fabricated information.
These incorrect outputs are called hallucinations.
Prompt engineering and grounding techniques help reduce hallucinations.
API Access
Once deployed, applications can connect to the model endpoint using APIs.
This allows developers to integrate AI into applications.
Common Integration Scenarios
Applications may use deployed models for:
- Chat interfaces
- Search assistants
- Document analysis
- AI copilots
- Workflow automation
Monitoring and Management
Azure AI Foundry supports monitoring deployed models.
Monitoring may include:
- Usage tracking
- Performance analysis
- Error monitoring
- Cost management
Scaling AI Deployments
Organizations may scale deployments to support:
- More users
- Higher request volumes
- Faster response times
Cloud-based deployments support elastic scaling.
Responsible AI Considerations
When deploying AI models, organizations should consider:
- Privacy
- Security
- Fairness
- Transparency
- Safety
- Compliance
Generative AI applications should include safeguards against misuse.
Authentication and Security
Deployed models typically require secure authentication.
Security features may include:
- API keys
- Identity management
- Access control
Common Challenges
Organizations may encounter challenges such as:
- High usage costs
- Latency
- Hallucinations
- Unsafe outputs
- Poor prompt quality
Proper testing and monitoring are important.
Azure OpenAI Service
Azure OpenAI Service provides access to powerful generative AI models that can be deployed and managed through Azure AI Foundry.
Real-World Scenarios
Scenario 1: Customer Support Chatbot
Goal
Deploy a conversational AI assistant.
Activities
- Deploy language model
- Configure system prompts
- Test responses in the playground
Scenario 2: Internal Knowledge Assistant
Goal
Allow employees to ask questions about company documentation.
Activities
- Deploy AI model
- Configure prompts
- Integrate with enterprise systems
Scenario 3: Marketing Content Generator
Goal
Generate product descriptions automatically.
Activities
- Deploy generative AI model
- Test prompt variations
- Evaluate response quality
Important AI-901 Exam Tips
For the exam, remember these key points:
- Deploying a model makes it available for use.
- Deployments create accessible endpoints.
- Azure AI Foundry provides tools for testing and managing models.
- Playgrounds allow prompt experimentation.
- System prompts guide model behavior.
- Temperature controls creativity and randomness.
- Maximum tokens control response length.
- AI outputs should be evaluated for accuracy and safety.
- Content filtering supports Responsible AI practices.
- APIs allow applications to connect to deployed models.
Quick Knowledge Check
Question 1
What does deploying a model do?
Answer
It makes the AI model available for use through an endpoint.
Question 2
What is the purpose of a playground in Azure AI Foundry?
Answer
To test prompts and interact with deployed models.
Question 3
What does the temperature setting control?
Answer
The randomness and creativity of model responses.
Question 4
Why are content filters important?
Answer
They help reduce harmful or unsafe AI-generated outputs.
Practice Exam Questions
Question 1
What is the PRIMARY purpose of deploying an AI model?
A. To permanently delete the model
B. To make the model available for use through an endpoint
C. To compress training data
D. To convert images into text
Correct Answer
B. To make the model available for use through an endpoint
Explanation
Deploying a model makes it accessible so applications and users can interact with it.
Why the Other Answers Are Incorrect
A. To permanently delete the model
Deployment does not delete models.
C. To compress training data
Deployment is unrelated to data compression.
D. To convert images into text
This describes OCR.
Question 2
What is an endpoint in the context of AI model deployment?
A. A physical server room
B. A network-accessible interface for interacting with a deployed model
C. A type of database backup
D. A computer vision algorithm
Correct Answer
B. A network-accessible interface for interacting with a deployed model
Explanation
Endpoints allow applications and users to send requests to deployed AI models and receive responses.
Why the Other Answers Are Incorrect
A. A physical server room
Endpoints are logical interfaces, not physical locations.
C. A type of database backup
This is unrelated to AI deployment.
D. A computer vision algorithm
Endpoints are not algorithms.
Question 3
Which Azure tool provides playgrounds for testing prompts and interacting with deployed AI models?
A. Azure SQL Database
B. Azure AI Foundry
C. Microsoft Excel
D. Azure Virtual Desktop
Correct Answer
B. Azure AI Foundry
Explanation
Azure AI Foundry provides tools for model deployment, prompt testing, evaluation, and management.
Why the Other Answers Are Incorrect
A. Azure SQL Database
This is a database service.
C. Microsoft Excel
Excel is not an AI deployment platform.
D. Azure Virtual Desktop
This provides desktop virtualization services.
Question 4
What is the PRIMARY purpose of a playground in Azure AI Foundry?
A. Hosting multiplayer games
B. Experimenting with prompts and testing model behavior
C. Managing employee payroll
D. Compressing image files
Correct Answer
B. Experimenting with prompts and testing model behavior
Explanation
Playgrounds allow users to interact with models, test prompts, and evaluate responses.
Why the Other Answers Are Incorrect
A. Hosting multiplayer games
This is unrelated to AI Foundry.
C. Managing employee payroll
This is unrelated to AI development.
D. Compressing image files
Playgrounds are not image utilities.
Question 5
Which configuration setting controls how creative or random AI-generated responses are?
A. OCR level
B. Temperature
C. Resolution scaling
D. Data indexing
Correct Answer
B. Temperature
Explanation
Temperature controls randomness and creativity in generative AI responses.
Why the Other Answers Are Incorrect
A. OCR level
OCR extracts text from images.
C. Resolution scaling
This relates to images, not text generation randomness.
D. Data indexing
Indexing is unrelated to generative response creativity.
Question 6
What is the effect of setting a lower temperature value in a generative AI model?
A. More random responses
B. More predictable and focused responses
C. Faster internet speeds
D. Larger image generation sizes
Correct Answer
B. More predictable and focused responses
Explanation
Lower temperature settings reduce randomness and produce more deterministic outputs.
Why the Other Answers Are Incorrect
A. More random responses
Higher temperatures increase randomness.
C. Faster internet speeds
Temperature does not affect networking.
D. Larger image generation sizes
Temperature is unrelated to image dimensions.
Question 7
Which prompt type defines the AI assistant’s behavior, tone, and rules?
A. User prompt
B. System prompt
C. SQL query
D. OCR prompt
Correct Answer
B. System prompt
Explanation
System prompts provide high-level behavioral instructions to the AI model.
Why the Other Answers Are Incorrect
A. User prompt
User prompts specify tasks or requests.
C. SQL query
SQL queries interact with databases.
D. OCR prompt
OCR is unrelated to conversational AI behavior.
Question 8
Why are content filters important when deploying generative AI models?
A. They improve internet bandwidth
B. They help reduce harmful or unsafe outputs
C. They increase monitor resolution
D. They replace system prompts entirely
Correct Answer
B. They help reduce harmful or unsafe outputs
Explanation
Content filtering supports Responsible AI by helping prevent harmful or inappropriate AI-generated content.
Why the Other Answers Are Incorrect
A. They improve internet bandwidth
Content filters do not affect networking performance.
C. They increase monitor resolution
This is unrelated to AI safety.
D. They replace system prompts entirely
Content filters complement prompts; they do not replace them.
Question 9
What are hallucinations in generative AI?
A. Physical hardware failures
B. Incorrect or fabricated AI-generated information
C. Database replication errors
D. Unauthorized user logins
Correct Answer
B. Incorrect or fabricated AI-generated information
Explanation
Hallucinations occur when AI generates inaccurate or invented information.
Why the Other Answers Are Incorrect
A. Physical hardware failures
This is unrelated to AI hallucinations.
C. Database replication errors
This is a database issue.
D. Unauthorized user logins
This is a security issue.
Question 10
After deploying a model, how do external applications typically interact with it?
A. Through handwritten forms
B. Through APIs connected to the deployment endpoint
C. Through spreadsheet imports only
D. Through local USB connections
Correct Answer
B. Through APIs connected to the deployment endpoint
Explanation
Applications commonly communicate with deployed AI models using APIs and endpoints.
Why the Other Answers Are Incorrect
A. Through handwritten forms
This is unrelated to AI deployment.
C. Through spreadsheet imports only
Spreadsheets are not the primary integration mechanism.
D. Through local USB connections
Cloud AI services typically use network-based APIs, not USB connections.
Final Thoughts
Deploying and interacting with AI models in Azure AI Foundry is an important skill area for the AI-901 certification exam. Microsoft expects candidates to understand the basic deployment workflow, prompt testing process, model configuration options, and Responsible AI considerations involved in building generative AI applications.
Azure AI Foundry simplifies AI development by providing a centralized environment for deploying, testing, and managing AI models and agents.
Go to the AI-901 Exam Prep Hub main page

One thought on “Deploy a model and interact with it in the Foundry Portal (AI-901 Exam Prep)”