This post is a part of the AI-901: Microsoft Azure AI Fundamentals Exam Prep Hub. 
This topic falls under these sections:
Implement AI solutions by using Microsoft Foundry (55–60%)
   --> Implement generative AI apps and agents by using Foundry
      --> Deploy a model and interact with it in the Foundry Portal

Note that there are 10 practice questions (with answers and explanations) for each section to help you solidify your knowledge of the material. Also, there are 2 practice tests with 60 questions each available on the hub below the exam topics section.

Microsoft Azure AI Foundry provides a centralized environment for building, testing, deploying, and managing generative AI models and AI-powered applications. For the AI-901 certification exam, candidates should understand the basic process of deploying AI models and interacting with them through the Foundry portal.

This topic focuses on how developers and AI practitioners use Azure AI Foundry to deploy generative AI models, test prompts, configure model settings, and interact with deployed AI endpoints.

This topic falls under the “Implement generative AI apps and agents by using Foundry” section of the AI-901 exam objectives.

What Is Azure AI Foundry?

Azure AI Foundry is Microsoft’s platform for building and managing AI applications and agents.

Azure AI Foundry provides tools to:

Explore AI models
Deploy models
Test prompts
Configure AI behavior
Evaluate responses
Monitor AI applications
Manage AI resources

It supports generative AI development using Azure-hosted AI services and models.

What Does “Deploying a Model” Mean?

Deploying a model means making the AI model available for use.

A deployed model can:

Receive prompts
Process requests
Generate responses
Be accessed through applications or APIs

Deployment creates an endpoint that applications can use to interact with the model.

What Is a Model Endpoint?

An endpoint is a network-accessible interface that allows applications or users to communicate with a deployed AI model.

Applications send requests to the endpoint and receive AI-generated responses.

Common Deployment Scenarios

Organizations deploy models for many purposes, including:

Chatbots
AI assistants
Document summarization
Content generation
Customer support systems
Code generation
Data extraction

Azure AI Foundry Workflow

A simplified workflow in Azure AI Foundry typically includes:

Create or access an Azure AI resource
Open Azure AI Foundry portal
Select a model
Configure deployment settings
Deploy the model
Test prompts
Interact with the model
Integrate the endpoint into applications

Accessing the Foundry Portal

Users access Azure AI Foundry through a web-based portal.

The portal provides graphical tools for:

Model selection
Prompt testing
Deployment management
Performance monitoring

Exploring Available Models

Azure AI Foundry allows users to browse available models.

Examples may include:

Large Language Models (LLMs)
Image-generation models
Embedding models
Speech models

Models may vary by:

Size
Performance
Cost
Supported capabilities

Selecting a Model

Users choose models based on application requirements.

Factors may include:

Accuracy
Speed
Cost
Context window size
Multimodal support
Language support

Example Scenario

A company building a customer support chatbot may choose a conversational large language model.

Deploying a Model in Foundry

The deployment process usually involves:

Selecting a model
Naming the deployment
Choosing deployment settings
Allocating resources
Creating the endpoint

Deployment Names

Deployments are typically assigned unique names.

Example

support-chat-model

Applications use deployment names when sending requests.

Model Configuration Options

During deployment, users may configure:

Model version
Scaling options
Authentication settings
Content filters
Region
Resource allocation

Content Filtering and Safety

Azure AI Foundry includes Responsible AI safety features.

Content filtering helps reduce:

Harmful outputs
Offensive content
Unsafe responses

This is important for enterprise AI applications.

Interacting with a Deployed Model

After deployment, users can interact with the model directly within the Foundry portal.

This often includes:

Entering prompts
Viewing responses
Adjusting settings
Testing behavior

Playground Interfaces

Azure AI Foundry provides playground environments for experimentation.

Playgrounds allow users to:

Test prompts
Compare outputs
Tune settings
Evaluate model behavior

Prompt Testing

Users can experiment with:

System prompts
User prompts
Formatting instructions
Role prompting

Prompt testing helps improve AI response quality.

Example Prompt Interaction

User Prompt

“Summarize this customer feedback in three bullet points.”

Model Response

The model generates a summarized response.

Model Parameters

Foundry portals may allow adjustment of model parameters such as:

Temperature
Maximum tokens
Top-p sampling

Temperature

Temperature controls response randomness.

Low Temperature	High Temperature
More predictable	More creative
More focused	More varied

Maximum Tokens

Maximum tokens limit response length.

Smaller limits create shorter responses.

System Prompts in Foundry

Users can configure system prompts to guide AI behavior.

Example System Prompt

“You are a professional technical support assistant. Keep responses concise and helpful.”

System prompts influence:

Tone
Style
Safety
Formatting

Evaluating Responses

Users should evaluate AI outputs for:

Accuracy
Relevance
Safety
Bias
Hallucinations

AI-generated content should be reviewed carefully.

Hallucinations

Generative AI models can produce incorrect or fabricated information.

These incorrect outputs are called hallucinations.

Prompt engineering and grounding techniques help reduce hallucinations.

API Access

Once deployed, applications can connect to the model endpoint using APIs.

This allows developers to integrate AI into applications.

Common Integration Scenarios

Applications may use deployed models for:

Chat interfaces
Search assistants
Document analysis
AI copilots
Workflow automation

Monitoring and Management

Azure AI Foundry supports monitoring deployed models.

Monitoring may include:

Usage tracking
Performance analysis
Error monitoring
Cost management

Scaling AI Deployments

Organizations may scale deployments to support:

More users
Higher request volumes
Faster response times

Cloud-based deployments support elastic scaling.

Responsible AI Considerations

When deploying AI models, organizations should consider:

Privacy
Security
Fairness
Transparency
Safety
Compliance

Generative AI applications should include safeguards against misuse.

Authentication and Security

Deployed models typically require secure authentication.

Security features may include:

API keys
Identity management
Access control

Common Challenges

Organizations may encounter challenges such as:

High usage costs
Latency
Hallucinations
Unsafe outputs
Poor prompt quality

Proper testing and monitoring are important.

Azure OpenAI Service

Azure OpenAI Service provides access to powerful generative AI models that can be deployed and managed through Azure AI Foundry.

Real-World Scenarios

Scenario 1: Customer Support Chatbot

Goal

Deploy a conversational AI assistant.

Activities

Deploy language model
Configure system prompts
Test responses in the playground

Scenario 2: Internal Knowledge Assistant

Goal

Allow employees to ask questions about company documentation.

Activities

Deploy AI model
Configure prompts
Integrate with enterprise systems

Scenario 3: Marketing Content Generator

Goal

Generate product descriptions automatically.

Activities

Deploy generative AI model
Test prompt variations
Evaluate response quality

Important AI-901 Exam Tips

For the exam, remember these key points:

Deploying a model makes it available for use.
Deployments create accessible endpoints.
Azure AI Foundry provides tools for testing and managing models.
Playgrounds allow prompt experimentation.
System prompts guide model behavior.
Temperature controls creativity and randomness.
Maximum tokens control response length.
AI outputs should be evaluated for accuracy and safety.
Content filtering supports Responsible AI practices.
APIs allow applications to connect to deployed models.

Quick Knowledge Check

Question 1

What does deploying a model do?

Answer

It makes the AI model available for use through an endpoint.

Question 2

What is the purpose of a playground in Azure AI Foundry?

Answer

To test prompts and interact with deployed models.

Question 3

What does the temperature setting control?

Answer

The randomness and creativity of model responses.

Question 4

Why are content filters important?

Answer

They help reduce harmful or unsafe AI-generated outputs.

Practice Exam Questions

Question 1

What is the PRIMARY purpose of deploying an AI model?

A. To permanently delete the model
B. To make the model available for use through an endpoint
C. To compress training data
D. To convert images into text

Correct Answer

B. To make the model available for use through an endpoint

Explanation

Deploying a model makes it accessible so applications and users can interact with it.

Why the Other Answers Are Incorrect

A. To permanently delete the model

Deployment does not delete models.

C. To compress training data

Deployment is unrelated to data compression.

D. To convert images into text

This describes OCR.

Question 2

What is an endpoint in the context of AI model deployment?

A. A physical server room
B. A network-accessible interface for interacting with a deployed model
C. A type of database backup
D. A computer vision algorithm

Correct Answer

B. A network-accessible interface for interacting with a deployed model

Explanation

Endpoints allow applications and users to send requests to deployed AI models and receive responses.

Why the Other Answers Are Incorrect

A. A physical server room

Endpoints are logical interfaces, not physical locations.

C. A type of database backup

This is unrelated to AI deployment.

D. A computer vision algorithm

Endpoints are not algorithms.

Question 3

Which Azure tool provides playgrounds for testing prompts and interacting with deployed AI models?

A. Azure SQL Database
B. Azure AI Foundry
C. Microsoft Excel
D. Azure Virtual Desktop

Correct Answer

B. Azure AI Foundry

Explanation

Azure AI Foundry provides tools for model deployment, prompt testing, evaluation, and management.

Why the Other Answers Are Incorrect

A. Azure SQL Database

This is a database service.

C. Microsoft Excel

Excel is not an AI deployment platform.

D. Azure Virtual Desktop

This provides desktop virtualization services.

Question 4

What is the PRIMARY purpose of a playground in Azure AI Foundry?

A. Hosting multiplayer games
B. Experimenting with prompts and testing model behavior
C. Managing employee payroll
D. Compressing image files

Correct Answer

B. Experimenting with prompts and testing model behavior

Explanation

Playgrounds allow users to interact with models, test prompts, and evaluate responses.

Why the Other Answers Are Incorrect

A. Hosting multiplayer games

This is unrelated to AI Foundry.

C. Managing employee payroll

This is unrelated to AI development.

D. Compressing image files

Playgrounds are not image utilities.

Question 5

Which configuration setting controls how creative or random AI-generated responses are?

A. OCR level
B. Temperature
C. Resolution scaling
D. Data indexing

Correct Answer

B. Temperature

Explanation

Temperature controls randomness and creativity in generative AI responses.

Why the Other Answers Are Incorrect

A. OCR level

OCR extracts text from images.

C. Resolution scaling

This relates to images, not text generation randomness.

D. Data indexing

Indexing is unrelated to generative response creativity.

Question 6

What is the effect of setting a lower temperature value in a generative AI model?

A. More random responses
B. More predictable and focused responses
C. Faster internet speeds
D. Larger image generation sizes

Correct Answer

B. More predictable and focused responses

Explanation

Lower temperature settings reduce randomness and produce more deterministic outputs.

Why the Other Answers Are Incorrect

A. More random responses

Higher temperatures increase randomness.

C. Faster internet speeds

Temperature does not affect networking.

D. Larger image generation sizes

Temperature is unrelated to image dimensions.

Question 7

Which prompt type defines the AI assistant’s behavior, tone, and rules?

A. User prompt
B. System prompt
C. SQL query
D. OCR prompt

Correct Answer

B. System prompt

Explanation

System prompts provide high-level behavioral instructions to the AI model.

Why the Other Answers Are Incorrect

A. User prompt

User prompts specify tasks or requests.

C. SQL query

SQL queries interact with databases.

D. OCR prompt

OCR is unrelated to conversational AI behavior.

Question 8

Why are content filters important when deploying generative AI models?

A. They improve internet bandwidth
B. They help reduce harmful or unsafe outputs
C. They increase monitor resolution
D. They replace system prompts entirely

Correct Answer

B. They help reduce harmful or unsafe outputs

Explanation

Content filtering supports Responsible AI by helping prevent harmful or inappropriate AI-generated content.

Why the Other Answers Are Incorrect

A. They improve internet bandwidth

Content filters do not affect networking performance.

C. They increase monitor resolution

This is unrelated to AI safety.

D. They replace system prompts entirely

Content filters complement prompts; they do not replace them.

Question 9

What are hallucinations in generative AI?

A. Physical hardware failures
B. Incorrect or fabricated AI-generated information
C. Database replication errors
D. Unauthorized user logins

Correct Answer

B. Incorrect or fabricated AI-generated information

Explanation

Hallucinations occur when AI generates inaccurate or invented information.

Why the Other Answers Are Incorrect

A. Physical hardware failures

This is unrelated to AI hallucinations.

C. Database replication errors

This is a database issue.

D. Unauthorized user logins

This is a security issue.

Question 10

After deploying a model, how do external applications typically interact with it?

A. Through handwritten forms
B. Through APIs connected to the deployment endpoint
C. Through spreadsheet imports only
D. Through local USB connections

Correct Answer

B. Through APIs connected to the deployment endpoint

Explanation

Applications commonly communicate with deployed AI models using APIs and endpoints.

Why the Other Answers Are Incorrect

A. Through handwritten forms

This is unrelated to AI deployment.

C. Through spreadsheet imports only

Spreadsheets are not the primary integration mechanism.

D. Through local USB connections

Cloud AI services typically use network-based APIs, not USB connections.

Final Thoughts

Deploying and interacting with AI models in Azure AI Foundry is an important skill area for the AI-901 certification exam. Microsoft expects candidates to understand the basic deployment workflow, prompt testing process, model configuration options, and Responsible AI considerations involved in building generative AI applications.

Azure AI Foundry simplifies AI development by providing a centralized environment for deploying, testing, and managing AI models and agents.

Go to the AI-901 Exam Prep Hub main page