Deploy a model and interact with it in the Foundry Portal (AI-901 Exam Prep)

This post is a part of the AI-901: Microsoft Azure AI Fundamentals Exam Prep Hub. 
This topic falls under these sections:
Implement AI solutions by using Microsoft Foundry (55–60%)
--> Implement generative AI apps and agents by using Foundry
--> Deploy a model and interact with it in the Foundry Portal


Note that there are 10 practice questions (with answers and explanations) for each section to help you solidify your knowledge of the material. Also, there are 2 practice tests with 60 questions each available on the hub below the exam topics section.

Microsoft Azure AI Foundry provides a centralized environment for building, testing, deploying, and managing generative AI models and AI-powered applications. For the AI-901 certification exam, candidates should understand the basic process of deploying AI models and interacting with them through the Foundry portal.

This topic focuses on how developers and AI practitioners use Azure AI Foundry to deploy generative AI models, test prompts, configure model settings, and interact with deployed AI endpoints.

This topic falls under the “Implement generative AI apps and agents by using Foundry” section of the AI-901 exam objectives.


What Is Azure AI Foundry?

Azure AI Foundry is Microsoft’s platform for building and managing AI applications and agents.

Azure AI Foundry provides tools to:

  • Explore AI models
  • Deploy models
  • Test prompts
  • Configure AI behavior
  • Evaluate responses
  • Monitor AI applications
  • Manage AI resources

It supports generative AI development using Azure-hosted AI services and models.


What Does “Deploying a Model” Mean?

Deploying a model means making the AI model available for use.

A deployed model can:

  • Receive prompts
  • Process requests
  • Generate responses
  • Be accessed through applications or APIs

Deployment creates an endpoint that applications can use to interact with the model.


What Is a Model Endpoint?

An endpoint is a network-accessible interface that allows applications or users to communicate with a deployed AI model.

Applications send requests to the endpoint and receive AI-generated responses.


Common Deployment Scenarios

Organizations deploy models for many purposes, including:

  • Chatbots
  • AI assistants
  • Document summarization
  • Content generation
  • Customer support systems
  • Code generation
  • Data extraction

Azure AI Foundry Workflow

A simplified workflow in Azure AI Foundry typically includes:

  1. Create or access an Azure AI resource
  2. Open Azure AI Foundry portal
  3. Select a model
  4. Configure deployment settings
  5. Deploy the model
  6. Test prompts
  7. Interact with the model
  8. Integrate the endpoint into applications

Accessing the Foundry Portal

Users access Azure AI Foundry through a web-based portal.

The portal provides graphical tools for:

  • Model selection
  • Prompt testing
  • Deployment management
  • Performance monitoring

Exploring Available Models

Azure AI Foundry allows users to browse available models.

Examples may include:

  • Large Language Models (LLMs)
  • Image-generation models
  • Embedding models
  • Speech models

Models may vary by:

  • Size
  • Performance
  • Cost
  • Supported capabilities

Selecting a Model

Users choose models based on application requirements.

Factors may include:

  • Accuracy
  • Speed
  • Cost
  • Context window size
  • Multimodal support
  • Language support

Example Scenario

A company building a customer support chatbot may choose a conversational large language model.


Deploying a Model in Foundry

The deployment process usually involves:

  • Selecting a model
  • Naming the deployment
  • Choosing deployment settings
  • Allocating resources
  • Creating the endpoint

Deployment Names

Deployments are typically assigned unique names.

Example

support-chat-model

Applications use deployment names when sending requests.


Model Configuration Options

During deployment, users may configure:

  • Model version
  • Scaling options
  • Authentication settings
  • Content filters
  • Region
  • Resource allocation

Content Filtering and Safety

Azure AI Foundry includes Responsible AI safety features.

Content filtering helps reduce:

  • Harmful outputs
  • Offensive content
  • Unsafe responses

This is important for enterprise AI applications.


Interacting with a Deployed Model

After deployment, users can interact with the model directly within the Foundry portal.

This often includes:

  • Entering prompts
  • Viewing responses
  • Adjusting settings
  • Testing behavior

Playground Interfaces

Azure AI Foundry provides playground environments for experimentation.

Playgrounds allow users to:

  • Test prompts
  • Compare outputs
  • Tune settings
  • Evaluate model behavior

Prompt Testing

Users can experiment with:

  • System prompts
  • User prompts
  • Formatting instructions
  • Role prompting

Prompt testing helps improve AI response quality.


Example Prompt Interaction

User Prompt

“Summarize this customer feedback in three bullet points.”

Model Response

The model generates a summarized response.


Model Parameters

Foundry portals may allow adjustment of model parameters such as:

  • Temperature
  • Maximum tokens
  • Top-p sampling

Temperature

Temperature controls response randomness.

Low TemperatureHigh Temperature
More predictableMore creative
More focusedMore varied

Maximum Tokens

Maximum tokens limit response length.

Smaller limits create shorter responses.


System Prompts in Foundry

Users can configure system prompts to guide AI behavior.


Example System Prompt

“You are a professional technical support assistant. Keep responses concise and helpful.”

System prompts influence:

  • Tone
  • Style
  • Safety
  • Formatting

Evaluating Responses

Users should evaluate AI outputs for:

  • Accuracy
  • Relevance
  • Safety
  • Bias
  • Hallucinations

AI-generated content should be reviewed carefully.


Hallucinations

Generative AI models can produce incorrect or fabricated information.

These incorrect outputs are called hallucinations.

Prompt engineering and grounding techniques help reduce hallucinations.


API Access

Once deployed, applications can connect to the model endpoint using APIs.

This allows developers to integrate AI into applications.


Common Integration Scenarios

Applications may use deployed models for:

  • Chat interfaces
  • Search assistants
  • Document analysis
  • AI copilots
  • Workflow automation

Monitoring and Management

Azure AI Foundry supports monitoring deployed models.

Monitoring may include:

  • Usage tracking
  • Performance analysis
  • Error monitoring
  • Cost management

Scaling AI Deployments

Organizations may scale deployments to support:

  • More users
  • Higher request volumes
  • Faster response times

Cloud-based deployments support elastic scaling.


Responsible AI Considerations

When deploying AI models, organizations should consider:

  • Privacy
  • Security
  • Fairness
  • Transparency
  • Safety
  • Compliance

Generative AI applications should include safeguards against misuse.


Authentication and Security

Deployed models typically require secure authentication.

Security features may include:

  • API keys
  • Identity management
  • Access control

Common Challenges

Organizations may encounter challenges such as:

  • High usage costs
  • Latency
  • Hallucinations
  • Unsafe outputs
  • Poor prompt quality

Proper testing and monitoring are important.


Azure OpenAI Service

Azure OpenAI Service provides access to powerful generative AI models that can be deployed and managed through Azure AI Foundry.


Real-World Scenarios


Scenario 1: Customer Support Chatbot

Goal

Deploy a conversational AI assistant.

Activities

  • Deploy language model
  • Configure system prompts
  • Test responses in the playground

Scenario 2: Internal Knowledge Assistant

Goal

Allow employees to ask questions about company documentation.

Activities

  • Deploy AI model
  • Configure prompts
  • Integrate with enterprise systems

Scenario 3: Marketing Content Generator

Goal

Generate product descriptions automatically.

Activities

  • Deploy generative AI model
  • Test prompt variations
  • Evaluate response quality

Important AI-901 Exam Tips

For the exam, remember these key points:

  • Deploying a model makes it available for use.
  • Deployments create accessible endpoints.
  • Azure AI Foundry provides tools for testing and managing models.
  • Playgrounds allow prompt experimentation.
  • System prompts guide model behavior.
  • Temperature controls creativity and randomness.
  • Maximum tokens control response length.
  • AI outputs should be evaluated for accuracy and safety.
  • Content filtering supports Responsible AI practices.
  • APIs allow applications to connect to deployed models.

Quick Knowledge Check

Question 1

What does deploying a model do?

Answer

It makes the AI model available for use through an endpoint.


Question 2

What is the purpose of a playground in Azure AI Foundry?

Answer

To test prompts and interact with deployed models.


Question 3

What does the temperature setting control?

Answer

The randomness and creativity of model responses.


Question 4

Why are content filters important?

Answer

They help reduce harmful or unsafe AI-generated outputs.


Practice Exam Questions

Question 1

What is the PRIMARY purpose of deploying an AI model?

A. To permanently delete the model
B. To make the model available for use through an endpoint
C. To compress training data
D. To convert images into text


Correct Answer

B. To make the model available for use through an endpoint


Explanation

Deploying a model makes it accessible so applications and users can interact with it.


Why the Other Answers Are Incorrect

A. To permanently delete the model

Deployment does not delete models.

C. To compress training data

Deployment is unrelated to data compression.

D. To convert images into text

This describes OCR.


Question 2

What is an endpoint in the context of AI model deployment?

A. A physical server room
B. A network-accessible interface for interacting with a deployed model
C. A type of database backup
D. A computer vision algorithm


Correct Answer

B. A network-accessible interface for interacting with a deployed model


Explanation

Endpoints allow applications and users to send requests to deployed AI models and receive responses.


Why the Other Answers Are Incorrect

A. A physical server room

Endpoints are logical interfaces, not physical locations.

C. A type of database backup

This is unrelated to AI deployment.

D. A computer vision algorithm

Endpoints are not algorithms.


Question 3

Which Azure tool provides playgrounds for testing prompts and interacting with deployed AI models?

A. Azure SQL Database
B. Azure AI Foundry
C. Microsoft Excel
D. Azure Virtual Desktop


Correct Answer

B. Azure AI Foundry


Explanation

Azure AI Foundry provides tools for model deployment, prompt testing, evaluation, and management.


Why the Other Answers Are Incorrect

A. Azure SQL Database

This is a database service.

C. Microsoft Excel

Excel is not an AI deployment platform.

D. Azure Virtual Desktop

This provides desktop virtualization services.


Question 4

What is the PRIMARY purpose of a playground in Azure AI Foundry?

A. Hosting multiplayer games
B. Experimenting with prompts and testing model behavior
C. Managing employee payroll
D. Compressing image files


Correct Answer

B. Experimenting with prompts and testing model behavior


Explanation

Playgrounds allow users to interact with models, test prompts, and evaluate responses.


Why the Other Answers Are Incorrect

A. Hosting multiplayer games

This is unrelated to AI Foundry.

C. Managing employee payroll

This is unrelated to AI development.

D. Compressing image files

Playgrounds are not image utilities.


Question 5

Which configuration setting controls how creative or random AI-generated responses are?

A. OCR level
B. Temperature
C. Resolution scaling
D. Data indexing


Correct Answer

B. Temperature


Explanation

Temperature controls randomness and creativity in generative AI responses.


Why the Other Answers Are Incorrect

A. OCR level

OCR extracts text from images.

C. Resolution scaling

This relates to images, not text generation randomness.

D. Data indexing

Indexing is unrelated to generative response creativity.


Question 6

What is the effect of setting a lower temperature value in a generative AI model?

A. More random responses
B. More predictable and focused responses
C. Faster internet speeds
D. Larger image generation sizes


Correct Answer

B. More predictable and focused responses


Explanation

Lower temperature settings reduce randomness and produce more deterministic outputs.


Why the Other Answers Are Incorrect

A. More random responses

Higher temperatures increase randomness.

C. Faster internet speeds

Temperature does not affect networking.

D. Larger image generation sizes

Temperature is unrelated to image dimensions.


Question 7

Which prompt type defines the AI assistant’s behavior, tone, and rules?

A. User prompt
B. System prompt
C. SQL query
D. OCR prompt


Correct Answer

B. System prompt


Explanation

System prompts provide high-level behavioral instructions to the AI model.


Why the Other Answers Are Incorrect

A. User prompt

User prompts specify tasks or requests.

C. SQL query

SQL queries interact with databases.

D. OCR prompt

OCR is unrelated to conversational AI behavior.


Question 8

Why are content filters important when deploying generative AI models?

A. They improve internet bandwidth
B. They help reduce harmful or unsafe outputs
C. They increase monitor resolution
D. They replace system prompts entirely


Correct Answer

B. They help reduce harmful or unsafe outputs


Explanation

Content filtering supports Responsible AI by helping prevent harmful or inappropriate AI-generated content.


Why the Other Answers Are Incorrect

A. They improve internet bandwidth

Content filters do not affect networking performance.

C. They increase monitor resolution

This is unrelated to AI safety.

D. They replace system prompts entirely

Content filters complement prompts; they do not replace them.


Question 9

What are hallucinations in generative AI?

A. Physical hardware failures
B. Incorrect or fabricated AI-generated information
C. Database replication errors
D. Unauthorized user logins


Correct Answer

B. Incorrect or fabricated AI-generated information


Explanation

Hallucinations occur when AI generates inaccurate or invented information.


Why the Other Answers Are Incorrect

A. Physical hardware failures

This is unrelated to AI hallucinations.

C. Database replication errors

This is a database issue.

D. Unauthorized user logins

This is a security issue.


Question 10

After deploying a model, how do external applications typically interact with it?

A. Through handwritten forms
B. Through APIs connected to the deployment endpoint
C. Through spreadsheet imports only
D. Through local USB connections


Correct Answer

B. Through APIs connected to the deployment endpoint


Explanation

Applications commonly communicate with deployed AI models using APIs and endpoints.


Why the Other Answers Are Incorrect

A. Through handwritten forms

This is unrelated to AI deployment.

C. Through spreadsheet imports only

Spreadsheets are not the primary integration mechanism.

D. Through local USB connections

Cloud AI services typically use network-based APIs, not USB connections.


Final Thoughts

Deploying and interacting with AI models in Azure AI Foundry is an important skill area for the AI-901 certification exam. Microsoft expects candidates to understand the basic deployment workflow, prompt testing process, model configuration options, and Responsible AI considerations involved in building generative AI applications.

Azure AI Foundry simplifies AI development by providing a centralized environment for deploying, testing, and managing AI models and agents.


Go to the AI-901 Exam Prep Hub main page

One thought on “Deploy a model and interact with it in the Foundry Portal (AI-901 Exam Prep)”

Leave a comment