This post is a part of the AI-901: Microsoft Azure AI Fundamentals Exam Prep Hub. 
This topic falls under these sections:
Implement AI solutions by using Microsoft Foundry (55–60%)
   --> Implement generative AI apps and agents by using Foundry
      --> Create a lightweight chat client application by using the Foundry SDK

Note that there are 10 practice questions (with answers and explanations) for each section to help you solidify your knowledge of the material. Also, there are 2 practice tests with 60 questions each available on the hub below the exam topics section.

Modern generative AI applications often include chat-based interfaces that allow users to interact naturally with AI models. Microsoft Azure AI Foundry provides SDKs (Software Development Kits) that developers can use to build lightweight chat applications that connect to deployed AI models.

For the AI-901 certification exam, candidates should understand the basic concepts behind creating chat client applications using the Foundry SDK and how these applications interact with deployed generative AI models.

This topic falls under the “Implement generative AI apps and agents by using Foundry” section of the AI-901 exam objectives.

What Is a Chat Client Application?

A chat client application is a software application that allows users to communicate with an AI model using conversational prompts and responses.

Users type messages, and the AI model generates replies.

Common Chat Application Examples

Examples include:

AI assistants
Customer support bots
Internal company copilots
Study assistants
Virtual agents
Help desk chatbots

What Is an SDK?

SDK stands for Software Development Kit.

An SDK provides tools and libraries that help developers build applications more easily.

SDKs typically include:

APIs
Authentication tools
Code libraries
Documentation
Example code

What Is the Foundry SDK?

The Foundry SDK allows developers to connect applications to deployed AI models within Azure AI Foundry.

Developers can use SDKs to:

Send prompts
Receive AI-generated responses
Manage conversations
Configure requests
Handle authentication

Why Use an SDK?

Using an SDK simplifies development.

Without an SDK, developers would need to manually handle:

Network requests
Authentication
Error handling
API formatting

SDKs abstract much of this complexity.

Lightweight Chat Applications

A lightweight chat client is a simple application focused on core chat functionality.

It usually includes:

User input field
Conversation display
AI response generation
Basic session management

Basic Chat Workflow

A typical AI chat application workflow includes:

User enters a prompt
Application sends request to deployed model
AI model processes prompt
Model generates response
Application displays response

Connecting to a Deployed Model

Chat applications connect to deployed AI models using:

API endpoints
Authentication credentials
SDK libraries

The deployed model processes incoming prompts.

Authentication

Applications typically authenticate using:

API keys
Azure credentials
Managed identities

Authentication ensures only authorized users and applications can access AI services.

Example Chat Interaction

User

“Explain machine learning in simple terms.”

AI Model

“Machine learning is a type of AI where computers learn patterns from data instead of being explicitly programmed.”

Conversation History

Many chat applications maintain conversation history.

This allows the AI model to remember context during the session.

Example of Context Retention

User

“Who founded Microsoft?”

AI

“Microsoft was founded by Bill Gates and Paul Allen.”

User

“When was it founded?”

Because conversation history is maintained, the AI understands the second question refers to Microsoft.

System Prompts in Chat Applications

Chat applications often include system prompts that guide model behavior.

Example System Prompt

“You are a helpful technical tutor. Explain topics clearly for beginners.”

This influences:

Tone
Style
Behavior
Safety

User Prompts

User prompts represent the questions or requests entered during the conversation.

Example User Prompt

“Explain neural networks.”

Model Responses

The deployed AI model generates responses based on:

System prompt
User prompt
Conversation history
Model parameters

Model Parameters

Chat applications may configure parameters such as:

Temperature
Maximum tokens
Top-p sampling

Temperature

Temperature controls response creativity.

Low Temperature	High Temperature
More focused	More creative
More predictable	More varied

Maximum Tokens

Maximum tokens limit response length.

Smaller values create shorter responses.

Streaming Responses

Some chat applications support streaming responses.

Streaming displays generated text gradually as the model produces it.

This improves user experience by reducing perceived waiting time.

Error Handling

Applications should handle errors gracefully.

Common issues include:

Network failures
Invalid credentials
Rate limits
Timeout errors

Rate Limits

AI services may limit request frequency.

Applications should be designed to handle:

Request throttling
Retry logic
Usage quotas

Responsible AI Considerations

Chat applications should follow Responsible AI principles.

Important considerations include:

Content filtering
Privacy
Safety
Bias reduction
Transparency

Content Filtering

Content filters help reduce:

Harmful responses
Offensive content
Unsafe outputs

Privacy and Security

Applications should protect:

User conversations
Authentication credentials
Sensitive information

Logging and Monitoring

Organizations may monitor chat applications for:

Performance
Usage
Errors
Safety concerns

Azure AI Foundry

Azure AI Foundry provides tools for deploying models and managing generative AI applications.

Developers can:

Deploy models
Test prompts
Monitor applications
Manage AI resources

Azure OpenAI Service

Azure OpenAI Service provides access to generative AI models used in chat applications.

High-Level SDK Workflow

A simplified workflow for a lightweight chat application typically includes:

Install SDK
Configure credentials
Connect to deployed model
Send prompts
Receive responses
Display conversation

Example High-Level Pseudocode

			
connect_to_model()
while True:
    user_prompt = get_user_input()
    response = send_prompt(user_prompt)
    display_response(response)

		

For AI-901, understanding the overall workflow is more important than memorizing syntax.

Common Real-World Scenarios

Scenario 1: Customer Support Chatbot

Goal

Answer customer questions automatically.

Features

Conversational interface
Context retention
Safe responses

Scenario 2: Internal Knowledge Assistant

Goal

Help employees search company information.

Features

Question answering
Document summarization
Secure access

Scenario 3: Educational Tutor

Goal

Provide interactive learning assistance.

Features

Step-by-step explanations
Conversational learning
Prompt customization

Advantages of Chat-Based AI Applications

Benefits include:

Natural user interaction
Faster information access
Automation of repetitive tasks
Improved customer experience
Scalability

Challenges and Limitations

Organizations should consider:

Hallucinations
Incorrect responses
Cost management
Privacy concerns
Latency
Prompt injection risks

Hallucinations

Generative AI models may occasionally generate incorrect or fabricated information.

These incorrect outputs are called hallucinations.

Applications should not assume all AI-generated responses are accurate.

Prompt Injection Risks

Malicious users may attempt to manipulate prompts to bypass safety controls.

Applications should implement safeguards against unsafe behavior.

Important AI-901 Exam Tips

For the exam, remember these key points:

SDKs simplify application development.
Chat clients communicate with deployed AI model endpoints.
System prompts define AI behavior.
User prompts represent user requests.
Conversation history helps maintain context.
Temperature controls response randomness.
Maximum tokens limit response length.
Streaming responses improve user experience.
Responsible AI principles apply to chat applications.
Authentication secures access to AI services.

Quick Knowledge Check

Question 1

What is the purpose of an SDK?

Answer

To simplify application development using tools and libraries.

Question 2

Why is conversation history important in chat applications?

Answer

It helps maintain context across multiple user interactions.

Question 3

What does temperature control in a generative AI model?

Answer

The creativity and randomness of responses.

Question 4

Why are content filters important?

Answer

They help reduce harmful or unsafe AI-generated outputs.

Practice Exam Questions

Question 1

What is the PRIMARY purpose of a chat client application in generative AI?

A. To physically store servers
B. To allow users to interact conversationally with an AI model
C. To compress database files
D. To manage network hardware

Correct Answer

B. To allow users to interact conversationally with an AI model

Explanation

A chat client application enables users to send prompts and receive AI-generated conversational responses.

Why the Other Answers Are Incorrect

A. To physically store servers

Chat clients are software applications, not physical infrastructure.

C. To compress database files

This is unrelated to chat applications.

D. To manage network hardware

This is unrelated to generative AI chat systems.

Question 2

What does SDK stand for?

A. Secure Data Kernel
B. Software Development Kit
C. System Deployment Key
D. Structured Data Kit

Correct Answer

B. Software Development Kit

Explanation

An SDK provides tools, libraries, and documentation that help developers build applications more efficiently.

Why the Other Answers Are Incorrect

A. Secure Data Kernel

This is not the correct definition.

C. System Deployment Key

This is incorrect terminology.

D. Structured Data Kit

This is not the meaning of SDK.

Question 3

Why do developers commonly use SDKs when building AI applications?

A. SDKs eliminate the need for internet access
B. SDKs simplify communication with AI services and APIs
C. SDKs permanently store all prompts automatically
D. SDKs replace AI models entirely

Correct Answer

B. SDKs simplify communication with AI services and APIs

Explanation

SDKs help developers handle authentication, requests, responses, and integration more easily.

Why the Other Answers Are Incorrect

A. SDKs eliminate the need for internet access

Cloud AI services still require connectivity.

C. SDKs permanently store all prompts automatically

SDKs do not inherently provide permanent storage.

D. SDKs replace AI models entirely

SDKs connect applications to models; they do not replace them.

Question 4

What allows a chat application to remember previous user interactions during a conversation?

A. OCR
B. Conversation history
C. Image classification
D. Regression analysis

Correct Answer

B. Conversation history

Explanation

Conversation history preserves context so the AI can respond appropriately across multiple prompts.

Why the Other Answers Are Incorrect

A. OCR

OCR extracts text from images.

C. Image classification

This categorizes images.

D. Regression analysis

Regression predicts numeric values.

Question 5

Which prompt type defines the AI assistant’s behavior and communication style?

A. User prompt
B. System prompt
C. SQL prompt
D. OCR prompt

Correct Answer

B. System prompt

Explanation

System prompts establish behavior rules, tone, style, and safety guidelines.

Why the Other Answers Are Incorrect

A. User prompt

User prompts contain requests or questions.

C. SQL prompt

SQL is related to databases.

D. OCR prompt

OCR is unrelated to conversational behavior.

Question 6

What is the PRIMARY purpose of authentication in a chat client application?

A. To improve image resolution
B. To ensure only authorized users or applications access AI services
C. To increase response creativity
D. To summarize conversations

Correct Answer

B. To ensure only authorized users or applications access AI services

Explanation

Authentication protects AI resources and controls access to deployed services.

Why the Other Answers Are Incorrect

A. To improve image resolution

Authentication does not affect graphics.

C. To increase response creativity

Temperature settings influence creativity.

D. To summarize conversations

Authentication does not summarize data.

Question 7

Which configuration parameter controls how creative or random a generative AI response will be?

A. Temperature
B. OCR threshold
C. Frame rate
D. Compression ratio

Correct Answer

A. Temperature

Explanation

Temperature controls response randomness and creativity.

Why the Other Answers Are Incorrect

B. OCR threshold

This relates to text extraction.

C. Frame rate

This relates to video processing.

D. Compression ratio

This relates to file compression.

Question 8

What is the benefit of streaming AI responses in a chat application?

A. It improves monitor resolution
B. It allows responses to appear gradually as they are generated
C. It permanently stores all conversations
D. It disables content filtering

Correct Answer

B. It allows responses to appear gradually as they are generated

Explanation

Streaming improves user experience by showing generated text incrementally instead of waiting for the entire response.

Why the Other Answers Are Incorrect

A. It improves monitor resolution

Streaming does not affect displays.

C. It permanently stores all conversations

Streaming does not automatically store data.

D. It disables content filtering

Streaming does not remove safety controls.

Question 9

Which Responsible AI feature helps reduce harmful or offensive AI-generated responses?

A. Content filtering
B. Data compression
C. Video rendering
D. File indexing

Correct Answer

A. Content filtering

Explanation

Content filters help prevent unsafe or inappropriate AI outputs.

Why the Other Answers Are Incorrect

B. Data compression

Compression reduces file size.

C. Video rendering

Rendering creates visual output.

D. File indexing

Indexing organizes data for search.

Question 10

What are hallucinations in generative AI systems?

A. Hardware overheating events
B. Incorrect or fabricated AI-generated information
C. Authentication failures
D. Video processing delays

Correct Answer

B. Incorrect or fabricated AI-generated information

Explanation

Hallucinations occur when AI models generate inaccurate or invented information.

Why the Other Answers Are Incorrect

A. Hardware overheating events

This is unrelated to AI hallucinations.

C. Authentication failures

This is a security issue.

D. Video processing delays

This relates to media performance, not AI accuracy.

Final Thoughts

Creating lightweight chat applications with the Foundry SDK is an important concept for the AI-901 certification exam. Microsoft expects candidates to understand the basic architecture and workflow of AI-powered chat applications, including prompts, endpoints, authentication, conversation management, and Responsible AI considerations.

Azure AI Foundry and Azure OpenAI Service provide powerful tools that allow developers to build conversational AI experiences quickly and efficiently.

Go to the AI-901 Exam Prep Hub main page