This post is a part of the AI-901: Microsoft Azure AI Fundamentals Exam Prep Hub.
This topic falls under these sections:
Implement AI solutions by using Microsoft Foundry (55–60%)
--> Implement generative AI apps and agents by using Foundry
--> Create a lightweight chat client application by using the Foundry SDK
Note that there are 10 practice questions (with answers and explanations) for each section to help you solidify your knowledge of the material. Also, there are 2 practice tests with 60 questions each available on the hub below the exam topics section.
Modern generative AI applications often include chat-based interfaces that allow users to interact naturally with AI models. Microsoft Azure AI Foundry provides SDKs (Software Development Kits) that developers can use to build lightweight chat applications that connect to deployed AI models.
For the AI-901 certification exam, candidates should understand the basic concepts behind creating chat client applications using the Foundry SDK and how these applications interact with deployed generative AI models.
This topic falls under the “Implement generative AI apps and agents by using Foundry” section of the AI-901 exam objectives.
What Is a Chat Client Application?
A chat client application is a software application that allows users to communicate with an AI model using conversational prompts and responses.
Users type messages, and the AI model generates replies.
Common Chat Application Examples
Examples include:
- AI assistants
- Customer support bots
- Internal company copilots
- Study assistants
- Virtual agents
- Help desk chatbots
What Is an SDK?
SDK stands for Software Development Kit.
An SDK provides tools and libraries that help developers build applications more easily.
SDKs typically include:
- APIs
- Authentication tools
- Code libraries
- Documentation
- Example code
What Is the Foundry SDK?
The Foundry SDK allows developers to connect applications to deployed AI models within Azure AI Foundry.
Developers can use SDKs to:
- Send prompts
- Receive AI-generated responses
- Manage conversations
- Configure requests
- Handle authentication
Why Use an SDK?
Using an SDK simplifies development.
Without an SDK, developers would need to manually handle:
- Network requests
- Authentication
- Error handling
- API formatting
SDKs abstract much of this complexity.
Lightweight Chat Applications
A lightweight chat client is a simple application focused on core chat functionality.
It usually includes:
- User input field
- Conversation display
- AI response generation
- Basic session management
Basic Chat Workflow
A typical AI chat application workflow includes:
- User enters a prompt
- Application sends request to deployed model
- AI model processes prompt
- Model generates response
- Application displays response
Connecting to a Deployed Model
Chat applications connect to deployed AI models using:
- API endpoints
- Authentication credentials
- SDK libraries
The deployed model processes incoming prompts.
Authentication
Applications typically authenticate using:
- API keys
- Azure credentials
- Managed identities
Authentication ensures only authorized users and applications can access AI services.
Example Chat Interaction
User
“Explain machine learning in simple terms.”
AI Model
“Machine learning is a type of AI where computers learn patterns from data instead of being explicitly programmed.”
Conversation History
Many chat applications maintain conversation history.
This allows the AI model to remember context during the session.
Example of Context Retention
User
“Who founded Microsoft?”
AI
“Microsoft was founded by Bill Gates and Paul Allen.”
User
“When was it founded?”
Because conversation history is maintained, the AI understands the second question refers to Microsoft.
System Prompts in Chat Applications
Chat applications often include system prompts that guide model behavior.
Example System Prompt
“You are a helpful technical tutor. Explain topics clearly for beginners.”
This influences:
- Tone
- Style
- Behavior
- Safety
User Prompts
User prompts represent the questions or requests entered during the conversation.
Example User Prompt
“Explain neural networks.”
Model Responses
The deployed AI model generates responses based on:
- System prompt
- User prompt
- Conversation history
- Model parameters
Model Parameters
Chat applications may configure parameters such as:
- Temperature
- Maximum tokens
- Top-p sampling
Temperature
Temperature controls response creativity.
| Low Temperature | High Temperature |
|---|---|
| More focused | More creative |
| More predictable | More varied |
Maximum Tokens
Maximum tokens limit response length.
Smaller values create shorter responses.
Streaming Responses
Some chat applications support streaming responses.
Streaming displays generated text gradually as the model produces it.
This improves user experience by reducing perceived waiting time.
Error Handling
Applications should handle errors gracefully.
Common issues include:
- Network failures
- Invalid credentials
- Rate limits
- Timeout errors
Rate Limits
AI services may limit request frequency.
Applications should be designed to handle:
- Request throttling
- Retry logic
- Usage quotas
Responsible AI Considerations
Chat applications should follow Responsible AI principles.
Important considerations include:
- Content filtering
- Privacy
- Safety
- Bias reduction
- Transparency
Content Filtering
Content filters help reduce:
- Harmful responses
- Offensive content
- Unsafe outputs
Privacy and Security
Applications should protect:
- User conversations
- Authentication credentials
- Sensitive information
Logging and Monitoring
Organizations may monitor chat applications for:
- Performance
- Usage
- Errors
- Safety concerns
Azure AI Foundry
Azure AI Foundry provides tools for deploying models and managing generative AI applications.
Developers can:
- Deploy models
- Test prompts
- Monitor applications
- Manage AI resources
Azure OpenAI Service
Azure OpenAI Service provides access to generative AI models used in chat applications.
High-Level SDK Workflow
A simplified workflow for a lightweight chat application typically includes:
- Install SDK
- Configure credentials
- Connect to deployed model
- Send prompts
- Receive responses
- Display conversation
Example High-Level Pseudocode
connect_to_model()while True: user_prompt = get_user_input() response = send_prompt(user_prompt) display_response(response)
For AI-901, understanding the overall workflow is more important than memorizing syntax.
Common Real-World Scenarios
Scenario 1: Customer Support Chatbot
Goal
Answer customer questions automatically.
Features
- Conversational interface
- Context retention
- Safe responses
Scenario 2: Internal Knowledge Assistant
Goal
Help employees search company information.
Features
- Question answering
- Document summarization
- Secure access
Scenario 3: Educational Tutor
Goal
Provide interactive learning assistance.
Features
- Step-by-step explanations
- Conversational learning
- Prompt customization
Advantages of Chat-Based AI Applications
Benefits include:
- Natural user interaction
- Faster information access
- Automation of repetitive tasks
- Improved customer experience
- Scalability
Challenges and Limitations
Organizations should consider:
- Hallucinations
- Incorrect responses
- Cost management
- Privacy concerns
- Latency
- Prompt injection risks
Hallucinations
Generative AI models may occasionally generate incorrect or fabricated information.
These incorrect outputs are called hallucinations.
Applications should not assume all AI-generated responses are accurate.
Prompt Injection Risks
Malicious users may attempt to manipulate prompts to bypass safety controls.
Applications should implement safeguards against unsafe behavior.
Important AI-901 Exam Tips
For the exam, remember these key points:
- SDKs simplify application development.
- Chat clients communicate with deployed AI model endpoints.
- System prompts define AI behavior.
- User prompts represent user requests.
- Conversation history helps maintain context.
- Temperature controls response randomness.
- Maximum tokens limit response length.
- Streaming responses improve user experience.
- Responsible AI principles apply to chat applications.
- Authentication secures access to AI services.
Quick Knowledge Check
Question 1
What is the purpose of an SDK?
Answer
To simplify application development using tools and libraries.
Question 2
Why is conversation history important in chat applications?
Answer
It helps maintain context across multiple user interactions.
Question 3
What does temperature control in a generative AI model?
Answer
The creativity and randomness of responses.
Question 4
Why are content filters important?
Answer
They help reduce harmful or unsafe AI-generated outputs.
Practice Exam Questions
Question 1
What is the PRIMARY purpose of a chat client application in generative AI?
A. To physically store servers
B. To allow users to interact conversationally with an AI model
C. To compress database files
D. To manage network hardware
Correct Answer
B. To allow users to interact conversationally with an AI model
Explanation
A chat client application enables users to send prompts and receive AI-generated conversational responses.
Why the Other Answers Are Incorrect
A. To physically store servers
Chat clients are software applications, not physical infrastructure.
C. To compress database files
This is unrelated to chat applications.
D. To manage network hardware
This is unrelated to generative AI chat systems.
Question 2
What does SDK stand for?
A. Secure Data Kernel
B. Software Development Kit
C. System Deployment Key
D. Structured Data Kit
Correct Answer
B. Software Development Kit
Explanation
An SDK provides tools, libraries, and documentation that help developers build applications more efficiently.
Why the Other Answers Are Incorrect
A. Secure Data Kernel
This is not the correct definition.
C. System Deployment Key
This is incorrect terminology.
D. Structured Data Kit
This is not the meaning of SDK.
Question 3
Why do developers commonly use SDKs when building AI applications?
A. SDKs eliminate the need for internet access
B. SDKs simplify communication with AI services and APIs
C. SDKs permanently store all prompts automatically
D. SDKs replace AI models entirely
Correct Answer
B. SDKs simplify communication with AI services and APIs
Explanation
SDKs help developers handle authentication, requests, responses, and integration more easily.
Why the Other Answers Are Incorrect
A. SDKs eliminate the need for internet access
Cloud AI services still require connectivity.
C. SDKs permanently store all prompts automatically
SDKs do not inherently provide permanent storage.
D. SDKs replace AI models entirely
SDKs connect applications to models; they do not replace them.
Question 4
What allows a chat application to remember previous user interactions during a conversation?
A. OCR
B. Conversation history
C. Image classification
D. Regression analysis
Correct Answer
B. Conversation history
Explanation
Conversation history preserves context so the AI can respond appropriately across multiple prompts.
Why the Other Answers Are Incorrect
A. OCR
OCR extracts text from images.
C. Image classification
This categorizes images.
D. Regression analysis
Regression predicts numeric values.
Question 5
Which prompt type defines the AI assistant’s behavior and communication style?
A. User prompt
B. System prompt
C. SQL prompt
D. OCR prompt
Correct Answer
B. System prompt
Explanation
System prompts establish behavior rules, tone, style, and safety guidelines.
Why the Other Answers Are Incorrect
A. User prompt
User prompts contain requests or questions.
C. SQL prompt
SQL is related to databases.
D. OCR prompt
OCR is unrelated to conversational behavior.
Question 6
What is the PRIMARY purpose of authentication in a chat client application?
A. To improve image resolution
B. To ensure only authorized users or applications access AI services
C. To increase response creativity
D. To summarize conversations
Correct Answer
B. To ensure only authorized users or applications access AI services
Explanation
Authentication protects AI resources and controls access to deployed services.
Why the Other Answers Are Incorrect
A. To improve image resolution
Authentication does not affect graphics.
C. To increase response creativity
Temperature settings influence creativity.
D. To summarize conversations
Authentication does not summarize data.
Question 7
Which configuration parameter controls how creative or random a generative AI response will be?
A. Temperature
B. OCR threshold
C. Frame rate
D. Compression ratio
Correct Answer
A. Temperature
Explanation
Temperature controls response randomness and creativity.
Why the Other Answers Are Incorrect
B. OCR threshold
This relates to text extraction.
C. Frame rate
This relates to video processing.
D. Compression ratio
This relates to file compression.
Question 8
What is the benefit of streaming AI responses in a chat application?
A. It improves monitor resolution
B. It allows responses to appear gradually as they are generated
C. It permanently stores all conversations
D. It disables content filtering
Correct Answer
B. It allows responses to appear gradually as they are generated
Explanation
Streaming improves user experience by showing generated text incrementally instead of waiting for the entire response.
Why the Other Answers Are Incorrect
A. It improves monitor resolution
Streaming does not affect displays.
C. It permanently stores all conversations
Streaming does not automatically store data.
D. It disables content filtering
Streaming does not remove safety controls.
Question 9
Which Responsible AI feature helps reduce harmful or offensive AI-generated responses?
A. Content filtering
B. Data compression
C. Video rendering
D. File indexing
Correct Answer
A. Content filtering
Explanation
Content filters help prevent unsafe or inappropriate AI outputs.
Why the Other Answers Are Incorrect
B. Data compression
Compression reduces file size.
C. Video rendering
Rendering creates visual output.
D. File indexing
Indexing organizes data for search.
Question 10
What are hallucinations in generative AI systems?
A. Hardware overheating events
B. Incorrect or fabricated AI-generated information
C. Authentication failures
D. Video processing delays
Correct Answer
B. Incorrect or fabricated AI-generated information
Explanation
Hallucinations occur when AI models generate inaccurate or invented information.
Why the Other Answers Are Incorrect
A. Hardware overheating events
This is unrelated to AI hallucinations.
C. Authentication failures
This is a security issue.
D. Video processing delays
This relates to media performance, not AI accuracy.
Final Thoughts
Creating lightweight chat applications with the Foundry SDK is an important concept for the AI-901 certification exam. Microsoft expects candidates to understand the basic architecture and workflow of AI-powered chat applications, including prompts, endpoints, authentication, conversation management, and Responsible AI considerations.
Azure AI Foundry and Azure OpenAI Service provide powerful tools that allow developers to build conversational AI experiences quickly and efficiently.
Go to the AI-901 Exam Prep Hub main page

One thought on “Create a lightweight chat client application by using the Foundry SDK (AI-901 Exam Prep)”