Category: AI-901

Identify an appropriate AI model, based on capabilities (AI-901 Exam Prep)

This post is a part of the AI-901: Microsoft Azure AI Fundamentals Exam Prep Hub. 
This topic falls under these sections:
Identify AI concepts and capabilities (40–45%)
--> Identify AI model components and configurations
--> Identify an appropriate AI model, based on capabilities


Note that there are 10 practice questions (with answers and explanations) for each section to help you solidify your knowledge of the material. Also, there are 2 practice tests with 60 questions each available on the hub below the exam topics section.

Selecting the correct AI model for a specific business problem is an important skill and a key topic for the AI-901 certification exam. Microsoft expects candidates to understand the capabilities of common AI model types and recognize which model is appropriate for different scenarios.

This topic falls under the “Identify AI model components and configurations” section of the exam objectives.


Why Choosing the Right AI Model Matters

Different AI models are designed for different types of tasks.

Choosing the wrong model may lead to:

  • Poor accuracy
  • Inefficient processing
  • Increased costs
  • Unusable results
  • Poor user experiences

Understanding model capabilities helps organizations build effective AI solutions.


Major Categories of AI Models

For AI-901, you should understand the capabilities of several major AI model categories:

  • Classification models
  • Regression models
  • Clustering models
  • Computer vision models
  • Natural language processing (NLP) models
  • Generative AI models
  • Recommendation systems
  • Anomaly detection models

Classification Models

Classification models predict categories or labels.

They answer questions such as:

  • “What type is this?”
  • “Which category does this belong to?”

Common Use Cases

  • Spam email detection
  • Fraud detection
  • Sentiment analysis
  • Medical diagnosis classification
  • Image categorization

Example

A model predicts whether an email is:

  • Spam
  • Not spam

This is a classification problem.


Binary Classification

Binary classification predicts one of two possible outcomes.

Examples

  • Fraud or not fraud
  • Approved or denied
  • Positive or negative sentiment

Multiclass Classification

Multiclass classification predicts one of several categories.

Example

An AI model identifies whether an image contains:

  • A dog
  • A cat
  • A bird
  • A horse

Regression Models

Regression models predict numeric values.

They answer questions such as:

  • “How much?”
  • “How many?”
  • “What value?”

Common Use Cases

  • House price prediction
  • Sales forecasting
  • Temperature prediction
  • Demand estimation

Example

Predicting the selling price of a house based on:

  • Size
  • Location
  • Number of bedrooms

This is a regression problem.


Clustering Models

Clustering models group similar items together without predefined labels.

Clustering is a type of unsupervised learning.

Common Use Cases

  • Customer segmentation
  • Market analysis
  • Pattern discovery
  • Grouping similar documents

Example

A retailer groups customers based on purchasing behavior.

The model discovers patterns automatically.


Computer Vision Models

Computer vision models analyze images and video.

Common Capabilities

  • Object detection
  • Facial recognition
  • Image classification
  • Optical Character Recognition (OCR)
  • Image tagging

Example Use Cases

  • Self-driving cars
  • Security systems
  • Medical imaging
  • Product identification

Image Classification

Image classification identifies what appears in an image.

Example

Determining whether an image contains:

  • A cat
  • A dog
  • A car

Object Detection

Object detection identifies and locates objects within an image.

Example

A traffic monitoring system detects:

  • Cars
  • Pedestrians
  • Traffic lights

and determines their positions.


Optical Character Recognition (OCR)

OCR extracts text from images or scanned documents.

Example

Reading text from:

  • Receipts
  • Invoices
  • Forms
  • License plates

Natural Language Processing (NLP) Models

NLP models work with human language.

Common Capabilities

  • Sentiment analysis
  • Translation
  • Text summarization
  • Chatbots
  • Speech recognition
  • Named entity recognition

Example Use Cases

  • Customer support chatbots
  • Language translation apps
  • Voice assistants

Sentiment Analysis

Sentiment analysis identifies emotional tone in text.

Example

Determining whether a product review is:

  • Positive
  • Negative
  • Neutral

Translation Models

Translation models convert text between languages.

Example

Converting English text into Spanish.


Speech Recognition

Speech recognition converts spoken language into text.

Example

Voice assistants converting speech commands into written text.


Generative AI Models

Generative AI models create new content.

Common Outputs

  • Text
  • Images
  • Audio
  • Video
  • Code

Example Use Cases

  • AI chatbots
  • Content generation
  • Image creation
  • Coding assistants

Large Language Models (LLMs)

LLMs are generative AI models focused on language tasks.

Capabilities

  • Conversations
  • Summarization
  • Question answering
  • Content generation
  • Code generation

Example

An AI assistant answering user questions in natural language.


Recommendation Systems

Recommendation systems suggest items users may prefer.

Common Use Cases

  • Product recommendations
  • Movie recommendations
  • Music recommendations
  • Online advertising

Example

An online retailer recommends products based on browsing history.


Anomaly Detection Models

Anomaly detection models identify unusual patterns or behaviors.

Common Use Cases

  • Fraud detection
  • Cybersecurity monitoring
  • Equipment failure prediction
  • Network intrusion detection

Example

A bank identifies suspicious credit card transactions.


Supervised vs. Unsupervised Learning

Understanding learning types helps identify appropriate models.

Learning TypeDescription
Supervised LearningUses labeled data
Unsupervised LearningFinds patterns without labels

Supervised Examples

  • Classification
  • Regression

Unsupervised Examples

  • Clustering
  • Some anomaly detection systems

Choosing the Right AI Model

To select an appropriate AI model, ask:


What Type of Output Is Needed?

GoalModel Type
Predict categoriesClassification
Predict numbersRegression
Group similar itemsClustering
Generate contentGenerative AI
Analyze imagesComputer Vision
Process languageNLP

Is the Data Labeled?

Data TypeAppropriate Learning Type
Labeled dataSupervised learning
Unlabeled dataUnsupervised learning

What Content Is Being Processed?

Content TypeAppropriate Model
TextNLP or LLM
ImagesComputer Vision
AudioSpeech models
Numerical dataRegression or classification

Real-World Examples


Scenario 1: Email Spam Detection

Goal

Identify whether emails are spam.

Best Model

Classification model


Scenario 2: Predicting House Prices

Goal

Estimate home values.

Best Model

Regression model


Scenario 3: Grouping Customers by Buying Behavior

Goal

Identify customer segments.

Best Model

Clustering model


Scenario 4: AI Chatbot

Goal

Generate conversational responses.

Best Model

Large Language Model (LLM)


Scenario 5: Reading Text from Scanned Documents

Goal

Extract printed text.

Best Model

OCR computer vision model


Scenario 6: Detecting Fraudulent Transactions

Goal

Identify suspicious activity.

Best Model

Anomaly detection model


Azure AI Services and Model Types

Microsoft Azure AI Services provide many prebuilt AI capabilities, including:

  • Vision services
  • Speech services
  • Language services
  • Generative AI tools
  • Document intelligence
  • Recommendation capabilities

Microsoft Azure helps organizations apply the correct AI models to different business scenarios.


Responsible AI Considerations

When selecting AI models, organizations should also consider:

  • Fairness
  • Transparency
  • Privacy
  • Reliability
  • Inclusiveness
  • Accountability

A technically accurate model may still create ethical or operational concerns if deployed improperly.


Important AI-901 Exam Tips

For the exam, remember these key points:

  • Classification predicts categories.
  • Regression predicts numeric values.
  • Clustering groups similar items.
  • NLP models process language.
  • Computer vision models process images and video.
  • Generative AI creates new content.
  • Recommendation systems suggest relevant items.
  • Anomaly detection identifies unusual behavior.
  • LLMs are generative AI models for language tasks.
  • OCR extracts text from images or documents.

Quick Knowledge Check

Question 1

Which model type is best for predicting numeric values?

Answer

Regression models.


Question 2

Which AI capability is used to extract text from scanned documents?

Answer

Optical Character Recognition (OCR).


Question 3

What type of model is typically used for chatbots that generate responses?

Answer

Large Language Models (LLMs).


Question 4

Which learning type uses unlabeled data?

Answer

Unsupervised learning.


Practice Exam Questions

Question 1

A company wants to predict future monthly sales revenue based on historical sales data.

Which type of AI model is MOST appropriate?

A. Classification
B. Regression
C. Clustering
D. Computer vision


Correct Answer

B. Regression


Explanation

Regression models are used to predict numeric values such as revenue, prices, or temperatures.


Why the Other Answers Are Incorrect

A. Classification

Classification predicts categories, not numeric values.

C. Clustering

Clustering groups similar items.

D. Computer vision

Computer vision processes images and video.


Question 2

An organization wants to identify whether emails are spam or not spam.

Which type of AI model should be used?

A. Regression
B. Clustering
C. Classification
D. OCR


Correct Answer

C. Classification


Explanation

Spam detection is a classification problem because the output belongs to predefined categories: spam or not spam.


Why the Other Answers Are Incorrect

A. Regression

Regression predicts numeric values.

B. Clustering

Clustering groups unlabeled data.

D. OCR

OCR extracts text from images.


Question 3

Which AI capability is MOST appropriate for extracting text from scanned documents?

A. Object detection
B. OCR
C. Regression
D. Recommendation system


Correct Answer

B. OCR


Explanation

Optical Character Recognition (OCR) extracts printed or handwritten text from images or scanned documents.


Why the Other Answers Are Incorrect

A. Object detection

Object detection identifies objects within images.

C. Regression

Regression predicts numeric values.

D. Recommendation system

Recommendation systems suggest items to users.


Question 4

A retailer wants to group customers based on purchasing behavior without predefined labels.

Which type of AI model is MOST appropriate?

A. Classification
B. Regression
C. Clustering
D. Translation


Correct Answer

C. Clustering


Explanation

Clustering models group similar data points together without labeled categories.


Why the Other Answers Are Incorrect

A. Classification

Classification requires labeled categories.

B. Regression

Regression predicts numbers.

D. Translation

Translation converts text between languages.


Question 5

Which type of AI model is BEST suited for generating natural language responses in a chatbot?

A. Large Language Model (LLM)
B. Regression model
C. Clustering model
D. Decision tree only


Correct Answer

A. Large Language Model (LLM)


Explanation

LLMs are generative AI models designed for language tasks such as conversation, summarization, and question answering.


Why the Other Answers Are Incorrect

B. Regression model

Regression predicts numeric values.

C. Clustering model

Clustering groups similar data.

D. Decision tree only

Decision trees are not specialized for conversational text generation.


Question 6

A bank wants to identify suspicious credit card transactions that differ from normal spending patterns.

Which AI capability is MOST appropriate?

A. Sentiment analysis
B. Anomaly detection
C. OCR
D. Image classification


Correct Answer

B. Anomaly detection


Explanation

Anomaly detection models identify unusual or abnormal behavior that may indicate fraud or security issues.


Why the Other Answers Are Incorrect

A. Sentiment analysis

Sentiment analysis evaluates emotional tone in text.

C. OCR

OCR extracts text from images.

D. Image classification

Image classification categorizes images.


Question 7

What is the PRIMARY capability of a computer vision model?

A. Predicting stock prices
B. Processing and analyzing visual content such as images and video
C. Translating text between languages
D. Generating database queries


Correct Answer

B. Processing and analyzing visual content such as images and video


Explanation

Computer vision models work with images and video to identify objects, text, faces, and other visual information.


Why the Other Answers Are Incorrect

A. Predicting stock prices

This is typically a regression problem.

C. Translating text between languages

Translation is an NLP task.

D. Generating database queries

This is not the primary role of computer vision.


Question 8

A streaming service suggests movies based on a user’s viewing history.

Which AI capability is being used?

A. Recommendation system
B. OCR
C. Regression
D. Object detection


Correct Answer

A. Recommendation system


Explanation

Recommendation systems suggest products, movies, music, or other items based on user behavior and preferences.


Why the Other Answers Are Incorrect

B. OCR

OCR extracts text from images.

C. Regression

Regression predicts numeric values.

D. Object detection

Object detection identifies objects in images.


Question 9

Which type of AI model would MOST likely be used for language translation?

A. NLP model
B. Clustering model
C. Regression model
D. Computer vision model


Correct Answer

A. NLP model


Explanation

Natural Language Processing (NLP) models are designed to process and understand human language, including translation tasks.


Why the Other Answers Are Incorrect

B. Clustering model

Clustering groups similar items.

C. Regression model

Regression predicts numeric outputs.

D. Computer vision model

Computer vision analyzes images and video.


Question 10

Which statement BEST describes the difference between classification and regression models?

A. Classification predicts categories, while regression predicts numeric values
B. Classification uses images, while regression uses text only
C. Regression groups data, while classification predicts prices
D. Regression and classification are identical


Correct Answer

A. Classification predicts categories, while regression predicts numeric values


Explanation

Classification models predict labels or categories, while regression models predict continuous numeric values.


Why the Other Answers Are Incorrect

B. Classification uses images, while regression uses text only

Both models can work with many data types.

C. Regression groups data, while classification predicts prices

Grouping data is clustering, not regression.

D. Regression and classification are identical

They solve different types of problems.


Final Thoughts

Understanding AI model capabilities is a critical foundational skill for the AI-901 certification exam. Microsoft expects candidates to recognize which AI model types are appropriate for different business scenarios and understand the strengths of common AI approaches.

Knowing how to match business problems to the correct AI capabilities is essential for designing effective AI solutions on Azure and beyond.


Go to the AI-901 Exam Prep Hub main page

Identify scenarios for common AI workloads, Including Generative and Agentic AI, Text Analysis, Speech, Computer Vision, and Information Extraction (AI-901 Exam Prep)

This post is a part of the AI-901: Microsoft Azure AI Fundamentals Exam Prep Hub. 
This topic falls under these sections:
Identify AI concepts and capabilities (40–45%)
--> Identify AI workloads
--> Identify scenarios for common AI workloads, Including Generative and Agentic AI, Text Analysis, Speech, Computer Vision, and Information Extraction


Note that there are 10 practice questions (with answers and explanations) for each section to help you solidify your knowledge of the material. Also, there are 2 practice tests with 60 questions each available on the hub below the exam topics section.

Understanding common AI workloads is one of the foundational concepts in artificial intelligence and a major focus area of the AI-901 certification exam. Microsoft expects candidates to recognize different types of AI workloads and identify appropriate real-world scenarios for each.

This topic falls under the “Identify AI workloads” section of the exam objectives.


What Is an AI Workload?

An AI workload is a category of AI tasks designed to solve a particular type of problem.

Different workloads specialize in processing different types of data such as:

  • Text
  • Speech
  • Images
  • Documents
  • Audio
  • Video

Understanding AI workloads helps organizations choose the correct AI technologies for business solutions.


Major AI Workloads for AI-901

For the AI-901 exam, you should understand these common AI workloads:

  • Generative AI
  • Agentic AI
  • Text analysis
  • Speech AI
  • Computer vision
  • Information extraction

Generative AI

Generative AI creates new content based on patterns learned from training data.

Common Outputs

  • Text
  • Images
  • Audio
  • Video
  • Code

Common Scenarios

  • AI chatbots
  • Content creation
  • Email drafting
  • Code generation
  • Image generation
  • Text summarization

Example

A marketing team uses AI to generate product descriptions automatically.


Large Language Models (LLMs)

Many generative AI systems use Large Language Models (LLMs).

LLMs are trained on massive text datasets and can:

  • Answer questions
  • Summarize content
  • Generate text
  • Translate languages
  • Assist with coding

Example

An AI assistant generates meeting summaries from conversation transcripts.


Agentic AI

Agentic AI refers to AI systems that can autonomously plan, reason, and take actions to accomplish goals.

Agentic AI systems may:

  • Make decisions
  • Perform multi-step tasks
  • Use tools
  • Interact with applications
  • Adapt based on feedback

Unlike simple chatbots, agentic AI systems can perform actions and workflows.


Agentic AI Scenarios

Examples

  • AI travel planning assistants
  • Autonomous customer support agents
  • AI workflow automation systems
  • AI research assistants
  • Scheduling assistants

Example

An AI assistant receives a request to schedule a meeting, checks calendars, sends invitations, and updates schedules automatically.


Text Analysis

Text analysis is an AI workload focused on understanding and processing written language.

Text analysis is part of Natural Language Processing (NLP).

Common Capabilities

  • Sentiment analysis
  • Key phrase extraction
  • Language detection
  • Named entity recognition
  • Text classification

Sentiment Analysis

Sentiment analysis identifies emotional tone in text.

Example Scenarios

  • Product review analysis
  • Social media monitoring
  • Customer feedback analysis

Example

An organization analyzes customer reviews to determine whether feedback is positive or negative.


Key Phrase Extraction

Key phrase extraction identifies important terms or phrases in text.

Example Scenarios

  • Document summarization
  • Search indexing
  • Topic identification

Example

An AI system extracts important keywords from support tickets.


Language Detection

Language detection identifies the language used in text.

Example Scenarios

  • Multilingual applications
  • Translation routing
  • Global customer support

Example

A website detects whether incoming text is English, Spanish, or French.


Named Entity Recognition (NER)

NER identifies important entities in text such as:

  • People
  • Organizations
  • Locations
  • Dates

Example

An AI system extracts company names and locations from contracts.


Speech AI

Speech AI works with spoken language and audio.

Common Capabilities

  • Speech-to-text
  • Text-to-speech
  • Speech translation
  • Speaker recognition

Speech-to-Text

Speech-to-text converts spoken audio into written text.

Example Scenarios

  • Voice transcription
  • Meeting captions
  • Voice assistants

Example

A meeting platform generates live captions during conferences.


Text-to-Speech

Text-to-speech converts written text into spoken audio.

Example Scenarios

  • Accessibility tools
  • Virtual assistants
  • Audiobooks
  • Navigation systems

Example

A navigation app reads driving directions aloud.


Speech Translation

Speech translation converts spoken language into another language.

Example Scenarios

  • International meetings
  • Travel applications
  • Multilingual support systems

Example

A conference tool translates spoken English into Spanish in real time.


Computer Vision

Computer vision enables AI systems to analyze images and video.

Common Capabilities

  • Image classification
  • Object detection
  • Facial recognition
  • OCR
  • Image tagging

Image Classification

Image classification identifies the contents of an image.

Example Scenarios

  • Medical image analysis
  • Product categorization
  • Wildlife monitoring

Example

An AI system identifies whether an image contains a cat or a dog.


Object Detection

Object detection identifies and locates objects within an image.

Example Scenarios

  • Traffic monitoring
  • Security surveillance
  • Manufacturing inspection

Example

A self-driving car detects pedestrians and vehicles.


Optical Character Recognition (OCR)

OCR extracts text from images or scanned documents.

Example Scenarios

  • Invoice processing
  • Form digitization
  • Receipt scanning

Example

An AI system extracts totals and dates from receipts.


Facial Recognition

Facial recognition identifies or verifies people using facial features.

Example Scenarios

  • Building access systems
  • Smartphone authentication
  • Security systems

Example

A mobile phone unlocks using facial recognition.


Information Extraction

Information extraction identifies and retrieves structured information from unstructured content.

This workload often combines:

  • OCR
  • NLP
  • Document analysis

Information Extraction Scenarios

Examples

  • Invoice processing
  • Contract analysis
  • Insurance claims processing
  • Healthcare form processing

Example

An AI system extracts invoice numbers, dates, and totals from scanned invoices automatically.


Structured vs. Unstructured Data

AI workloads often process unstructured data.

Structured DataUnstructured Data
TablesDocuments
DatabasesImages
SpreadsheetsAudio
Defined formatsVideos

Many AI workloads specialize in converting unstructured data into structured information.


Choosing the Correct AI Workload

Understanding the business problem helps determine the correct AI workload.

ScenarioAppropriate Workload
Generate contentGenerative AI
Perform autonomous tasksAgentic AI
Analyze written reviewsText analysis
Convert speech to textSpeech AI
Analyze imagesComputer vision
Extract data from formsInformation extraction

Real-World Examples


Scenario 1: Customer Support Chatbot

Goal

Answer customer questions naturally.

Appropriate Workload

Generative AI


Scenario 2: AI Scheduling Assistant

Goal

Manage appointments automatically.

Appropriate Workload

Agentic AI


Scenario 3: Review Analysis System

Goal

Determine customer sentiment.

Appropriate Workload

Text analysis


Scenario 4: Live Meeting Captions

Goal

Convert speech into text in real time.

Appropriate Workload

Speech AI


Scenario 5: Self-Driving Vehicle

Goal

Detect objects and surroundings.

Appropriate Workload

Computer vision


Scenario 6: Invoice Data Extraction

Goal

Extract invoice information automatically.

Appropriate Workload

Information extraction


Azure AI Services for Common Workloads

Microsoft Azure AI Services provide prebuilt tools for many AI workloads, including:

  • Azure AI Language
  • Azure AI Speech
  • Azure AI Vision
  • Azure AI Document Intelligence
  • Azure OpenAI Service

These services help organizations build AI solutions without creating models from scratch.


Responsible AI Considerations

All AI workloads should follow Responsible AI principles, including:

  • Fairness
  • Privacy
  • Transparency
  • Reliability
  • Inclusiveness
  • Accountability

Organizations should ensure AI systems are used ethically and safely.


Important AI-901 Exam Tips

For the exam, remember these key points:

  • Generative AI creates new content.
  • Agentic AI can autonomously perform tasks and workflows.
  • Text analysis processes written language.
  • Speech AI works with spoken language and audio.
  • Computer vision processes images and video.
  • OCR extracts text from images.
  • Information extraction converts unstructured data into structured information.
  • Sentiment analysis determines emotional tone in text.
  • Named Entity Recognition identifies important entities in text.

Quick Knowledge Check

Question 1

Which AI workload is best for generating marketing content?

Answer

Generative AI.


Question 2

Which AI workload converts spoken language into written text?

Answer

Speech AI.


Question 3

What does OCR do?

Answer

Extracts text from images or scanned documents.


Question 4

Which workload is designed to autonomously complete tasks and workflows?

Answer

Agentic AI.


Practice Exam Questions

Question 1

A company wants an AI system that can automatically generate marketing emails and product descriptions.

Which AI workload is MOST appropriate?

A. Computer vision
B. Generative AI
C. OCR
D. Regression analysis


Correct Answer

B. Generative AI


Explanation

Generative AI creates new content such as text, images, audio, and code based on learned patterns.


Why the Other Answers Are Incorrect

A. Computer vision

Computer vision analyzes images and video.

C. OCR

OCR extracts text from images.

D. Regression analysis

Regression predicts numeric values.


Question 2

An organization wants an AI assistant that can schedule meetings, send invitations, and update calendars automatically.

Which AI workload BEST fits this scenario?

A. Speech AI
B. Agentic AI
C. Clustering
D. OCR


Correct Answer

B. Agentic AI


Explanation

Agentic AI systems can autonomously perform multi-step tasks, make decisions, and interact with tools or applications.


Why the Other Answers Are Incorrect

A. Speech AI

Speech AI processes spoken language.

C. Clustering

Clustering groups similar data.

D. OCR

OCR extracts text from images.


Question 3

Which AI workload is MOST appropriate for determining whether customer reviews are positive or negative?

A. Sentiment analysis
B. Object detection
C. Regression
D. Facial recognition


Correct Answer

A. Sentiment analysis


Explanation

Sentiment analysis is a text analysis capability that identifies emotional tone in written text.


Why the Other Answers Are Incorrect

B. Object detection

Object detection identifies objects in images.

C. Regression

Regression predicts numeric values.

D. Facial recognition

Facial recognition analyzes faces in images or video.


Question 4

A company needs to convert spoken customer service calls into written transcripts.

Which AI workload should be used?

A. Computer vision
B. Speech-to-text
C. OCR
D. Recommendation system


Correct Answer

B. Speech-to-text


Explanation

Speech-to-text converts spoken audio into written text.


Why the Other Answers Are Incorrect

A. Computer vision

Computer vision processes images and video.

C. OCR

OCR extracts text from images, not audio.

D. Recommendation system

Recommendation systems suggest items to users.


Question 5

Which AI workload is MOST appropriate for identifying objects such as cars and pedestrians in traffic camera footage?

A. Text analysis
B. Object detection
C. Speech translation
D. Key phrase extraction


Correct Answer

B. Object detection


Explanation

Object detection identifies and locates objects within images or video.


Why the Other Answers Are Incorrect

A. Text analysis

Text analysis processes written language.

C. Speech translation

Speech translation converts spoken language between languages.

D. Key phrase extraction

Key phrase extraction identifies important terms in text.


Question 6

What is the PRIMARY purpose of OCR?

A. Translating spoken language
B. Extracting text from images or scanned documents
C. Detecting emotions in speech
D. Generating new images


Correct Answer

B. Extracting text from images or scanned documents


Explanation

Optical Character Recognition (OCR) converts printed or handwritten text in images into machine-readable text.


Why the Other Answers Are Incorrect

A. Translating spoken language

This is speech translation.

C. Detecting emotions in speech

This is speech or sentiment analysis.

D. Generating new images

This is a generative AI capability.


Question 7

Which workload is MOST associated with analyzing and processing human language?

A. Natural Language Processing (NLP)
B. Computer vision
C. Regression
D. Clustering


Correct Answer

A. Natural Language Processing (NLP)


Explanation

NLP focuses on understanding, analyzing, and generating human language.


Why the Other Answers Are Incorrect

B. Computer vision

Computer vision works with images and video.

C. Regression

Regression predicts numeric values.

D. Clustering

Clustering groups similar items.


Question 8

A business wants to automatically extract invoice numbers, totals, and dates from scanned invoices.

Which AI workload is MOST appropriate?

A. Recommendation system
B. Information extraction
C. Speech recognition
D. Regression


Correct Answer

B. Information extraction


Explanation

Information extraction retrieves structured information from unstructured documents and often combines OCR and NLP technologies.


Why the Other Answers Are Incorrect

A. Recommendation system

Recommendation systems suggest items.

C. Speech recognition

Speech recognition processes audio.

D. Regression

Regression predicts numbers rather than extracting document data.


Question 9

Which scenario BEST represents a computer vision workload?

A. Translating English text into Spanish
B. Detecting defects on a manufacturing assembly line using cameras
C. Summarizing documents automatically
D. Predicting monthly sales revenue


Correct Answer

B. Detecting defects on a manufacturing assembly line using cameras


Explanation

Computer vision systems analyze visual content such as images and video to identify objects, defects, and patterns.


Why the Other Answers Are Incorrect

A. Translating English text into Spanish

This is an NLP task.

C. Summarizing documents automatically

This is a generative AI or NLP task.

D. Predicting monthly sales revenue

This is a regression task.


Question 10

Which statement BEST describes agentic AI?

A. AI systems that only classify images
B. AI systems that autonomously perform tasks and make decisions
C. AI systems that store relational databases
D. AI systems that only process audio recordings


Correct Answer

B. AI systems that autonomously perform tasks and make decisions


Explanation

Agentic AI systems can reason, plan, interact with tools, and complete multi-step workflows with limited human intervention.


Why the Other Answers Are Incorrect

A. AI systems that only classify images

This describes computer vision tasks.

C. AI systems that store relational databases

Databases are not AI workloads.

D. AI systems that only process audio recordings

Speech AI handles audio processing, not autonomous task execution.


Final Thoughts

Understanding common AI workloads is essential for the AI-901 certification exam and for designing effective AI solutions. Microsoft expects candidates to recognize how different AI technologies solve different business problems and when each workload is most appropriate.

These foundational concepts help build a strong understanding of modern AI systems and Azure AI services.


Go to the AI-901 Exam Prep Hub main page

Describe common Text Analysis techniques, including Keyword Extraction, Entity Detection, Sentiment Analysis, and Summarization (AI-901 Exam Prep)

This post is a part of the AI-901: Microsoft Azure AI Fundamentals Exam Prep Hub. 
This topic falls under these sections:
Identify AI concepts and capabilities (40–45%)
--> Identify AI workloads
--> Describe common Text Analysis techniques, including Keyword Extraction, Entity Detection, Sentiment Analysis, and Summarization


Note that there are 10 practice questions (with answers and explanations) for each section to help you solidify your knowledge of the material. Also, there are 2 practice tests with 60 questions each available on the hub below the exam topics section.

Text analysis is one of the most common and important AI workloads covered in the AI-901 certification exam. Microsoft expects candidates to understand how AI systems analyze and interpret written language using Natural Language Processing (NLP) techniques.

This topic falls under the “Identify AI workloads” section of the AI-901 exam objectives.


What Is Text Analysis?

Text analysis is an AI workload that uses Natural Language Processing (NLP) to analyze, interpret, and extract meaning from written text.

Text analysis helps organizations process large amounts of unstructured textual data automatically.


Common Sources of Text Data

Organizations analyze text from many sources, including:

  • Emails
  • Customer reviews
  • Social media posts
  • Chat messages
  • Support tickets
  • Surveys
  • Documents
  • Articles

What Is Natural Language Processing (NLP)?

Natural Language Processing (NLP) is a branch of AI focused on helping computers understand and work with human language.

NLP combines:

  • Machine learning
  • Linguistics
  • Statistical analysis
  • Deep learning

NLP enables systems to interpret meaning, emotion, intent, and context within text.


Common Text Analysis Techniques

For the AI-901 exam, important text analysis techniques include:

  • Keyword extraction
  • Entity detection
  • Sentiment analysis
  • Summarization

Additional related techniques include:

  • Language detection
  • Translation
  • Text classification

Keyword Extraction

Keyword extraction identifies the most important words or phrases within text.

The goal is to determine the primary topics or themes.


How Keyword Extraction Works

AI systems analyze text and identify terms that appear most significant based on:

  • Frequency
  • Relevance
  • Context
  • Relationships to other words

Keyword Extraction Examples

Input Text

“The customer was very satisfied with the fast delivery and excellent product quality.”

Extracted Keywords

  • customer
  • fast delivery
  • product quality

Common Use Cases for Keyword Extraction

Search Optimization

Improve document indexing and search engines.

Document Categorization

Identify major document topics automatically.

Customer Feedback Analysis

Detect common issues or themes.

Content Tagging

Automatically assign tags to articles or documents.


Entity Detection

Entity detection identifies important entities mentioned within text.

This technique is often called Named Entity Recognition (NER).


Common Entity Types

AI systems may identify:

  • People
  • Organizations
  • Locations
  • Dates
  • Phone numbers
  • Email addresses
  • Products
  • Currency amounts

Entity Detection Example

Input Text

“Microsoft announced a conference in Seattle on June 15.”

Detected Entities

  • Microsoft → Organization
  • Seattle → Location
  • June 15 → Date

Common Use Cases for Entity Detection

Document Processing

Extract important business information from contracts or forms.

Compliance Monitoring

Identify sensitive information.

Customer Relationship Management

Track companies, customers, or products mentioned in communications.

Search and Analytics

Improve document filtering and organization.


Sentiment Analysis

Sentiment analysis identifies emotional tone or opinion within text.

It determines whether text expresses:

  • Positive sentiment
  • Negative sentiment
  • Neutral sentiment

How Sentiment Analysis Works

AI models analyze words, phrases, and context to estimate emotional tone.

Example Positive Words

  • Excellent
  • Great
  • Amazing

Example Negative Words

  • Poor
  • Terrible
  • Frustrating

Context is important because words can have different meanings depending on usage.


Sentiment Analysis Example

Input Text

“The product quality was excellent, but shipping was slow.”

Possible Sentiment Results

  • Product quality → Positive
  • Shipping experience → Negative

Some systems provide:

  • Overall sentiment
  • Sentence-level sentiment
  • Confidence scores

Common Use Cases for Sentiment Analysis

Customer Feedback Monitoring

Analyze reviews and surveys.

Brand Monitoring

Track public opinion on social media.

Customer Service Improvement

Identify dissatisfied customers.

Market Research

Understand consumer opinions.


Summarization

Summarization creates shorter versions of longer text while preserving key information.

AI summarization helps users quickly understand large amounts of information.


Types of Summarization

Extractive Summarization

Extractive summarization selects important sentences directly from the original text.


Abstractive Summarization

Abstractive summarization generates new sentences that summarize the meaning of the text.

This approach is more similar to how humans summarize information.


Summarization Example

Original Text

“The company reported increased sales this quarter due to strong online demand and improved supply chain performance.”

Summary

“The company experienced increased sales driven by online demand.”


Common Use Cases for Summarization

Meeting Summaries

Condense meeting transcripts.

News Summaries

Provide quick article overviews.

Customer Support

Summarize long support conversations.

Research Assistance

Condense lengthy documents or reports.


Language Detection

Language detection identifies the language used in text.

Example

An AI system determines whether text is:

  • English
  • Spanish
  • French
  • German

Common Use Cases

  • Multilingual applications
  • Translation routing
  • International customer support

Text Classification

Text classification assigns categories or labels to text.

Examples

  • Spam detection
  • Topic categorization
  • Support ticket routing

Real-World Examples


Scenario 1: Customer Review Analysis

Goal

Understand customer opinions.

Techniques Used

  • Sentiment analysis
  • Keyword extraction

Scenario 2: Legal Contract Processing

Goal

Identify important contract information.

Techniques Used

  • Entity detection
  • Summarization

Scenario 3: News Aggregation Platform

Goal

Provide short summaries of articles.

Techniques Used

  • Summarization
  • Keyword extraction

Scenario 4: Customer Support Ticket System

Goal

Automatically categorize and prioritize tickets.

Techniques Used

  • Text classification
  • Sentiment analysis

Azure AI Language Services

Azure AI Language Services provide prebuilt NLP capabilities such as:

  • Sentiment analysis
  • Entity recognition
  • Summarization
  • Language detection
  • Key phrase extraction

These services help developers add text analysis features without building models from scratch.


Structured vs. Unstructured Text Data

Text analysis commonly processes unstructured data.

Structured DataUnstructured Data
DatabasesEmails
TablesDocuments
SpreadsheetsSocial media posts
Defined fieldsReviews

AI systems help convert unstructured text into usable structured information.


Responsible AI Considerations

Organizations using text analysis should consider:

  • Privacy
  • Bias
  • Transparency
  • Security
  • Accuracy
  • Responsible handling of personal data

Text analysis systems may process sensitive information and should be designed carefully.


Important AI-901 Exam Tips

For the exam, remember these key points:

  • Keyword extraction identifies important terms or phrases.
  • Entity detection identifies items such as people, places, organizations, and dates.
  • Sentiment analysis determines emotional tone.
  • Summarization creates shorter versions of text.
  • NLP enables computers to process human language.
  • OCR extracts text from images but is different from text analysis.
  • Summarization may be extractive or abstractive.
  • Text classification assigns categories to text.

Quick Knowledge Check

Question 1

Which text analysis technique identifies emotional tone?

Answer

Sentiment analysis.


Question 2

What does Named Entity Recognition (NER) identify?

Answer

Entities such as people, organizations, locations, and dates.


Question 3

What is the purpose of keyword extraction?

Answer

To identify important words or phrases in text.


Question 4

What does summarization do?

Answer

Creates shorter versions of longer text while preserving key information.


Practice Exam Questions

Question 1

Which text analysis technique identifies the emotional tone of written text?

A. OCR
B. Sentiment analysis
C. Object detection
D. Regression


Correct Answer

B. Sentiment analysis


Explanation

Sentiment analysis determines whether text expresses positive, negative, or neutral emotions or opinions.


Why the Other Answers Are Incorrect

A. OCR

OCR extracts text from images or scanned documents.

C. Object detection

Object detection identifies objects within images.

D. Regression

Regression predicts numeric values.


Question 2

A company wants to automatically identify important phrases from customer feedback forms.

Which text analysis technique is MOST appropriate?

A. Speech synthesis
B. Keyword extraction
C. Facial recognition
D. Image classification


Correct Answer

B. Keyword extraction


Explanation

Keyword extraction identifies the most important words or phrases within text.


Why the Other Answers Are Incorrect

A. Speech synthesis

Speech synthesis converts text into spoken audio.

C. Facial recognition

Facial recognition analyzes faces in images.

D. Image classification

Image classification categorizes images.


Question 3

What is the PRIMARY purpose of Named Entity Recognition (NER)?

A. Predicting future sales
B. Identifying important entities such as people, organizations, and locations in text
C. Translating languages automatically
D. Detecting objects in images


Correct Answer

B. Identifying important entities such as people, organizations, and locations in text


Explanation

NER extracts structured information from text by identifying entities like names, places, dates, and organizations.


Why the Other Answers Are Incorrect

A. Predicting future sales

This is typically a regression task.

C. Translating languages automatically

Translation is a separate NLP capability.

D. Detecting objects in images

This is a computer vision task.


Question 4

Which AI capability creates a shorter version of a document while preserving key information?

A. OCR
B. Summarization
C. Clustering
D. Object detection


Correct Answer

B. Summarization


Explanation

Summarization condenses long text into shorter, meaningful summaries.


Why the Other Answers Are Incorrect

A. OCR

OCR extracts text from images.

C. Clustering

Clustering groups similar data.

D. Object detection

Object detection identifies items within images.


Question 5

A business analyzes product reviews to determine whether customers are satisfied or dissatisfied.

Which AI technique is being used?

A. Sentiment analysis
B. Recommendation system
C. OCR
D. Regression


Correct Answer

A. Sentiment analysis


Explanation

Sentiment analysis evaluates emotional tone and opinions expressed in text.


Why the Other Answers Are Incorrect

B. Recommendation system

Recommendation systems suggest products or content.

C. OCR

OCR extracts text from images.

D. Regression

Regression predicts numeric outcomes.


Question 6

Which statement BEST describes keyword extraction?

A. It converts speech into text
B. It identifies important words or phrases in text
C. It translates text between languages
D. It predicts future trends


Correct Answer

B. It identifies important words or phrases in text


Explanation

Keyword extraction helps determine the main topics or themes within text documents.


Why the Other Answers Are Incorrect

A. It converts speech into text

This is speech recognition.

C. It translates text between languages

This is machine translation.

D. It predicts future trends

This is unrelated to keyword extraction.


Question 7

Which text analysis technique would MOST likely identify “Microsoft” as an organization and “Seattle” as a location?

A. Entity detection
B. Sentiment analysis
C. Speech recognition
D. Image segmentation


Correct Answer

A. Entity detection


Explanation

Entity detection (NER) identifies named entities such as organizations, locations, dates, and people within text.


Why the Other Answers Are Incorrect

B. Sentiment analysis

Sentiment analysis evaluates emotional tone.

C. Speech recognition

Speech recognition processes audio.

D. Image segmentation

Image segmentation is a computer vision task.


Question 8

What is the difference between extractive and abstractive summarization?

A. Extractive summarization uses images, while abstractive summarization uses text
B. Extractive summarization selects sentences from the original text, while abstractive summarization generates new summary wording
C. Extractive summarization only works with speech
D. There is no difference


Correct Answer

B. Extractive summarization selects sentences from the original text, while abstractive summarization generates new summary wording


Explanation

Extractive summarization pulls existing sentences directly from text, while abstractive summarization creates newly generated summaries.


Why the Other Answers Are Incorrect

A. Extractive summarization uses images, while abstractive summarization uses text

Both methods work with text.

C. Extractive summarization only works with speech

Summarization is generally text-based.

D. There is no difference

The two methods are different approaches.


Question 9

Which AI workload category includes keyword extraction, sentiment analysis, and summarization?

A. Computer vision
B. Text analysis
C. Robotics
D. Regression analysis


Correct Answer

B. Text analysis


Explanation

These techniques are part of Natural Language Processing (NLP) and text analysis workloads.


Why the Other Answers Are Incorrect

A. Computer vision

Computer vision focuses on images and video.

C. Robotics

Robotics involves physical machines and automation.

D. Regression analysis

Regression predicts numeric values.


Question 10

A company wants to process thousands of support tickets and automatically identify the most common customer complaints.

Which AI technique would be MOST useful?

A. Object detection
B. Keyword extraction
C. Facial recognition
D. Speech synthesis


Correct Answer

B. Keyword extraction


Explanation

Keyword extraction identifies recurring important phrases and themes within large collections of text.


Why the Other Answers Are Incorrect

A. Object detection

Object detection analyzes images.

C. Facial recognition

Facial recognition identifies people in images or video.

D. Speech synthesis

Speech synthesis converts text into audio.


Final Thoughts

Text analysis is a foundational AI workload and an important topic for the AI-901 certification exam. Microsoft expects candidates to understand common NLP techniques and recognize real-world scenarios where text analysis provides value.

These capabilities help organizations transform large volumes of unstructured text into actionable insights using Azure AI technologies.


Go to the AI-901 Exam Prep Hub main page

Identify features and capabilities of Speech Recognition and Speech Synthesis (AI-901 Exam Prep)

This post is a part of the AI-901: Microsoft Azure AI Fundamentals Exam Prep Hub. 
This topic falls under these sections:
Identify AI concepts and capabilities (40–45%)
--> Identify AI workloads
--> Identify features and capabilities of Speech Recognition and Speech Synthesis


Note that there are 10 practice questions (with answers and explanations) for each section to help you solidify your knowledge of the material. Also, there are 2 practice tests with 60 questions each available on the hub below the exam topics section.

AI-901: Microsoft Azure AI Fundamentals (beta)

Speech AI is one of the major AI workloads covered in the AI-901 certification exam. Microsoft expects candidates to understand how AI systems process spoken language using technologies such as speech recognition and speech synthesis.

These capabilities allow computers to listen to, understand, and generate human speech, enabling more natural human-computer interaction.

This topic falls under the “Identify AI workloads” section of the AI-901 exam objectives.


What Is Speech AI?

Speech AI refers to AI technologies that process spoken language and audio.

Speech AI enables systems to:

  • Recognize spoken words
  • Convert speech into text
  • Generate spoken responses
  • Translate spoken language
  • Identify speakers

Speech technologies are commonly used in modern AI assistants and accessibility tools.


Major Speech AI Capabilities

For the AI-901 exam, important speech AI capabilities include:

  • Speech recognition
  • Speech synthesis
  • Speech translation
  • Speaker recognition

The primary focus of this topic is speech recognition and speech synthesis.


What Is Speech Recognition?

Speech recognition converts spoken language into written text.

It is often called:

  • Speech-to-text
  • Automatic Speech Recognition (ASR)

Speech recognition allows computers to “listen” to human speech and interpret it as text.


How Speech Recognition Works

Speech recognition systems typically perform these steps:

  1. Capture audio input
  2. Analyze sound patterns
  3. Identify spoken words
  4. Convert speech into text output

Modern speech recognition systems often use:

  • Machine learning
  • Deep learning
  • Neural networks
  • Large speech datasets

Speech Recognition Example

Spoken Input

“Schedule a meeting for tomorrow at 2 PM.”

Text Output

Schedule a meeting for tomorrow at 2 PM.


Common Features of Speech Recognition

Speech recognition systems may support:

  • Real-time transcription
  • Multiple languages
  • Noise reduction
  • Speaker identification
  • Continuous speech recognition
  • Command recognition

Real-Time Transcription

Real-time transcription converts speech into text immediately as someone speaks.

Common Use Cases

  • Live captions
  • Meeting transcription
  • Accessibility tools

Example

A video conferencing platform generates live subtitles during meetings.


Continuous Speech Recognition

Continuous speech recognition processes natural conversation without requiring pauses between words.

Example

Voice assistants processing full spoken sentences naturally.


Command Recognition

Some speech systems focus on recognizing specific spoken commands.

Example Commands

  • “Play music”
  • “Turn on the lights”
  • “Call John”

These systems are commonly used in smart devices.


Noise Reduction

Speech recognition systems often include noise filtering capabilities.

This helps improve accuracy in noisy environments.

Example

Recognizing speech in a crowded airport.


Multilingual Speech Recognition

Many modern speech systems support multiple languages and accents.

Example

An AI assistant understanding English, Spanish, and French speakers.


Common Use Cases for Speech Recognition


Virtual Assistants

Examples include voice-controlled assistants that answer questions or perform actions.

Example

A user asks a smart speaker about the weather.


Accessibility Solutions

Speech recognition helps users who cannot type easily.

Example

Voice dictation software for users with disabilities.


Meeting Transcription

Organizations convert meetings into searchable text records.

Example

Automatic meeting notes.


Customer Service Systems

Interactive voice response (IVR) systems process spoken customer requests.

Example

A phone system asks customers to describe their issue verbally.


Hands-Free Applications

Speech recognition supports hands-free operation.

Example

Voice-controlled navigation while driving.


What Is Speech Synthesis?

Speech synthesis converts written text into spoken audio.

It is often called:

  • Text-to-speech (TTS)

Speech synthesis allows computers to “speak” naturally to users.


How Speech Synthesis Works

Speech synthesis systems:

  1. Receive text input
  2. Analyze words and pronunciation
  3. Generate spoken audio output

Modern systems use AI-generated voices that sound increasingly human-like.


Speech Synthesis Example

Text Input

“Your appointment is scheduled for Monday at 10 AM.”

Spoken Output

The system reads the message aloud.


Features of Speech Synthesis

Speech synthesis systems may support:

  • Natural-sounding voices
  • Multiple languages
  • Adjustable speaking speed
  • Voice customization
  • Emotional tone control

Natural Neural Voices

Modern AI systems use neural text-to-speech technology to create more human-like speech.

Benefits include:

  • Improved pronunciation
  • Better intonation
  • More natural rhythm

Voice Customization

Some systems allow organizations to customize voices.

Example

A company creates a branded AI voice for customer support systems.


Adjustable Speech Settings

Speech synthesis systems may allow changes to:

  • Speed
  • Pitch
  • Volume
  • Pronunciation

Common Use Cases for Speech Synthesis


Accessibility Tools

Text-to-speech helps visually impaired users consume written content.

Example

Screen readers reading web pages aloud.


Navigation Systems

GPS applications provide spoken directions.

Example

A navigation app announcing upcoming turns.


Virtual Assistants

AI assistants respond using synthesized speech.

Example

A smart assistant answers spoken questions aloud.


Customer Service Bots

Automated phone systems communicate using AI-generated voices.

Example

A banking system reads account information to customers.


Audiobooks and Learning

Speech synthesis converts written content into audio.

Example

Educational content read aloud automatically.


Speech Translation

Speech translation combines:

  • Speech recognition
  • Language translation
  • Speech synthesis

Example

A conference tool translates spoken English into spoken Spanish.


Speaker Recognition

Speaker recognition identifies or verifies individuals based on voice characteristics.

Types

  • Speaker identification
  • Speaker verification

Example

Voice-based authentication systems.


Challenges in Speech AI

Speech AI systems may face challenges such as:

  • Background noise
  • Strong accents
  • Multiple simultaneous speakers
  • Poor audio quality
  • Specialized vocabulary

Responsible AI Considerations

Speech AI systems should be designed responsibly.

Important considerations include:

  • Privacy
  • Consent
  • Security
  • Accessibility
  • Bias reduction
  • Transparency

Voice data may contain sensitive personal information.


Azure AI Speech Services

Azure AI Speech Services provide cloud-based speech AI capabilities including:

  • Speech-to-text
  • Text-to-speech
  • Speech translation
  • Speaker recognition

These services help developers integrate speech AI into applications without building models from scratch.


Speech Recognition vs. Speech Synthesis

CapabilityDescription
Speech RecognitionConverts speech into text
Speech SynthesisConverts text into spoken audio

Real-World Examples


Scenario 1: Live Meeting Captions

Goal

Convert spoken conversations into text.

Capability Used

Speech recognition


Scenario 2: GPS Navigation App

Goal

Read directions aloud.

Capability Used

Speech synthesis


Scenario 3: Voice-Controlled Smart Home

Goal

Understand spoken commands and respond verbally.

Capabilities Used

  • Speech recognition
  • Speech synthesis

Scenario 4: Audiobook Generator

Goal

Convert books into spoken audio.

Capability Used

Speech synthesis


Important AI-901 Exam Tips

For the exam, remember these key points:

  • Speech recognition converts speech into text.
  • Speech synthesis converts text into spoken audio.
  • Speech-to-text is another term for speech recognition.
  • Text-to-speech is another term for speech synthesis.
  • Real-time transcription supports live captions.
  • Neural voices produce more natural speech.
  • Speech translation combines multiple speech technologies.
  • Speaker recognition identifies individuals using voice characteristics.
  • Speech AI is commonly used in assistants, accessibility tools, and customer service systems.

Quick Knowledge Check

Question 1

What does speech recognition do?

Answer

Converts spoken language into written text.


Question 2

What does speech synthesis do?

Answer

Converts text into spoken audio.


Question 3

What is another name for speech synthesis?

Answer

Text-to-speech (TTS).


Question 4

Which speech capability is used for live meeting captions?

Answer

Speech recognition.


Practice Exam Questions

Question 1

What is the PRIMARY function of speech recognition?

A. Converting images into text
B. Converting spoken language into written text
C. Generating images from prompts
D. Translating text into code


Correct Answer

B. Converting spoken language into written text


Explanation

Speech recognition, also called speech-to-text, converts spoken audio into written text.


Why the Other Answers Are Incorrect

A. Converting images into text

This is OCR functionality.

C. Generating images from prompts

This is a generative AI capability.

D. Translating text into code

This is unrelated to speech recognition.


Question 2

Which capability converts written text into spoken audio?

A. OCR
B. Speech synthesis
C. Object detection
D. Clustering


Correct Answer

B. Speech synthesis


Explanation

Speech synthesis, also called text-to-speech (TTS), generates spoken audio from text.


Why the Other Answers Are Incorrect

A. OCR

OCR extracts text from images.

C. Object detection

Object detection identifies objects in images.

D. Clustering

Clustering groups similar data.


Question 3

A company wants to generate live subtitles during online meetings.

Which AI capability should be used?

A. Speech recognition
B. Speech synthesis
C. Facial recognition
D. Image segmentation


Correct Answer

A. Speech recognition


Explanation

Speech recognition converts spoken conversations into text in real time, enabling live captions and subtitles.


Why the Other Answers Are Incorrect

B. Speech synthesis

Speech synthesis creates spoken audio from text.

C. Facial recognition

Facial recognition analyzes faces in images.

D. Image segmentation

Image segmentation is a computer vision task.


Question 4

What is another common name for speech synthesis?

A. Object detection
B. Text-to-speech
C. Speech-to-text
D. Named Entity Recognition


Correct Answer

B. Text-to-speech


Explanation

Speech synthesis is commonly referred to as text-to-speech (TTS).


Why the Other Answers Are Incorrect

A. Object detection

Object detection identifies objects in images.

C. Speech-to-text

Speech-to-text refers to speech recognition.

D. Named Entity Recognition

NER identifies entities in text.


Question 5

Which scenario BEST demonstrates speech synthesis?

A. A chatbot reading answers aloud to users
B. A camera identifying vehicles on a road
C. A system categorizing customer emails
D. A database sorting sales records


Correct Answer

A. A chatbot reading answers aloud to users


Explanation

Speech synthesis converts text responses into spoken audio for users.


Why the Other Answers Are Incorrect

B. A camera identifying vehicles on a road

This is computer vision.

C. A system categorizing customer emails

This is text classification.

D. A database sorting sales records

This is not a speech AI task.


Question 6

Which feature helps speech recognition systems perform better in noisy environments?

A. Image enhancement
B. Noise reduction
C. OCR optimization
D. Regression tuning


Correct Answer

B. Noise reduction


Explanation

Noise reduction filters background sounds to improve speech recognition accuracy.


Why the Other Answers Are Incorrect

A. Image enhancement

Image enhancement relates to visual processing.

C. OCR optimization

OCR works with images and text extraction.

D. Regression tuning

Regression is unrelated to speech audio processing.


Question 7

A navigation application reads driving directions aloud to users.

Which AI capability is being used?

A. Sentiment analysis
B. Speech synthesis
C. Object detection
D. Language detection


Correct Answer

B. Speech synthesis


Explanation

Speech synthesis converts written navigation instructions into spoken audio.


Why the Other Answers Are Incorrect

A. Sentiment analysis

Sentiment analysis evaluates emotional tone in text.

C. Object detection

Object detection analyzes images.

D. Language detection

Language detection identifies languages in text.


Question 8

Which statement BEST describes speech translation?

A. It converts images into searchable text
B. It combines speech recognition, translation, and speech synthesis
C. It identifies objects in audio recordings
D. It predicts future speech patterns


Correct Answer

B. It combines speech recognition, translation, and speech synthesis


Explanation

Speech translation systems convert spoken language into another language and often generate translated spoken output.


Why the Other Answers Are Incorrect

A. It converts images into searchable text

This is OCR.

C. It identifies objects in audio recordings

This is not a standard speech AI capability.

D. It predicts future speech patterns

This is unrelated to translation systems.


Question 9

What is the PRIMARY purpose of speaker recognition?

A. Generating synthetic voices
B. Identifying or verifying individuals using voice characteristics
C. Translating speech into multiple languages
D. Extracting keywords from documents


Correct Answer

B. Identifying or verifying individuals using voice characteristics


Explanation

Speaker recognition systems use voice patterns to identify or authenticate users.


Why the Other Answers Are Incorrect

A. Generating synthetic voices

This is speech synthesis.

C. Translating speech into multiple languages

This is speech translation.

D. Extracting keywords from documents

This is keyword extraction.


Question 10

Which pair correctly matches the capability with its function?

A. Speech recognition → Converts text into speech
B. Speech synthesis → Converts speech into text
C. Speech recognition → Converts speech into text
D. OCR → Generates spoken audio


Correct Answer

C. Speech recognition → Converts speech into text


Explanation

Speech recognition converts spoken language into written text, while speech synthesis converts text into spoken audio.


Why the Other Answers Are Incorrect

A. Speech recognition → Converts text into speech

This describes speech synthesis.

B. Speech synthesis → Converts speech into text

This describes speech recognition.

D. OCR → Generates spoken audio

OCR extracts text from images.


Final Thoughts

Speech AI technologies are essential components of modern AI systems and are an important topic for the AI-901 certification exam. Microsoft expects candidates to understand how speech recognition and speech synthesis work, along with common business scenarios where these technologies are applied.

These capabilities help organizations build more natural, accessible, and interactive AI-powered experiences using Azure AI services.


Go to the AI-901 Exam Prep Hub main page

Identify features and capabilities of Computer Vision and Image-Generation models (AI-901 Exam Prep)

This post is a part of the AI-901: Microsoft Azure AI Fundamentals Exam Prep Hub. 
This topic falls under these sections:
Identify AI concepts and capabilities (40–45%)
--> Identify AI workloads
--> Identify features and capabilities of Computer Vision and Image-Generation models


Note that there are 10 practice questions (with answers and explanations) for each section to help you solidify your knowledge of the material. Also, there are 2 practice tests with 60 questions each available on the hub below the exam topics section.

Computer vision and image-generation AI models are important AI workloads covered in the AI-901 certification exam. Microsoft expects candidates to understand how AI systems analyze visual information and generate new images using machine learning and deep learning technologies.

These AI capabilities are widely used in healthcare, manufacturing, security, retail, entertainment, accessibility, and many other industries.

This topic falls under the “Identify AI workloads” section of the AI-901 exam objectives.


What Is Computer Vision?

Computer vision is an AI workload that enables computers to analyze and interpret images and video.

Computer vision systems attempt to simulate human visual understanding.

These systems can:

  • Identify objects
  • Detect faces
  • Read text
  • Analyze scenes
  • Track movement
  • Recognize patterns

How Computer Vision Works

Computer vision models are typically trained using large collections of labeled images.

The models learn patterns such as:

  • Shapes
  • Colors
  • Textures
  • Edges
  • Spatial relationships

Modern computer vision systems commonly use:

  • Deep learning
  • Neural networks
  • Convolutional Neural Networks (CNNs)

Common Computer Vision Capabilities

For the AI-901 exam, important computer vision capabilities include:

  • Image classification
  • Object detection
  • Facial recognition
  • Optical Character Recognition (OCR)
  • Image analysis
  • Image tagging

Image Classification

Image classification identifies the primary subject or category of an image.

The model assigns labels to entire images.


Image Classification Example

Input

An image of a dog.

Output

“Dog”


Common Use Cases for Image Classification

Medical Imaging

Classifying medical scans.

Retail

Categorizing products automatically.

Agriculture

Identifying plant diseases.

Wildlife Monitoring

Recognizing animal species.


Object Detection

Object detection identifies and locates multiple objects within an image.

Unlike image classification, object detection can identify several objects and their positions.


Object Detection Example

Input

Street traffic image.

Output

  • Car
  • Pedestrian
  • Traffic light

with location boundaries around each object.


Common Use Cases for Object Detection

Autonomous Vehicles

Detecting vehicles and pedestrians.

Manufacturing

Identifying defective products.

Security Systems

Detecting unauthorized activity.

Retail Analytics

Monitoring customer movement in stores.


Facial Recognition

Facial recognition identifies or verifies individuals using facial features.


Common Facial Recognition Capabilities

Face Detection

Determines whether faces exist in an image.

Face Verification

Confirms whether two faces belong to the same person.

Face Identification

Identifies a person from a database of known individuals.


Common Use Cases for Facial Recognition

Smartphone Authentication

Unlocking phones using facial recognition.

Building Security

Controlling physical access.

Attendance Systems

Tracking employee attendance.

Airport Security

Identity verification systems.


Optical Character Recognition (OCR)

OCR extracts text from images, scanned documents, or photographs.

OCR converts visual text into machine-readable text.


OCR Example

Input

A scanned invoice image.

Output

Extracted text including:

  • Invoice number
  • Dates
  • Totals

Common OCR Use Cases

Invoice Processing

Automating financial workflows.

Document Digitization

Converting paper documents into searchable digital text.

Receipt Scanning

Extracting purchase information.

Accessibility

Reading text aloud for visually impaired users.


Image Tagging and Image Analysis

Image analysis systems can automatically generate descriptions or tags for images.


Example Tags

An image may receive tags such as:

  • Beach
  • Ocean
  • Sunset
  • Person

Common Use Cases

Photo Organization

Automatically categorizing photos.

Content Moderation

Identifying inappropriate images.

Search Optimization

Improving image search systems.


Video Analysis

Computer vision can also process video streams.

Common Video Analysis Tasks

  • Motion detection
  • Activity recognition
  • Traffic monitoring
  • Surveillance analysis

What Are Image-Generation Models?

Image-generation models create new images using AI.

These models learn visual patterns from training data and generate entirely new content.

Image-generation AI is part of generative AI.


How Image-Generation Models Work

Image-generation systems are trained on large image datasets.

The models learn relationships between:

  • Objects
  • Colors
  • Styles
  • Shapes
  • Text descriptions

Many systems use deep learning architectures such as:

  • Diffusion models
  • Generative Adversarial Networks (GANs)

Text-to-Image Generation

Text-to-image models generate images from written prompts.


Example

Prompt

“A futuristic city at sunset”

Output

An AI-generated image matching the description.


Common Use Cases for Image Generation

Marketing and Advertising

Creating promotional graphics.

Entertainment and Gaming

Generating concept art.

Design Assistance

Creating mockups or creative inspiration.

Education

Generating visual learning content.

Accessibility

Creating visual representations from text descriptions.


Image Editing and Enhancement

Some AI models can edit or enhance existing images.


Common Capabilities

  • Background removal
  • Image restoration
  • Colorization
  • Resolution enhancement
  • Style transfer

Deepfakes and Synthetic Media

AI-generated images and videos can create highly realistic synthetic content.

This technology can be useful but also creates ethical concerns.


Responsible AI Considerations

Computer vision and image-generation systems raise important Responsible AI considerations.

Organizations should consider:

  • Privacy
  • Consent
  • Bias
  • Security
  • Transparency
  • Misuse prevention

Bias in Vision Models

Computer vision systems may perform differently across demographic groups if training data is unbalanced.

Example risks include:

  • Facial recognition inaccuracies
  • Biased image classification
  • Unequal detection accuracy

Ethical Concerns with Image Generation

Potential concerns include:

  • Deepfakes
  • Misinformation
  • Copyright concerns
  • Identity misuse
  • Harmful content generation

Organizations should implement safeguards and moderation systems.


Azure AI Vision Services

Azure AI Vision Services provide prebuilt computer vision capabilities including:

  • Image analysis
  • OCR
  • Face detection
  • Object detection
  • Video analysis

Azure OpenAI and Image Generation

Azure OpenAI Service supports generative AI capabilities, including image-generation models.

These services help organizations build AI-powered creative applications.


Computer Vision vs. Image Generation

CapabilityPurpose
Computer VisionAnalyze and understand images
Image GenerationCreate new images

Real-World Examples


Scenario 1: Self-Driving Car

Goal

Detect vehicles and pedestrians.

Capability Used

Object detection


Scenario 2: Receipt Scanning App

Goal

Extract text from receipts.

Capability Used

OCR


Scenario 3: Social Media Photo Organization

Goal

Automatically tag uploaded photos.

Capability Used

Image analysis and tagging


Scenario 4: AI Art Generator

Goal

Create artwork from text prompts.

Capability Used

Image generation


Scenario 5: Smartphone Face Unlock

Goal

Verify user identity.

Capability Used

Facial recognition


Important AI-901 Exam Tips

For the exam, remember these key points:

  • Computer vision analyzes images and video.
  • Image classification labels entire images.
  • Object detection identifies and locates objects.
  • OCR extracts text from images.
  • Facial recognition identifies or verifies individuals.
  • Image-generation models create new images.
  • Text-to-image systems generate visuals from prompts.
  • Computer vision and generative AI are different workloads.
  • Responsible AI principles are important in vision systems.

Quick Knowledge Check

Question 1

What is the purpose of OCR?

Answer

To extract text from images or scanned documents.


Question 2

What is the difference between image classification and object detection?

Answer

Image classification labels an entire image, while object detection identifies and locates multiple objects within an image.


Question 3

What do image-generation models do?

Answer

They create new images using AI.


Question 4

Which AI capability is commonly used for smartphone face unlock?

Answer

Facial recognition.


Practice Exam Questions

Question 1

What is the PRIMARY purpose of computer vision?

A. Converting speech into text
B. Analyzing and understanding images and video
C. Predicting stock prices
D. Generating database queries


Correct Answer

B. Analyzing and understanding images and video


Explanation

Computer vision enables AI systems to interpret and analyze visual content such as images and video.


Why the Other Answers Are Incorrect

A. Converting speech into text

This is speech recognition.

C. Predicting stock prices

This is typically a regression task.

D. Generating database queries

This is unrelated to computer vision.


Question 2

Which computer vision capability identifies the main subject or category of an image?

A. OCR
B. Image classification
C. Speech synthesis
D. Clustering


Correct Answer

B. Image classification


Explanation

Image classification assigns labels or categories to entire images.


Why the Other Answers Are Incorrect

A. OCR

OCR extracts text from images.

C. Speech synthesis

Speech synthesis converts text into spoken audio.

D. Clustering

Clustering groups similar data.


Question 3

A self-driving car needs to identify pedestrians, traffic signs, and vehicles in real time.

Which AI capability is MOST appropriate?

A. Sentiment analysis
B. Object detection
C. Keyword extraction
D. Language detection


Correct Answer

B. Object detection


Explanation

Object detection identifies and locates multiple objects within images or video streams.


Why the Other Answers Are Incorrect

A. Sentiment analysis

Sentiment analysis evaluates emotional tone in text.

C. Keyword extraction

Keyword extraction identifies important phrases in text.

D. Language detection

Language detection identifies written languages.


Question 4

What is the PRIMARY purpose of Optical Character Recognition (OCR)?

A. Translating speech between languages
B. Extracting text from images or scanned documents
C. Detecting faces in photographs
D. Generating new artwork


Correct Answer

B. Extracting text from images or scanned documents


Explanation

OCR converts text within images into machine-readable text.


Why the Other Answers Are Incorrect

A. Translating speech between languages

This is speech translation.

C. Detecting faces in photographs

This is facial recognition or face detection.

D. Generating new artwork

This is an image-generation capability.


Question 5

Which AI capability is commonly used for smartphone face unlock features?

A. Facial recognition
B. Speech recognition
C. Regression
D. Text summarization


Correct Answer

A. Facial recognition


Explanation

Facial recognition systems identify or verify users using facial features.


Why the Other Answers Are Incorrect

B. Speech recognition

Speech recognition processes spoken language.

C. Regression

Regression predicts numeric values.

D. Text summarization

Summarization condenses text.


Question 6

What is the PRIMARY function of image-generation models?

A. Extracting text from images
B. Creating new images using AI
C. Detecting network intrusions
D. Translating written languages


Correct Answer

B. Creating new images using AI


Explanation

Image-generation models produce new visual content based on learned patterns and prompts.


Why the Other Answers Are Incorrect

A. Extracting text from images

This is OCR.

C. Detecting network intrusions

This is unrelated to image generation.

D. Translating written languages

This is an NLP capability.


Question 7

Which example BEST represents a text-to-image generation system?

A. A chatbot answering questions
B. An AI model creating artwork from a written prompt
C. A speech recognition application
D. A recommendation engine


Correct Answer

B. An AI model creating artwork from a written prompt


Explanation

Text-to-image systems generate images based on textual descriptions.


Why the Other Answers Are Incorrect

A. A chatbot answering questions

This is generative text AI.

C. A speech recognition application

Speech recognition converts speech into text.

D. A recommendation engine

Recommendation systems suggest products or content.


Question 8

What is the key difference between image classification and object detection?

A. Image classification processes audio while object detection processes video
B. Image classification labels an entire image, while object detection identifies and locates multiple objects
C. Object detection only works with text
D. There is no difference


Correct Answer

B. Image classification labels an entire image, while object detection identifies and locates multiple objects


Explanation

Image classification provides a label for an entire image, while object detection identifies multiple objects and their locations.


Why the Other Answers Are Incorrect

A. Image classification processes audio while object detection processes video

Both work with visual data.

C. Object detection only works with text

Object detection works with images and video.

D. There is no difference

These are distinct computer vision tasks.


Question 9

Which Responsible AI concern is MOST associated with image-generation systems?

A. Deepfakes and synthetic media misuse
B. Spreadsheet formatting errors
C. SQL indexing problems
D. Network bandwidth allocation


Correct Answer

A. Deepfakes and synthetic media misuse


Explanation

Image-generation AI can create highly realistic synthetic content, raising concerns about misinformation and misuse.


Why the Other Answers Are Incorrect

B. Spreadsheet formatting errors

This is unrelated to AI image generation.

C. SQL indexing problems

This is a database issue.

D. Network bandwidth allocation

This is unrelated to Responsible AI concerns.


Question 10

A retailer wants to automatically categorize product photos into categories such as shoes, shirts, and electronics.

Which AI capability is MOST appropriate?

A. Image classification
B. OCR
C. Speech synthesis
D. Sentiment analysis


Correct Answer

A. Image classification


Explanation

Image classification assigns category labels to images based on visual content.


Why the Other Answers Are Incorrect

B. OCR

OCR extracts text from images.

C. Speech synthesis

Speech synthesis generates spoken audio.

D. Sentiment analysis

Sentiment analysis evaluates emotional tone in text.


Final Thoughts

Computer vision and image-generation AI models are essential components of modern AI systems and important topics for the AI-901 certification exam. Microsoft expects candidates to understand how AI systems analyze visual information and generate new content, along with common business scenarios where these technologies are applied.

These capabilities help organizations build intelligent visual applications using Azure AI services and generative AI technologies.


Go to the AI-901 Exam Prep Hub main page

Identify techniques to extract information from text, images, audio, and videos (AI-901 Exam Prep)

This post is a part of the AI-901: Microsoft Azure AI Fundamentals Exam Prep Hub. 
This topic falls under these sections:
Identify AI concepts and capabilities (40–45%)
--> Identify AI workloads
--> Identify techniques to extract information from text, images, audio, and videos


Note that there are 10 practice questions (with answers and explanations) for each section to help you solidify your knowledge of the material. Also, there are 2 practice tests with 60 questions each available on the hub below the exam topics section.

Information extraction is one of the most valuable uses of AI and an important topic for the AI-901 certification exam. Organizations generate enormous amounts of unstructured data every day, including documents, emails, images, audio recordings, and videos. AI systems help convert this unstructured data into structured, usable information.

Microsoft expects AI-901 candidates to understand common techniques used to extract information from text, images, audio, and video content.

This topic falls under the “Identify AI workloads” section of the AI-901 exam objectives.


What Is Information Extraction?

Information extraction is the process of identifying and retrieving useful structured information from unstructured or semi-structured data.

AI systems analyze content and extract meaningful data automatically.


Examples of Information Extraction

SourceExtracted Information
DocumentsNames, dates, invoice totals
EmailsCustomer requests, keywords
ImagesObjects, faces, text
AudioSpoken words, speaker identity
VideoActivities, objects, movement

Structured vs. Unstructured Data

Understanding structured and unstructured data is important for this topic.

Structured DataUnstructured Data
TablesEmails
DatabasesImages
SpreadsheetsAudio
Defined formatsVideos
Organized fieldsDocuments

AI techniques help transform unstructured data into structured information.


Information Extraction from Text

AI systems commonly use Natural Language Processing (NLP) to extract information from text.


Common Text Extraction Techniques

For the AI-901 exam, important techniques include:

  • Keyword extraction
  • Named Entity Recognition (NER)
  • Sentiment analysis
  • Summarization
  • Language detection
  • Text classification

Keyword Extraction

Keyword extraction identifies important words or phrases within text.

Example

Extracting phrases like:

  • “shipping delay”
  • “billing issue”
  • “customer satisfaction”

from support tickets.


Named Entity Recognition (NER)

NER identifies entities such as:

  • People
  • Organizations
  • Locations
  • Dates
  • Phone numbers
  • Products

Example

Input

“Microsoft will host an event in Seattle on June 15.”

Extracted Entities

  • Microsoft → Organization
  • Seattle → Location
  • June 15 → Date

Sentiment Analysis

Sentiment analysis identifies emotional tone within text.

Possible Results

  • Positive
  • Negative
  • Neutral

Example

Analyzing customer reviews to determine satisfaction levels.


Summarization

Summarization creates shorter versions of long text.

Example

Generating meeting summaries from lengthy transcripts.


Text Classification

Text classification assigns categories to text.

Example

Automatically labeling emails as:

  • Support
  • Sales
  • Billing

Information Extraction from Images

Computer vision techniques extract information from images.


Common Image Extraction Techniques

Important techniques include:

  • OCR
  • Image classification
  • Object detection
  • Facial recognition
  • Image tagging

Optical Character Recognition (OCR)

OCR extracts text from images and scanned documents.


OCR Example

Input

Scanned invoice image.

Extracted Information

  • Invoice number
  • Total amount
  • Vendor name
  • Dates

Common OCR Use Cases

  • Receipt scanning
  • Invoice processing
  • Document digitization
  • Form extraction

Image Classification

Image classification identifies the overall category of an image.

Example

Identifying whether an image contains:

  • A dog
  • A car
  • A building

Object Detection

Object detection identifies and locates multiple objects within images.

Example

Detecting:

  • Cars
  • Pedestrians
  • Traffic lights

in a street image.


Facial Recognition

Facial recognition identifies or verifies people based on facial features.

Example

Smartphone face unlock systems.


Image Tagging

Image tagging automatically generates descriptive labels.

Example Tags

  • Beach
  • Sunset
  • Ocean
  • Person

Information Extraction from Audio

Speech AI technologies extract information from spoken audio.


Common Audio Extraction Techniques

Important techniques include:

  • Speech recognition
  • Speaker recognition
  • Sentiment analysis in speech
  • Speech translation

Speech Recognition

Speech recognition converts spoken language into text.

Also called:

  • Speech-to-text
  • Automatic Speech Recognition (ASR)

Example

Audio Input

A recorded meeting.

Extracted Information

A written transcript.


Speaker Recognition

Speaker recognition identifies or verifies speakers based on voice characteristics.

Example

Voice authentication systems.


Speech Sentiment Analysis

Some AI systems analyze vocal tone and emotion.

Example

Detecting frustration during customer service calls.


Speech Translation

Speech translation converts spoken language into another language.

Example

Real-time multilingual meeting translation.


Information Extraction from Video

Video analysis combines computer vision and audio processing techniques.


Common Video Extraction Techniques

Important techniques include:

  • Motion detection
  • Object tracking
  • Activity recognition
  • Scene analysis
  • Video transcription

Motion Detection

Motion detection identifies movement within video footage.

Example

Security surveillance systems detecting activity.


Object Tracking

Object tracking follows identified objects across video frames.

Example

Tracking vehicles in traffic monitoring systems.


Activity Recognition

Activity recognition identifies actions occurring in video.

Example

Detecting:

  • Running
  • Falling
  • Fighting
  • Driving

Scene Analysis

Scene analysis identifies environments or contexts in video.

Example

Recognizing:

  • Office scenes
  • Outdoor settings
  • Crowded areas

Video Transcription

Video transcription converts spoken content in videos into text.

Example

Generating subtitles for videos automatically.


Multimodal AI

Some AI systems combine multiple data types together.

This is called multimodal AI.


Example of Multimodal AI

A meeting assistant may process:

  • Audio
  • Video
  • Text chat
  • Shared documents

simultaneously.


Real-World Information Extraction Scenarios


Scenario 1: Invoice Processing System

Goal

Extract invoice information automatically.

Techniques Used

  • OCR
  • Entity extraction

Scenario 2: Customer Support Analysis

Goal

Analyze customer complaints.

Techniques Used

  • Sentiment analysis
  • Keyword extraction

Scenario 3: Smart Security Camera

Goal

Detect suspicious activity.

Techniques Used

  • Object detection
  • Motion detection
  • Facial recognition

Scenario 4: Meeting Intelligence Platform

Goal

Generate searchable meeting notes.

Techniques Used

  • Speech recognition
  • Summarization
  • Speaker recognition

Scenario 5: Video Streaming Platform

Goal

Generate subtitles automatically.

Techniques Used

  • Speech recognition
  • Video transcription

Azure AI Services for Information Extraction

Azure AI Services provide tools for extracting information from multiple data types.

Common services include:

  • Azure AI Language
  • Azure AI Speech
  • Azure AI Vision
  • Azure AI Document Intelligence

These services allow organizations to build AI solutions without training models from scratch.


Responsible AI Considerations

Information extraction systems should follow Responsible AI principles.

Important considerations include:

  • Privacy
  • Consent
  • Data security
  • Transparency
  • Bias reduction
  • Compliance

Sensitive personal information may be present in extracted data.


Challenges in Information Extraction

AI systems may face challenges such as:

  • Poor image quality
  • Background noise
  • Ambiguous language
  • Multiple speakers
  • Handwritten text
  • Video quality issues

Performance depends heavily on data quality.


Important AI-901 Exam Tips

For the exam, remember these key points:

  • NLP extracts information from text.
  • OCR extracts text from images.
  • Speech recognition converts speech into text.
  • Object detection identifies and locates objects in images or video.
  • Video analysis can detect activities and movement.
  • Information extraction converts unstructured data into structured information.
  • Multimodal AI combines multiple data types.
  • Azure AI services provide prebuilt information extraction capabilities.

Quick Knowledge Check

Question 1

Which technique extracts text from scanned documents?

Answer

OCR.


Question 2

What does speech recognition do?

Answer

Converts spoken language into text.


Question 3

Which technique identifies objects within images?

Answer

Object detection.


Question 4

What is multimodal AI?

Answer

AI systems that process multiple types of data together, such as text, audio, and images.


Practice Exam Questions

Question 1

Which AI technique is used to extract text from scanned documents or images?

A. Sentiment analysis
B. Optical Character Recognition (OCR)
C. Object detection
D. Speech synthesis


Correct Answer

B. Optical Character Recognition (OCR)


Explanation

OCR extracts machine-readable text from images, scanned documents, and photographs.


Why the Other Answers Are Incorrect

A. Sentiment analysis

Sentiment analysis identifies emotional tone in text.

C. Object detection

Object detection identifies objects within images.

D. Speech synthesis

Speech synthesis converts text into spoken audio.


Question 2

A company wants to convert recorded customer support calls into written transcripts.

Which AI capability should be used?

A. Speech recognition
B. Facial recognition
C. Image classification
D. Regression


Correct Answer

A. Speech recognition


Explanation

Speech recognition converts spoken language into written text.


Why the Other Answers Are Incorrect

B. Facial recognition

Facial recognition analyzes faces in images.

C. Image classification

Image classification categorizes images.

D. Regression

Regression predicts numeric values.


Question 3

Which AI technique identifies and locates multiple objects within an image?

A. OCR
B. Object detection
C. Summarization
D. Clustering


Correct Answer

B. Object detection


Explanation

Object detection identifies objects and their positions within images or video frames.


Why the Other Answers Are Incorrect

A. OCR

OCR extracts text from images.

C. Summarization

Summarization condenses text.

D. Clustering

Clustering groups similar data points.


Question 4

A business wants to automatically determine whether customer reviews are positive or negative.

Which AI technique is MOST appropriate?

A. Sentiment analysis
B. OCR
C. Facial recognition
D. Image tagging


Correct Answer

A. Sentiment analysis


Explanation

Sentiment analysis evaluates emotional tone and opinions in text.


Why the Other Answers Are Incorrect

B. OCR

OCR extracts text from images.

C. Facial recognition

Facial recognition identifies people from images.

D. Image tagging

Image tagging labels image content.


Question 5

Which AI capability is commonly used to identify names, locations, and organizations within text?

A. Named Entity Recognition (NER)
B. Speech synthesis
C. Object tracking
D. Regression analysis


Correct Answer

A. Named Entity Recognition (NER)


Explanation

NER extracts entities such as people, organizations, dates, and locations from text.


Why the Other Answers Are Incorrect

B. Speech synthesis

Speech synthesis generates spoken audio.

C. Object tracking

Object tracking follows objects in video.

D. Regression analysis

Regression predicts numeric values.


Question 6

A smart security camera tracks moving vehicles across multiple video frames.

Which AI technique is being used?

A. Text classification
B. Object tracking
C. Summarization
D. Speech translation


Correct Answer

B. Object tracking


Explanation

Object tracking follows identified objects as they move through video footage.


Why the Other Answers Are Incorrect

A. Text classification

Text classification categorizes written text.

C. Summarization

Summarization condenses text.

D. Speech translation

Speech translation converts spoken language between languages.


Question 7

Which term describes AI systems that process multiple data types such as text, images, and audio together?

A. Regression AI
B. Multimodal AI
C. Clustering AI
D. Rule-based AI


Correct Answer

B. Multimodal AI


Explanation

Multimodal AI combines and processes multiple forms of data simultaneously.


Why the Other Answers Are Incorrect

A. Regression AI

Regression predicts numeric values.

C. Clustering AI

Clustering groups similar items.

D. Rule-based AI

Rule-based systems follow predefined logic rules.


Question 8

Which AI capability would MOST likely be used to generate automatic subtitles for videos?

A. Speech recognition
B. Image classification
C. Facial recognition
D. Recommendation systems


Correct Answer

A. Speech recognition


Explanation

Speech recognition converts spoken words in videos into text subtitles.


Why the Other Answers Are Incorrect

B. Image classification

Image classification categorizes images.

C. Facial recognition

Facial recognition identifies people in images.

D. Recommendation systems

Recommendation systems suggest content or products.


Question 9

A retailer wants AI to automatically identify products such as shoes, shirts, and electronics in uploaded images.

Which AI capability should be used?

A. Object detection
B. Sentiment analysis
C. Speech synthesis
D. Language translation


Correct Answer

A. Object detection


Explanation

Object detection identifies multiple objects within images and can locate them visually.


Why the Other Answers Are Incorrect

B. Sentiment analysis

Sentiment analysis evaluates text emotion.

C. Speech synthesis

Speech synthesis converts text into speech.

D. Language translation

Language translation converts text or speech between languages.


Question 10

What is the PRIMARY goal of information extraction AI systems?

A. Creating video games
B. Converting unstructured data into useful structured information
C. Compressing database files
D. Replacing all human decision-making


Correct Answer

B. Converting unstructured data into useful structured information


Explanation

Information extraction systems analyze unstructured content such as text, images, audio, and video to retrieve meaningful structured data.


Why the Other Answers Are Incorrect

A. Creating video games

This is unrelated to information extraction.

C. Compressing database files

This is a storage task, not AI extraction.

D. Replacing all human decision-making

AI systems are designed to assist and augment human processes, not completely replace all decision-making.


Final Thoughts

Information extraction is one of the most practical and widely used AI workloads covered in the AI-901 certification exam. Microsoft expects candidates to understand how AI systems extract useful insights from text, images, audio, and videos using NLP, speech AI, computer vision, and multimodal AI technologies.

These capabilities help organizations automate workflows, analyze large volumes of data, and build intelligent applications using Azure AI services.


Go to the AI-901 Exam Prep Hub main page

Identify appropriate model deployment options and configuration parameters (AI-901 Exam Prep)

This post is a part of the AI-901: Microsoft Azure AI Fundamentals Exam Prep Hub. 
This topic falls under these sections:
Identify AI concepts and capabilities (40–45%)
--> Identify AI model components and configurations
--> Identify appropriate model deployment options and configuration parameters


Note that there are 10 practice questions (with answers and explanations) for each section to help you solidify your knowledge of the material. Also, there are 2 practice tests with 60 questions each available on the hub below the exam topics section.

Deploying AI models effectively is an important part of building real-world AI solutions and a key topic for the AI-901 certification exam. Microsoft expects candidates to understand common deployment options, model hosting approaches, and basic configuration parameters used in AI systems.

This topic falls under the “Identify AI model components and configurations” section of the exam objectives.


What Is AI Model Deployment?

Model deployment is the process of making a trained AI model available for real-world use.

After a model is trained and tested, it must be deployed so applications and users can interact with it.

Examples

  • A chatbot answering customer questions
  • A fraud detection model analyzing transactions
  • An image recognition system processing uploaded photos
  • A recommendation engine suggesting products

Deployment connects the AI model to users and applications.


Common AI Model Deployment Options

AI models can be deployed in different environments depending on business needs.

Common deployment options include:

  • Cloud deployment
  • Edge deployment
  • On-premises deployment
  • Containerized deployment
  • Real-time inference
  • Batch inference

Cloud Deployment

Cloud deployment hosts AI models in cloud platforms such as Microsoft Azure.

Benefits

  • Scalability
  • High availability
  • Managed infrastructure
  • Easier updates
  • Flexible resource allocation

Common Use Cases

  • Web applications
  • Chatbots
  • APIs
  • Enterprise AI services

Example

A customer support chatbot hosted in Azure and accessed through a website.


Edge Deployment

Edge deployment runs AI models on local devices near the data source.

Examples of Edge Devices

  • Smartphones
  • IoT devices
  • Cameras
  • Manufacturing equipment
  • Vehicles

Benefits

  • Reduced latency
  • Offline operation
  • Faster response times
  • Reduced bandwidth usage

Example

A factory camera performing real-time defect detection directly on the device.


On-Premises Deployment

On-premises deployment hosts AI models within an organization’s own data center.

Benefits

  • Greater control over data
  • Compliance support
  • Internal network security
  • Reduced external data sharing

Common Use Cases

  • Highly regulated industries
  • Sensitive data environments

Example

A hospital deploying AI systems within its internal infrastructure for patient privacy reasons.


Containerized Deployment

Containers package AI models and their dependencies into portable units.

Common container technologies include:

  • Docker
  • Kubernetes

Benefits

  • Portability
  • Consistent environments
  • Easier scaling
  • Simplified deployment

Example

Deploying an AI API inside a Docker container across multiple servers.


Real-Time Inference

Real-time inference provides immediate AI predictions or responses.

Characteristics

  • Low latency
  • Fast responses
  • Interactive applications

Example Use Cases

  • Chatbots
  • Fraud detection during transactions
  • Live recommendation systems
  • Voice assistants

Example

A chatbot generating responses instantly during a conversation.


Batch Inference

Batch inference processes large amounts of data at scheduled intervals.

Characteristics

  • High-volume processing
  • Non-interactive
  • Scheduled operations

Example Use Cases

  • Overnight report generation
  • Bulk image processing
  • Customer segmentation updates

Example

A retailer analyzing all sales data nightly to update recommendations.


APIs and Endpoints

Deployed AI models are often accessed through APIs (Application Programming Interfaces).

An endpoint is a network location where applications send requests to the AI model.

Example

A mobile app sends an image to an AI vision API endpoint for analysis.


Scalability

Scalability refers to the ability of a deployment to handle increasing workloads.

Cloud deployments often scale automatically based on:

  • Number of requests
  • CPU usage
  • Memory usage

Example

An AI chatbot automatically adds more computing resources during peak business hours.


Latency

Latency refers to response time.

Some applications require very low latency.

Low-Latency Examples

  • Autonomous vehicles
  • Fraud detection
  • Real-time translation
  • Voice assistants

Edge deployment is often used to reduce latency.


Availability and Reliability

AI systems should remain available and reliable.

High availability helps ensure systems continue functioning even during failures.

Common techniques include:

  • Redundant servers
  • Load balancing
  • Failover systems
  • Monitoring

Model Monitoring

After deployment, AI systems should be monitored continuously.

Monitoring helps identify:

  • Performance degradation
  • Bias
  • Security issues
  • Reliability problems
  • Model drift

Example

A fraud detection model becomes less accurate as customer behavior changes over time.


Model Drift

Model drift occurs when real-world data changes over time, causing reduced model accuracy.

Example

A recommendation system trained on older shopping trends may become less effective as customer preferences change.

Monitoring helps detect model drift.


AI Model Configuration Parameters

AI systems often include configurable settings that affect behavior and performance.

For AI-901, important parameters include:

  • Temperature
  • Max tokens
  • Top-p
  • Frequency penalty
  • Presence penalty

These are especially important for generative AI systems.


Temperature

Temperature controls randomness and creativity in generated responses.

TemperatureBehavior
LowMore predictable and focused
HighMore creative and varied

Example

A customer support chatbot may use a lower temperature for consistent answers.


Max Tokens

Max tokens controls the maximum length of generated output.

Example

A summarization system may limit responses to 200 tokens.


Top-p (Nucleus Sampling)

Top-p controls how many likely next-token choices the model considers.

Lower values create more focused responses.

Higher values allow greater variety.


Frequency Penalty

Frequency penalty reduces repeated words or phrases in generated text.

Example

Helps prevent repetitive chatbot responses.


Presence Penalty

Presence penalty encourages the model to introduce new topics or ideas.

This can increase response diversity.


Choosing Deployment Options

Selecting the correct deployment approach depends on:

RequirementPossible Deployment Choice
Low latencyEdge deployment
Large scalabilityCloud deployment
Sensitive dataOn-premises deployment
PortabilityContainers
Instant responsesReal-time inference
Large scheduled jobsBatch inference

Real-World Examples


Scenario 1: AI Chatbot

Requirements

  • Instant responses
  • Large user base
  • Internet access

Best Deployment

Cloud-based real-time deployment

Useful Parameters

  • Low temperature
  • Moderate max tokens

Scenario 2: Factory Defect Detection

Requirements

  • Very low latency
  • Works without internet

Best Deployment

Edge deployment


Scenario 3: Monthly Sales Forecasting

Requirements

  • Analyze large historical datasets
  • No immediate response needed

Best Deployment

Batch inference


Scenario 4: Healthcare AI System

Requirements

  • Strict privacy controls
  • Sensitive patient data

Best Deployment

On-premises deployment


Azure AI Deployment Options

Microsoft Azure AI Services provide multiple deployment approaches for AI solutions, including:

  • Cloud-hosted AI APIs
  • Container support
  • Edge deployment support
  • Managed AI services
  • Scalable inference endpoints

Azure simplifies deployment, scaling, and management of AI systems.


Responsible AI Considerations

When deploying AI models, organizations should also consider:

  • Security
  • Privacy
  • Reliability
  • Monitoring
  • Transparency
  • Accountability

Poor deployment practices can create operational or ethical risks.


Important AI-901 Exam Tips

For the exam, remember these key points:

  • Deployment makes AI models available for use.
  • Cloud deployment offers scalability and flexibility.
  • Edge deployment reduces latency and supports offline operation.
  • On-premises deployment provides greater internal control.
  • Real-time inference supports immediate responses.
  • Batch inference processes large datasets on schedules.
  • APIs and endpoints connect applications to AI models.
  • Model drift occurs when real-world data changes over time.
  • Temperature controls creativity in generative AI responses.
  • Max tokens controls output length.

Quick Knowledge Check

Question 1

What deployment option is best for very low-latency AI processing on local devices?

Answer

Edge deployment.


Question 2

What does temperature control in generative AI?

Answer

The randomness and creativity of generated responses.


Question 3

What is batch inference?

Answer

Processing large amounts of data at scheduled intervals rather than in real time.


Question 4

What is model drift?

Answer

Reduced model performance caused by changes in real-world data over time.


Practice Exam Questions

Question 1

A company needs an AI-powered chatbot that can instantly respond to customer questions on its website.

Which deployment type is MOST appropriate?

A. Batch inference
B. Real-time inference
C. Offline archival storage
D. Manual processing


Correct Answer

B. Real-time inference


Explanation

Real-time inference provides immediate responses and is commonly used for interactive applications such as chatbots.


Why the Other Answers Are Incorrect

A. Batch inference

Batch inference processes data on schedules rather than instantly.

C. Offline archival storage

Archival storage does not provide live AI responses.

D. Manual processing

Manual processing is not an AI deployment method.


Question 2

What is the PRIMARY benefit of edge deployment for AI models?

A. Unlimited cloud scalability
B. Reduced latency and local processing
C. Increased internet bandwidth usage
D. Automatic model retraining


Correct Answer

B. Reduced latency and local processing


Explanation

Edge deployment places AI models close to the data source, reducing response time and allowing operation even with limited internet connectivity.


Why the Other Answers Are Incorrect

A. Unlimited cloud scalability

This is more associated with cloud deployment.

C. Increased internet bandwidth usage

Edge deployment often reduces bandwidth usage.

D. Automatic model retraining

Edge deployment does not automatically retrain models.


Question 3

Which deployment option provides the MOST control over sensitive organizational data?

A. Public social media deployment
B. On-premises deployment
C. Edge gaming deployment
D. Anonymous deployment


Correct Answer

B. On-premises deployment


Explanation

On-premises deployment keeps systems and data within an organization’s internal infrastructure, supporting security and compliance needs.


Why the Other Answers Are Incorrect

A. Public social media deployment

This is not a standard deployment option.

C. Edge gaming deployment

This is not a recognized AI deployment category.

D. Anonymous deployment

This is not a deployment model.


Question 4

What does the temperature parameter control in many generative AI models?

A. The physical temperature of the servers
B. The creativity and randomness of generated responses
C. The storage capacity of the model
D. The speed of internet connections


Correct Answer

B. The creativity and randomness of generated responses


Explanation

Temperature controls how predictable or creative AI-generated outputs are.

Lower values create more focused responses, while higher values create more varied responses.


Why the Other Answers Are Incorrect

A. The physical temperature of the servers

Temperature is a model setting, not a hardware measurement.

C. The storage capacity of the model

Temperature does not affect storage.

D. The speed of internet connections

Temperature is unrelated to networking.


Question 5

A company processes millions of sales records every night to generate forecasts for the next day.

Which inference type is MOST appropriate?

A. Real-time inference
B. Batch inference
C. Edge inference
D. Interactive inference only


Correct Answer

B. Batch inference


Explanation

Batch inference is designed for large-scale scheduled processing rather than immediate responses.


Why the Other Answers Are Incorrect

A. Real-time inference

Real-time inference is intended for immediate responses.

C. Edge inference

Edge inference focuses on local device processing.

D. Interactive inference only

This is not a standard inference category.


Question 6

What is model drift?

A. A networking issue in cloud deployments
B. Reduced model performance caused by changes in real-world data over time
C. A method for encrypting AI outputs
D. A hardware failure in GPU systems


Correct Answer

B. Reduced model performance caused by changes in real-world data over time


Explanation

Model drift occurs when data patterns change after deployment, causing model accuracy to decline.


Why the Other Answers Are Incorrect

A. A networking issue in cloud deployments

Drift relates to data and performance, not networking.

C. A method for encrypting AI outputs

Drift is unrelated to encryption.

D. A hardware failure in GPU systems

Hardware failures are separate operational issues.


Question 7

Which deployment approach is MOST suitable for AI systems that must continue operating without internet access?

A. Cloud-only deployment
B. Edge deployment
C. Browser caching
D. Remote archival deployment


Correct Answer

B. Edge deployment


Explanation

Edge deployment allows AI models to run locally on devices, enabling offline functionality.


Why the Other Answers Are Incorrect

A. Cloud-only deployment

Cloud-only systems usually require internet connectivity.

C. Browser caching

Caching is not an AI deployment strategy.

D. Remote archival deployment

This is not a standard deployment model.


Question 8

What is the purpose of the max tokens parameter in generative AI?

A. To control the maximum response length
B. To encrypt generated text
C. To increase hardware memory
D. To reduce internet latency


Correct Answer

A. To control the maximum response length


Explanation

Max tokens limits how much text the model can generate in a response.


Why the Other Answers Are Incorrect

B. To encrypt generated text

Max tokens does not affect encryption.

C. To increase hardware memory

It does not change hardware capacity.

D. To reduce internet latency

It is unrelated to network speed.


Question 9

What is an AI endpoint?

A. A backup storage device
B. A network location where applications send requests to an AI model
C. A hardware cooling system
D. A type of training dataset


Correct Answer

B. A network location where applications send requests to an AI model


Explanation

Endpoints allow applications and users to interact with deployed AI models through APIs.


Why the Other Answers Are Incorrect

A. A backup storage device

Endpoints are not storage systems.

C. A hardware cooling system

Cooling systems are unrelated.

D. A type of training dataset

Endpoints are deployment interfaces.


Question 10

Which deployment option is MOST associated with automatic scalability and managed infrastructure?

A. Cloud deployment
B. Manual deployment
C. Printed deployment
D. Standalone spreadsheet deployment


Correct Answer

A. Cloud deployment


Explanation

Cloud deployment platforms such as Microsoft Azure provide scalable infrastructure and managed services for AI workloads.


Why the Other Answers Are Incorrect

B. Manual deployment

Manual deployment does not provide automatic scalability.

C. Printed deployment

This is not a valid deployment option.

D. Standalone spreadsheet deployment

Spreadsheets are not scalable AI deployment platforms.


Final Thoughts

Understanding AI deployment options and configuration parameters is an important foundational skill for the AI-901 certification exam. Microsoft expects candidates to recognize when different deployment strategies and model settings are appropriate for business and technical requirements.

These concepts help organizations deploy scalable, reliable, and effective AI solutions using Azure AI technologies.


Go to the AI-901 Exam Prep Hub main page

Describe how generative AI models work (AI-901 Exam Prep)

This post is a part of the AI-901: Microsoft Azure AI Fundamentals Exam Prep Hub. 
This topic falls under these sections:
Identify AI concepts and capabilities (40–45%)
--> Identify AI model components and configurations
--> Describe how generative AI models work


Note that there are 10 practice questions (with answers and explanations) for each section to help you solidify your knowledge of the material. Also, there are 2 practice tests with 60 questions each available on the hub below the exam topics section.

Generative AI is one of the most important and rapidly growing areas of artificial intelligence and is a major topic for the AI-901 certification exam. Microsoft includes generative AI concepts within the “Identify AI model components and configurations” section of the exam objectives.

Understanding how generative AI models work means understanding how AI systems can create new content such as text, images, audio, code, and video based on patterns learned from large datasets.


What Is Generative AI?

Generative AI refers to AI systems that can generate new content based on patterns learned from training data.

Unlike traditional AI systems that primarily classify or predict, generative AI creates original outputs.

Examples of Generated Content

  • Text
  • Images
  • Music
  • Speech
  • Code
  • Video

Example Applications

  • AI chatbots
  • Image generators
  • Code assistants
  • Content summarization
  • Translation systems
  • Virtual assistants

How Generative AI Differs from Traditional AI

Traditional AIGenerative AI
Classifies or predictsCreates new content
Detects spam emailsWrites emails
Identifies objects in imagesGenerates images
Predicts sales trendsCreates reports or summaries

Traditional AI often answers questions like:

  • “What category does this belong to?”
  • “What will likely happen next?”

Generative AI answers questions like:

  • “Create something new.”
  • “Generate content based on this prompt.”

Foundation Models

Many generative AI systems are built using foundation models.

A foundation model is a very large AI model trained on massive amounts of data that can be adapted for many tasks.

Foundation models learn general patterns in:

  • Language
  • Images
  • Audio
  • Code
  • Knowledge relationships

These models can then be specialized or prompted for different use cases.


Large Language Models (LLMs)

Large Language Models (LLMs) are a type of generative AI model focused on understanding and generating human language.

Examples include systems used for:

  • Chatbots
  • Writing assistants
  • Summarization
  • Translation
  • Question answering
  • Code generation

LLMs are trained using enormous collections of text data from books, articles, websites, and other sources.


How Large Language Models Work

At a high level, LLMs work by predicting the most likely next word or token in a sequence.

Example

If the model sees:

“The sky is…”

It may predict:

“blue”

By repeatedly predicting the next token, the model can generate sentences, paragraphs, and conversations.


Tokens in Generative AI

Generative AI models process information as tokens.

Tokens are small units of text, which may represent:

  • Words
  • Parts of words
  • Characters
  • Punctuation

Example

The sentence:

“AI is powerful”

might be broken into tokens such as:

  • “AI”
  • “is”
  • “powerful”

The model predicts tokens one at a time to generate output.


Neural Networks and Deep Learning

Generative AI models are built using deep learning neural networks.

Neural networks are systems inspired by the structure of the human brain.

These networks contain many layers that learn patterns from data.

Generative AI models often contain:

  • Millions
  • Billions
  • Or even trillions of parameters

Parameters are internal values learned during training that help the model recognize relationships and patterns.


Transformers

Most modern generative AI systems use a neural network architecture called the Transformer.

Transformers are highly effective for processing sequences such as language.

Transformers help models:

  • Understand context
  • Recognize relationships between words
  • Handle long passages of text
  • Generate coherent responses

The Transformer architecture is a foundational technology behind many modern AI systems.


Training Generative AI Models

Training a generative AI model involves exposing it to massive datasets.

During training, the model learns patterns and relationships by repeatedly predicting missing or next tokens.

Simplified Training Process

  1. Provide training data
  2. Hide or predict portions of the data
  3. Compare predictions to actual results
  4. Adjust model parameters
  5. Repeat many times

This process may require enormous computing power and specialized hardware such as GPUs.


Pretraining and Fine-Tuning

Generative AI training often occurs in two stages.


Pretraining

The model learns general knowledge and patterns from very large datasets.

Example

An LLM may learn grammar, facts, reasoning patterns, and language structure from internet-scale text data.


Fine-Tuning

The pretrained model is then adapted for specific tasks or domains.

Example

A healthcare chatbot may be fine-tuned using medical terminology and healthcare conversations.

Fine-tuning improves performance for specialized use cases.


Prompts and Prompt Engineering

Users interact with generative AI systems using prompts.

A prompt is the input or instruction given to the model.

Examples

  • “Write a summary of this article.”
  • “Generate an image of a beach at sunset.”
  • “Explain machine learning simply.”

Prompt engineering refers to designing prompts that produce better outputs.

Well-structured prompts often improve:

  • Accuracy
  • Clarity
  • Relevance
  • Consistency

Temperature and Randomness

Generative AI systems often include configuration settings such as temperature.

Temperature controls randomness in generated responses.

TemperatureBehavior
Low temperatureMore focused and predictable responses
High temperatureMore creative and varied responses

Example

A low temperature may be used for factual responses, while a higher temperature may be used for creative writing.


Hallucinations

Generative AI models can sometimes produce incorrect or fabricated information called hallucinations.

Example

An AI chatbot may confidently provide false information or invent references.

Hallucinations occur because models generate likely patterns rather than verifying factual truth.

This is an important AI-901 exam concept.


Context Windows

Generative AI models use context windows to determine how much information they can process at one time.

The context window includes:

  • User prompts
  • Previous conversation history
  • Uploaded content
  • Instructions

Larger context windows allow models to handle longer conversations and larger documents.


Retrieval-Augmented Generation (RAG)

Some AI systems use Retrieval-Augmented Generation (RAG).

RAG combines:

  • A generative AI model
  • External knowledge retrieval

Instead of relying only on training data, the model retrieves current or domain-specific information before generating responses.

Benefits

  • More accurate responses
  • Reduced hallucinations
  • Access to updated information

Generative AI Modalities

Generative AI is not limited to text.

Different model types generate different content formats.

Model TypeOutput
Text modelsArticles, conversations, summaries
Image modelsPictures and artwork
Audio modelsSpeech and music
Video modelsVideo clips
Code modelsProgramming code

Responsible AI Considerations

Generative AI systems introduce Responsible AI concerns such as:

  • Bias
  • Hallucinations
  • Harmful content generation
  • Privacy risks
  • Copyright concerns
  • Security risks

Organizations should implement:

  • Human oversight
  • Content filtering
  • Monitoring
  • Transparency
  • Governance policies

Azure and Generative AI

Microsoft Azure AI Services and related Azure AI offerings provide tools for building and deploying generative AI applications.

Microsoft also provides Responsible AI guidance and safety controls for generative AI systems.


Real-World Example

Scenario: AI Customer Support Assistant

A company deploys a generative AI chatbot for customer support.

How It Works

  • Users enter prompts
  • The language model processes tokens
  • The transformer architecture analyzes context
  • The model predicts likely responses
  • The chatbot generates natural language answers

Additional Features

  • Fine-tuned on company documentation
  • Uses RAG to retrieve current policy information
  • Applies content filtering
  • Escalates uncertain cases to humans

This type of scenario aligns well with AI-901 exam questions.


Microsoft Responsible AI and Generative AI

Microsoft emphasizes Responsible AI practices for generative AI systems, including:

  • Fairness
  • Reliability and safety
  • Privacy and security
  • Inclusiveness
  • Transparency
  • Accountability

Generative AI systems should be designed responsibly and monitored carefully.


Important AI-901 Exam Tips

For the exam, remember these key points:

  • Generative AI creates new content rather than only classifying or predicting.
  • Large Language Models (LLMs) generate text by predicting tokens.
  • Tokens are small pieces of text processed by the model.
  • Transformers are the core architecture behind many modern generative AI systems.
  • Foundation models are large pretrained models adaptable to many tasks.
  • Fine-tuning customizes models for specific use cases.
  • Prompts guide model behavior.
  • Temperature controls response randomness.
  • Hallucinations are incorrect or fabricated outputs.
  • RAG combines retrieval systems with generative AI models.

Quick Knowledge Check

Question 1

What is the primary function of generative AI?

Answer

To create new content such as text, images, audio, or code.


Question 2

What is a token in a language model?

Answer

A small unit of text processed by the model.


Question 3

What does temperature control in generative AI?

Answer

The randomness and creativity of generated outputs.


Question 4

What is a hallucination in generative AI?

Answer

An incorrect or fabricated response generated by the model.


Practice Exam Questions

Question 1

What is the PRIMARY purpose of a generative AI model?

A. To classify data into categories only
B. To create new content based on learned patterns
C. To replace all human decision-making
D. To store database records


Correct Answer

B. To create new content based on learned patterns


Explanation

Generative AI models are designed to generate new content such as text, images, audio, code, or video using patterns learned from training data.


Why the Other Answers Are Incorrect

A. To classify data into categories only

Classification is more commonly associated with traditional predictive AI models.

C. To replace all human decision-making

AI should support, not fully replace, human decision-making.

D. To store database records

Databases store data but are not generative AI systems.


Question 2

How do Large Language Models (LLMs) primarily generate text?

A. By copying entire documents from the internet
B. By predicting the next likely token in a sequence
C. By manually selecting words from a dictionary
D. By using spreadsheet formulas


Correct Answer

B. By predicting the next likely token in a sequence


Explanation

LLMs generate text by predicting the most probable next token repeatedly until a full response is created.


Why the Other Answers Are Incorrect

A. By copying entire documents from the internet

LLMs generate responses based on learned patterns rather than simply copying content.

C. By manually selecting words from a dictionary

The process is automated using neural networks.

D. By using spreadsheet formulas

Spreadsheet formulas are unrelated to language generation.


Question 3

What is a token in a generative AI language model?

A. A hardware device used for training
B. A small unit of text processed by the model
C. A cloud storage container
D. A type of encryption key


Correct Answer

B. A small unit of text processed by the model


Explanation

Tokens are pieces of text such as words, parts of words, punctuation, or characters that language models process during training and generation.


Why the Other Answers Are Incorrect

A. A hardware device used for training

Tokens are not physical hardware.

C. A cloud storage container

Storage containers are unrelated.

D. A type of encryption key

Encryption keys are used in security systems.


Question 4

Which neural network architecture powers many modern generative AI systems?

A. Decision trees
B. Transformers
C. Linear regression
D. Rule-based engines


Correct Answer

B. Transformers


Explanation

Transformers are the core architecture behind many modern generative AI systems because they handle context and sequential data effectively.


Why the Other Answers Are Incorrect

A. Decision trees

Decision trees are traditional machine learning models.

C. Linear regression

Linear regression is used for predicting numeric values.

D. Rule-based engines

Rule-based systems do not use transformer architectures.


Question 5

What is the purpose of fine-tuning a generative AI model?

A. To physically repair damaged hardware
B. To adapt a pretrained model for a specialized task or domain
C. To permanently disable model updates
D. To reduce network bandwidth usage


Correct Answer

B. To adapt a pretrained model for a specialized task or domain


Explanation

Fine-tuning customizes a pretrained foundation model using additional domain-specific data to improve performance for particular use cases.


Why the Other Answers Are Incorrect

A. To physically repair damaged hardware

Fine-tuning is a training process, not hardware maintenance.

C. To permanently disable model updates

Fine-tuning modifies model behavior rather than disabling updates.

D. To reduce network bandwidth usage

Bandwidth optimization is unrelated.


Question 6

What does the temperature setting control in many generative AI models?

A. The physical temperature of the server hardware
B. The randomness and creativity of generated responses
C. The amount of training data stored
D. The encryption strength of the model


Correct Answer

B. The randomness and creativity of generated responses


Explanation

Higher temperature values generally produce more creative and varied responses, while lower values produce more predictable outputs.


Why the Other Answers Are Incorrect

A. The physical temperature of the server hardware

Temperature is a model configuration setting, not a hardware measurement.

C. The amount of training data stored

Temperature does not affect stored data size.

D. The encryption strength of the model

Temperature is unrelated to encryption.


Question 7

What is a hallucination in generative AI?

A. A hardware malfunction during training
B. A correct response with high confidence
C. An incorrect or fabricated output generated by the model
D. A type of data encryption


Correct Answer

C. An incorrect or fabricated output generated by the model


Explanation

Hallucinations occur when a generative AI model produces false or misleading information that appears convincing.


Why the Other Answers Are Incorrect

A. A hardware malfunction during training

Hallucinations are output issues, not hardware failures.

B. A correct response with high confidence

Hallucinations are inaccurate responses.

D. A type of data encryption

Hallucinations are unrelated to encryption.


Question 8

What is the PRIMARY purpose of a prompt in generative AI?

A. To physically start a computer server
B. To provide instructions or input to guide model output
C. To encrypt training data
D. To replace model training


Correct Answer

B. To provide instructions or input to guide model output


Explanation

Prompts tell the model what task to perform or what type of response to generate.


Why the Other Answers Are Incorrect

A. To physically start a computer server

Prompts are text inputs, not hardware controls.

C. To encrypt training data

Prompts are unrelated to encryption.

D. To replace model training

Prompts guide trained models but do not replace training.


Question 9

What is Retrieval-Augmented Generation (RAG)?

A. A hardware acceleration technique
B. A method that combines generative AI with external information retrieval
C. A database backup process
D. A data compression algorithm


Correct Answer

B. A method that combines generative AI with external information retrieval


Explanation

RAG improves AI responses by retrieving relevant external information before generating outputs.


Why the Other Answers Are Incorrect

A. A hardware acceleration technique

RAG is not a hardware feature.

C. A database backup process

RAG is unrelated to backups.

D. A data compression algorithm

Compression is unrelated.


Question 10

Which statement BEST describes a foundation model?

A. A small model designed for a single narrow task
B. A large pretrained model adaptable to many AI tasks
C. A hardware device used for AI training
D. A database management system


Correct Answer

B. A large pretrained model adaptable to many AI tasks


Explanation

Foundation models are large AI models trained on massive datasets that can be adapted for many applications, including chatbots, summarization, and image generation.


Why the Other Answers Are Incorrect

A. A small model designed for a single narrow task

Foundation models are broad and highly adaptable.

C. A hardware device used for AI training

Foundation models are software models, not hardware.

D. A database management system

Databases manage data but are not AI models.


Final Thoughts

Generative AI is a major area of modern artificial intelligence and an important topic for the AI-901 certification exam. Microsoft expects candidates to understand the foundational concepts behind how generative AI models work, including tokens, transformers, prompts, training, and model behavior.

Understanding these concepts provides a strong foundation for working with modern AI systems and Azure AI technologies.


Go to the AI-901 Exam Prep Hub main page

Describe considerations for accountability in an AI solution (AI-901 Exam Prep)

This post is a part of the AI-901: Microsoft Azure AI Fundamentals Exam Prep Hub. 
This topic falls under these sections:
Identify AI concepts and capabilities (40–45%)
--> Describe principles of responsible AI
--> Describe considerations for accountability in an AI solution


Note that there are 10 practice questions (with answers and explanations) for each section to help you solidify your knowledge of the material. Also, there are 2 practice tests with 60 questions each available on the hub below the exam topics section.

Accountability is one of Microsoft’s core Responsible AI principles and an important topic for the AI-901 certification exam. Accountability means that organizations and individuals remain responsible for the design, deployment, operation, and outcomes of AI systems.

Even when AI systems automate decisions or recommendations, humans and organizations are still accountable for how those systems behave and affect people.


What Is Accountability in AI?

Accountability in AI means that organizations must:

  • Take responsibility for AI system behavior
  • Monitor AI systems appropriately
  • Correct problems when issues arise
  • Ensure AI is used ethically and safely
  • Establish governance and oversight processes

AI systems should not operate without human responsibility or organizational oversight.


Why Accountability Matters

AI systems can significantly affect people’s lives in areas such as:

  • Hiring
  • Healthcare
  • Banking
  • Education
  • Insurance
  • Law enforcement
  • Customer service

If an AI system causes harm, produces biased outcomes, or makes incorrect decisions, organizations cannot simply blame the technology.

Humans remain responsible for:

  • Designing the system
  • Choosing training data
  • Setting policies
  • Reviewing outputs
  • Monitoring system performance

Accountability helps ensure organizations use AI responsibly.


Human Responsibility in AI

One of the most important ideas in accountability is that humans remain responsible for AI systems.

AI systems should support human decision-making rather than completely replace accountability.

Example

If an AI system incorrectly denies a loan application, the financial institution remains responsible for addressing the issue.

Organizations cannot avoid responsibility by claiming, “The AI made the decision.”


Governance and Oversight

Organizations should establish governance structures for AI systems.

Governance refers to the policies, processes, and controls used to manage AI responsibly.

Governance Activities Include:

  • Defining acceptable AI usage
  • Reviewing high-risk systems
  • Monitoring model performance
  • Conducting audits
  • Managing compliance requirements
  • Responding to incidents

Strong governance improves accountability and reduces risk.


Human Oversight

Humans should remain involved in reviewing sensitive or high-impact AI decisions.

Examples

  • Doctors reviewing AI-assisted diagnoses
  • Recruiters reviewing hiring recommendations
  • Bank employees reviewing loan decisions

Human oversight helps:

  • Catch errors
  • Detect unfair outcomes
  • Prevent harmful actions
  • Improve trust

Auditability and Record Keeping

Organizations should maintain records about AI systems, including:

  • Training data sources
  • Model versions
  • System decisions
  • Performance metrics
  • Configuration changes
  • User activity logs

These records support:

  • Auditing
  • Troubleshooting
  • Compliance
  • Investigations

Auditability is an important accountability practice.


Monitoring AI Systems

AI systems should be continuously monitored after deployment.

Monitoring helps organizations identify:

  • Bias
  • Reliability issues
  • Security threats
  • Performance degradation
  • Unexpected behavior

Without monitoring, harmful issues may go unnoticed.


Incident Response

Organizations should prepare for situations where AI systems fail or behave improperly.

Example

If an AI chatbot begins generating harmful responses, the organization should have procedures for:

  • Disabling the system
  • Investigating the issue
  • Correcting the problem
  • Communicating with affected users

Accountability includes responding appropriately when problems occur.


Accountability in Generative AI

Generative AI introduces additional accountability challenges.

Organizations using generative AI should consider:

  • Content moderation
  • Human review
  • Usage policies
  • Monitoring outputs
  • Preventing misuse
  • Handling hallucinations and misinformation

Example

A company deploying an AI writing assistant remains responsible for ensuring harmful or misleading content is not distributed.


Legal and Ethical Responsibility

Organizations may face legal or regulatory consequences if AI systems:

  • Violate privacy laws
  • Discriminate unfairly
  • Cause financial harm
  • Create safety risks

Accountability helps ensure compliance with:

  • Industry regulations
  • Ethical standards
  • Internal policies

Shared Accountability

AI accountability is often shared across multiple groups, including:

  • Executives
  • Developers
  • Data scientists
  • Security teams
  • Compliance officers
  • Business stakeholders

Responsible AI requires collaboration across the organization.


Real-World Example

Scenario: AI Hiring System

A company uses AI to screen job applicants.

Accountability Risks

  • Biased hiring recommendations
  • Lack of human review
  • Poor documentation
  • Unclear responsibility for decisions

Accountability Practices

  • Human recruiter review
  • Audit logs
  • Regular fairness testing
  • Clear governance policies
  • Transparency with applicants
  • Monitoring system performance

Result

The organization maintains responsibility for hiring decisions rather than relying blindly on AI outputs.

This type of scenario aligns well with AI-901 exam questions.


Accountability and Transparency

Transparency and accountability are closely connected.

Transparency helps organizations:

  • Understand AI behavior
  • Investigate decisions
  • Explain outcomes
  • Support audits

Without transparency, accountability becomes more difficult.


Accountability and Human-in-the-Loop Systems

Human-in-the-loop systems require humans to participate in or approve AI-driven decisions.

Example

An AI fraud detection system flags suspicious transactions, but human analysts make the final decision to freeze accounts.

This approach improves accountability in high-risk scenarios.


Microsoft Responsible AI Principles

Microsoft identifies accountability as one of six Responsible AI principles:

  1. Fairness
  2. Reliability and safety
  3. Privacy and security
  4. Inclusiveness
  5. Transparency
  6. Accountability

For AI-901, understand that accountability focuses on ensuring humans and organizations remain responsible for AI systems and their outcomes.


Best Practices for Accountability in AI

Organizations commonly improve accountability through:


Governance Frameworks

Establish policies and procedures for responsible AI usage.


Human Oversight

Keep humans involved in sensitive decisions.


Monitoring and Auditing

Regularly review AI system behavior and maintain records.


Clear Roles and Responsibilities

Define who is responsible for:

  • Development
  • Deployment
  • Monitoring
  • Incident response

Documentation

Document model behavior, limitations, and risks.


Incident Management

Prepare procedures for handling AI failures or harmful outputs.


Azure and Responsible AI

Microsoft Azure AI Services and related Microsoft AI platforms provide tools and guidance that support accountability, including:

  • Monitoring tools
  • Governance capabilities
  • Logging and auditing features
  • Responsible AI guidance
  • Security and compliance controls

Microsoft encourages organizations to build AI systems with strong governance and human responsibility.


Important AI-901 Exam Tips

For the exam, remember these key points:

  • Humans and organizations remain responsible for AI outcomes.
  • AI systems should not operate without oversight.
  • Governance frameworks support accountability.
  • Human oversight is important in sensitive scenarios.
  • Monitoring and auditing improve accountability.
  • Incident response plans help manage AI failures.
  • Generative AI requires additional governance and monitoring.
  • Accountability is one of Microsoft’s six Responsible AI principles.

Quick Knowledge Check

Question 1

What does accountability mean in AI?

Answer

Organizations and individuals remain responsible for AI systems and their outcomes.


Question 2

Why is human oversight important for accountability?

Answer

Humans can review, validate, and correct AI decisions when necessary.


Question 3

What is auditability in AI?

Answer

The ability to review records, logs, and system behavior for investigation and compliance purposes.


Question 4

Why are governance frameworks important in AI?

Answer

They establish policies, controls, and responsibilities for responsible AI management.


Practice Exam Questions

Question 1

An organization deploys an AI system that denies loan applications automatically. A customer asks who is responsible for the decision.

What is the MOST appropriate answer?

A. The AI model is fully responsible for the decision
B. No one is responsible once the system is deployed
C. The organization that deployed the AI system is responsible
D. Responsibility is shared only with the cloud provider


Correct Answer

C. The organization that deployed the AI system is responsible


Explanation

Accountability in AI means that organizations remain responsible for AI system outcomes, even if decisions are automated.

AI does not remove human or organizational responsibility.


Why the Other Answers Are Incorrect

A. The AI model is fully responsible for the decision

AI systems are tools, not accountable entities.

B. No one is responsible once the system is deployed

Responsibility always remains with humans and organizations.

D. Responsibility is shared only with the cloud provider

Cloud providers are not responsible for how customers use AI outputs.


Question 2

What is the PRIMARY goal of accountability in AI?

A. Increasing model accuracy
B. Ensuring humans and organizations remain responsible for AI outcomes
C. Removing the need for monitoring
D. Eliminating all bias automatically


Correct Answer

B. Ensuring humans and organizations remain responsible for AI outcomes


Explanation

Accountability ensures that responsibility for AI behavior is clearly assigned and maintained.


Why the Other Answers Are Incorrect

A. Increasing model accuracy

Accuracy relates to model performance, not accountability.

C. Removing the need for monitoring

Monitoring is essential for accountability.

D. Eliminating all bias automatically

Bias reduction is part of fairness, not accountability.


Question 3

Which practice BEST supports accountability in an AI system?

A. Deleting system logs regularly
B. Maintaining audit logs of AI decisions and system activity
C. Preventing human access to AI outputs
D. Disabling model monitoring


Correct Answer

B. Maintaining audit logs of AI decisions and system activity


Explanation

Audit logs provide traceability and help organizations investigate and review AI system behavior.


Why the Other Answers Are Incorrect

A. Deleting system logs regularly

This reduces traceability.

C. Preventing human access to AI outputs

Human review is important for accountability.

D. Disabling model monitoring

Monitoring is essential for responsible AI.


Question 4

Why is human oversight important in AI systems?

A. It guarantees zero system failures
B. It ensures humans can review and correct AI decisions
C. It removes the need for data storage
D. It increases model training speed


Correct Answer

B. It ensures humans can review and correct AI decisions


Explanation

Human oversight helps ensure accountability by allowing people to intervene when AI systems make incorrect or harmful decisions.


Why the Other Answers Are Incorrect

A. It guarantees zero system failures

No system can guarantee zero failures.

C. It removes the need for data storage

Data storage is still required.

D. It increases model training speed

Human oversight is unrelated to training speed.


Question 5

A company uses an AI system to recommend job candidates but does not track how the model makes decisions or logs outputs.

What accountability issue does this MOST likely create?

A. Lack of auditability
B. Excessive transparency
C. Improved governance
D. Increased fairness


Correct Answer

A. Lack of auditability


Explanation

Without logs or records, it is difficult to trace decisions or investigate issues, reducing accountability.


Why the Other Answers Are Incorrect

B. Excessive transparency

Transparency is not the issue here.

C. Improved governance

This scenario reduces governance effectiveness.

D. Increased fairness

Lack of tracking does not improve fairness.


Question 6

What is incident response in AI accountability?

A. Increasing training dataset size
B. A process for handling AI failures or harmful outputs
C. A method for improving model speed
D. A technique for compressing data


Correct Answer

B. A process for handling AI failures or harmful outputs


Explanation

Incident response ensures organizations can quickly address and correct problems caused by AI systems.


Why the Other Answers Are Incorrect

A. Increasing training dataset size

This is unrelated to incident handling.

C. A method for improving model speed

Performance optimization is separate.

D. A technique for compressing data

Compression is unrelated.


Question 7

Which statement BEST describes accountability in AI?

A. AI systems are responsible for their own decisions
B. Developers and organizations remain responsible for AI outcomes
C. Cloud providers are fully responsible for all AI usage
D. Accountability is optional in AI systems


Correct Answer

B. Developers and organizations remain responsible for AI outcomes


Explanation

Accountability ensures humans and organizations are responsible for AI system behavior and consequences.


Why the Other Answers Are Incorrect

A. AI systems are responsible for their own decisions

AI is not an accountable entity.

C. Cloud providers are fully responsible for all AI usage

Responsibility lies with the organization using the system.

D. Accountability is optional in AI systems

It is a core Responsible AI principle.


Question 8

Which activity is MOST directly related to AI governance?

A. Writing marketing copy
B. Defining policies for responsible AI use and oversight
C. Increasing GPU performance
D. Compressing training data


Correct Answer

B. Defining policies for responsible AI use and oversight


Explanation

Governance includes policies, procedures, and controls that ensure AI systems are used responsibly.


Why the Other Answers Are Incorrect

A. Writing marketing copy

This is unrelated to governance.

C. Increasing GPU performance

This is a technical optimization task.

D. Compressing training data

This is a data engineering task.


Question 9

Why is documentation important for AI accountability?

A. It replaces the need for monitoring
B. It helps track system behavior, limitations, and decisions
C. It guarantees perfect model accuracy
D. It eliminates the need for human review


Correct Answer

B. It helps track system behavior, limitations, and decisions


Explanation

Documentation supports transparency and accountability by providing a record of how the AI system was built and behaves.


Why the Other Answers Are Incorrect

A. It replaces the need for monitoring

Monitoring is still required.

C. It guarantees perfect model accuracy

Documentation does not affect accuracy.

D. It eliminates the need for human review

Human review remains important.


Question 10

Which Microsoft Responsible AI principle focuses on ensuring responsibility for AI systems and their outcomes?

A. Fairness
B. Accountability
C. Transparency
D. Inclusiveness


Correct Answer

B. Accountability


Explanation

Accountability ensures that humans and organizations remain responsible for AI systems, including their design, deployment, and impact.


Why the Other Answers Are Incorrect

A. Fairness

Fairness focuses on avoiding bias and discrimination.

C. Transparency

Transparency focuses on explainability.

D. Inclusiveness

Inclusiveness focuses on accessibility and diverse users.


Final Thoughts

Accountability is a foundational Responsible AI principle and a key topic for the AI-901 certification exam. Microsoft expects candidates to understand that organizations remain responsible for the behavior and impact of AI systems, even when decisions are automated.

Strong accountability practices help organizations manage risk, improve trust, support compliance, and ensure AI technologies are used responsibly and ethically.


Go to the AI-901 Exam Prep Hub main page

Describe considerations for transparency in an AI solution (AI-901 Exam Prep)

This post is a part of the AI-901: Microsoft Azure AI Fundamentals Exam Prep Hub.
This topic falls under these sections:
Identify AI concepts and capabilities (40–45%)
–> Describe principles of responsible AI
–> Describe considerations for transparency in an AI solution


Note that there are 10 practice questions (with answers and explanations) for each section to help you solidify your knowledge of the material. Also, there are 2 practice tests with 60 questions each available on the hub below the exam topics section.


Transparency is one of Microsoft’s core Responsible AI principles and an important topic for the AI-901 certification exam. Transparency helps ensure that people understand when AI is being used, how AI systems make decisions, and what limitations or risks may exist.

Transparent AI systems help build trust, improve accountability, and support ethical decision-making.


What Is Transparency in AI?

Transparency in AI means that users and stakeholders should have appropriate visibility into:

  • When AI is being used
  • How AI systems make decisions
  • What data is being used
  • The capabilities and limitations of the AI system
  • The potential risks associated with the system

Transparency helps organizations avoid “black box” AI systems where decisions cannot be reasonably understood or explained.


Why Transparency Matters

AI systems increasingly influence important decisions in areas such as:

  • Healthcare
  • Banking
  • Hiring
  • Education
  • Insurance
  • Customer service
  • Government services

If users do not understand how AI systems operate, they may:

  • Lose trust in the system
  • Be unable to challenge incorrect decisions
  • Fail to identify bias or errors
  • Misuse the technology
  • Rely too heavily on inaccurate outputs

Transparent systems help users make informed decisions about how and when to use AI outputs.


Explainability in AI

One of the most important aspects of transparency is explainability.

Explainability refers to the ability to understand why an AI model made a specific decision or prediction.

Example

If an AI system denies a loan application, the organization should be able to explain the factors that influenced the decision.

Explainability is especially important in high-impact scenarios.


Black Box AI Systems

Some AI models, especially advanced deep learning systems, can be difficult to interpret.

These are sometimes called black box models because:

  • Their internal decision-making process is difficult to understand
  • Humans may not easily determine why a prediction was made

While highly complex models may offer strong performance, they can create transparency challenges.


Informing Users About AI Usage

Organizations should clearly communicate when users are interacting with AI systems.

Example

A chatbot should disclose that it is AI-powered rather than pretending to be a human agent.

Users should understand:

  • They are interacting with AI
  • AI-generated responses may contain errors
  • Human review may still be necessary

Transparency About Data Usage

Organizations should explain:

  • What data is collected
  • Why the data is collected
  • How the data is used
  • How long the data is retained
  • Who has access to the data

This supports both transparency and privacy goals.


Transparency in Generative AI

Generative AI systems create additional transparency considerations.

Users should understand that generated content may:

  • Be inaccurate
  • Contain hallucinations
  • Reflect bias
  • Be incomplete
  • Require verification

Example

An AI-generated summary should not automatically be assumed to be completely accurate without review.

Organizations should avoid presenting AI-generated information as guaranteed fact.


Model Documentation

Transparent AI systems often include documentation that explains:

  • Model purpose
  • Intended use cases
  • Training data sources
  • Known limitations
  • Performance characteristics
  • Risks and ethical considerations

Good documentation improves trust and accountability.


Human Interpretability

AI outputs should be understandable to the people using them whenever possible.

Example

A medical AI system may provide:

  • Confidence scores
  • Highlighted image regions
  • Explanations of risk factors

These explanations help doctors understand and validate the results.


Transparency and Trust

Transparency helps build trust because users are more likely to trust systems they understand.

Transparent AI systems help users:

  • Recognize limitations
  • Identify errors
  • Use AI responsibly
  • Make informed decisions

Lack of transparency can lead to skepticism, misuse, or overreliance on AI outputs.


Transparency vs. Complexity

There can be trade-offs between model complexity and explainability.

Example

A simple decision tree model may be easier to explain than a large neural network.

Organizations must balance:

  • Accuracy
  • Performance
  • Interpretability
  • Business requirements

In some high-risk scenarios, explainability may be more important than maximum predictive performance.


Real-World Example

Scenario: AI Loan Approval System

A bank uses AI to evaluate loan applications.

Transparency Requirements

  • Explain why applications are approved or denied
  • Inform users AI is assisting with decisions
  • Provide understandable explanations
  • Document model limitations
  • Allow human review of disputed decisions

Potential Risks Without Transparency

  • Customers may not understand denials
  • Hidden bias may go undetected
  • Regulators may raise compliance concerns
  • Trust in the system may decrease

Possible Solutions

  • Explainable AI tools
  • Human oversight
  • Model documentation
  • User communication
  • Decision summaries

This type of scenario aligns well with AI-901 exam questions.


Explainable AI (XAI)

Explainable AI (XAI) refers to techniques that help humans understand AI behavior.

XAI techniques may provide:

  • Feature importance
  • Confidence scores
  • Visual explanations
  • Decision summaries

These tools improve transparency and accountability.


Transparency in Microsoft Responsible AI

Microsoft identifies transparency as one of six Responsible AI principles:

  1. Fairness
  2. Reliability and safety
  3. Privacy and security
  4. Inclusiveness
  5. Transparency
  6. Accountability

For AI-901, understand that transparency focuses on making AI systems understandable and explainable.


Best Practices for Transparency in AI

Organizations commonly improve transparency through:


Clear User Communication

Tell users when AI is being used and explain system limitations.


Explainable Models

Use explainability techniques where appropriate.


Documentation

Maintain documentation about:

  • Data sources
  • Intended usage
  • Limitations
  • Risks

Human Oversight

Allow humans to review important AI decisions.


User Education

Help users understand:

  • What the AI can do
  • What it cannot do
  • When human judgment is needed

Monitoring and Auditing

Review AI decisions regularly to identify issues or unexpected behavior.


Azure and Transparency

Microsoft Azure AI Services and related Microsoft AI platforms provide tools and guidance to support transparency, including:

  • Responsible AI documentation
  • Explainability tools
  • Model evaluation features
  • Governance frameworks
  • Monitoring capabilities

Microsoft encourages organizations to design AI systems that users can understand and trust.


Important AI-901 Exam Tips

For the exam, remember these key points:

  • Transparency means making AI systems understandable and explainable.
  • Users should know when they are interacting with AI.
  • Explainability helps users understand AI decisions.
  • Black box models can create transparency challenges.
  • Transparency builds trust and accountability.
  • Generative AI outputs may require verification.
  • Documentation supports transparency.
  • Transparency is one of Microsoft’s six Responsible AI principles.

Quick Knowledge Check

Question 1

What is explainability in AI?

Answer

The ability to understand why an AI model made a specific decision or prediction.


Question 2

Why should users know when they are interacting with AI?

Answer

So they can make informed decisions and understand the limitations of the system.


Question 3

What is a black box AI model?

Answer

A model whose internal decision-making process is difficult to understand or explain.


Question 4

Why is transparency important in generative AI?

Answer

Because generated content may contain inaccuracies, hallucinations, or bias that users should recognize.


Practice Exam Questions

Question 1

A bank uses an AI model to evaluate loan applications. Customers can request an explanation of why their application was denied.

What Responsible AI concept does this BEST demonstrate?

A. Scalability
B. Explainability
C. Data compression
D. Batch processing


Correct Answer

B. Explainability


Explanation

Explainability refers to the ability to understand and communicate why an AI system made a specific decision or prediction.

This is an important aspect of transparency.


Why the Other Answers Are Incorrect

A. Scalability

Scalability refers to handling increased workloads.

C. Data compression

Compression reduces file size.

D. Batch processing

Batch processing refers to grouped data operations.


Question 2

What is the PRIMARY goal of transparency in AI?

A. Increasing hardware performance
B. Making AI systems understandable and explainable
C. Eliminating the need for documentation
D. Preventing all system failures


Correct Answer

B. Making AI systems understandable and explainable


Explanation

Transparency helps users and stakeholders understand how AI systems operate, make decisions, and use data.


Why the Other Answers Are Incorrect

A. Increasing hardware performance

Hardware optimization is unrelated to transparency.

C. Eliminating the need for documentation

Documentation supports transparency.

D. Preventing all system failures

Reliability and safety focus on system failures.


Question 3

Why should users be informed when interacting with an AI chatbot?

A. To improve internet speed
B. To help users understand they are communicating with AI-generated responses
C. To eliminate the need for security controls
D. To reduce storage requirements


Correct Answer

B. To help users understand they are communicating with AI-generated responses


Explanation

Transparency requires organizations to disclose AI usage so users can make informed decisions and understand system limitations.


Why the Other Answers Are Incorrect

A. To improve internet speed

Network speed is unrelated to transparency.

C. To eliminate the need for security controls

Security controls remain important.

D. To reduce storage requirements

Storage optimization is unrelated.


Question 4

What is a “black box” AI model?

A. A model with encrypted outputs
B. A model whose internal decision-making process is difficult to interpret
C. A model designed only for security applications
D. A model that stores data offline


Correct Answer

B. A model whose internal decision-making process is difficult to interpret


Explanation

Black box models are AI systems whose internal logic is difficult for humans to understand or explain.


Why the Other Answers Are Incorrect

A. A model with encrypted outputs

Encryption relates to security.

C. A model designed only for security applications

Black box models are not limited to security scenarios.

D. A model that stores data offline

Offline storage is unrelated to explainability.


Question 5

Which practice BEST improves transparency in an AI solution?

A. Hiding model limitations from users
B. Providing documentation about how the model works and its limitations
C. Removing human oversight
D. Disabling monitoring systems


Correct Answer

B. Providing documentation about how the model works and its limitations


Explanation

Clear documentation helps users and stakeholders understand AI capabilities, intended uses, risks, and limitations.


Why the Other Answers Are Incorrect

A. Hiding model limitations from users

Transparency requires openness about limitations.

C. Removing human oversight

Human oversight often supports Responsible AI.

D. Disabling monitoring systems

Monitoring helps maintain accountability and reliability.


Question 6

Why is transparency especially important in generative AI systems?

A. Generative AI never produces incorrect information
B. Users should understand that generated content may contain inaccuracies or bias
C. Transparency guarantees perfect model accuracy
D. Transparency removes all security risks


Correct Answer

B. Users should understand that generated content may contain inaccuracies or bias


Explanation

Generative AI systems can hallucinate facts, produce biased content, or generate misleading information. Users should understand these limitations.


Why the Other Answers Are Incorrect

A. Generative AI never produces incorrect information

Generative AI can produce inaccurate outputs.

C. Transparency guarantees perfect model accuracy

Transparency does not guarantee accuracy.

D. Transparency removes all security risks

Security risks still exist.


Question 7

A medical AI system highlights regions of an X-ray image that influenced its diagnosis recommendation.

What transparency technique is this demonstrating?

A. Explainable AI
B. Data poisoning
C. Encryption
D. Data normalization


Correct Answer

A. Explainable AI


Explanation

Explainable AI techniques help users understand how an AI system reached a conclusion.

Visual explanations are a common explainability method.


Why the Other Answers Are Incorrect

B. Data poisoning

Data poisoning is a malicious attack on training data.

C. Encryption

Encryption protects data confidentiality.

D. Data normalization

Normalization prepares data for analysis.


Question 8

Which Microsoft Responsible AI principle focuses on making AI systems understandable?

A. Fairness
B. Transparency
C. Inclusiveness
D. Reliability and safety


Correct Answer

B. Transparency


Explanation

The Transparency principle focuses on explainability, openness, and helping users understand AI systems and decisions.


Why the Other Answers Are Incorrect

A. Fairness

Fairness focuses on avoiding unjust bias.

C. Inclusiveness

Inclusiveness focuses on accessibility and diverse users.

D. Reliability and safety

Reliability and safety focus on dependable and safe operation.


Question 9

Why might organizations choose a simpler AI model instead of a more complex model?

A. Simpler models may be easier to explain and interpret
B. Simpler models always provide higher accuracy
C. Complex models cannot process data
D. Simpler models remove all privacy concerns


Correct Answer

A. Simpler models may be easier to explain and interpret


Explanation

There is often a trade-off between model complexity and explainability. Simpler models may improve transparency in sensitive scenarios.


Why the Other Answers Are Incorrect

B. Simpler models always provide higher accuracy

Complex models may sometimes be more accurate.

C. Complex models cannot process data

Complex models are commonly used in AI.

D. Simpler models remove all privacy concerns

Privacy concerns may still exist regardless of model complexity.


Question 10

What is one major benefit of transparency in AI systems?

A. Transparency eliminates the need for testing
B. Transparency helps build user trust and accountability
C. Transparency guarantees compliance with all laws
D. Transparency removes the need for human oversight


Correct Answer

B. Transparency helps build user trust and accountability


Explanation

When users understand how AI systems work and what their limitations are, they are more likely to trust and responsibly use the technology.


Why the Other Answers Are Incorrect

A. Transparency eliminates the need for testing

Testing remains necessary.

C. Transparency guarantees compliance with all laws

Compliance still requires governance and policy controls.

D. Transparency removes the need for human oversight

Human oversight may still be necessary in many scenarios.


Final Thoughts

Transparency is a foundational Responsible AI principle and a key topic for the AI-901 certification exam. Microsoft expects candidates to understand why explainability, communication, and openness are important in AI systems.

Transparent AI solutions help organizations build trust, improve accountability, and enable users to make informed decisions when interacting with AI technologies.


Go to the AI-901 Exam Prep Hub main page