Tag: Azure AI Vision Service

AI, Microsoft Certification, Azure AI, AB-731 June 12, 2026

Identify capabilities of Azure AI services, including Azure AI Vision in Foundry Tools, Azure AI Search, and Microsoft Foundry (AB-731 Exam Prep)

This post is a part of the AB-731: AI Transformation Leader Exam Prep Hub.
This topic falls under these sections:
Identify benefits, capabilities, and opportunities for Microsoft’s AI apps and services (35–40%)
   --> Identify benefits and capabilities of Foundry Tools
      --> Identify capabilities of Azure AI services, including Azure AI Vision in Foundry Tools, Azure AI Search, and Microsoft Foundry

Note that there are 10 practice questions (with answers) at the end of each section to help you solidify your knowledge of the material. Also, there are 4 practice tests with 30 questions each available from the hub's main page below the exam topics section.

Introduction

One of the objectives in the AB-731: AI Transformation Leader exam is understanding how Microsoft’s AI platform capabilities can be applied to business problems. Leaders are not expected to build these solutions themselves, but they should understand which services are available, what problems they solve, and how they create business value.

This topic focuses on:

Azure AI Vision
Azure AI Search
Microsoft Foundry (Azure AI Foundry)
How these services work together to create enterprise AI solutions

Understanding Microsoft’s AI Platform

Microsoft provides a collection of AI services that allow organizations to:

Analyze images and documents
Search and retrieve organizational knowledge
Build generative AI applications
Create intelligent agents
Ground AI responses with enterprise data
Manage AI projects securely and responsibly

These services are available through Microsoft Foundry, which acts as a central environment for building, testing, and managing AI solutions.

Microsoft Foundry Overview

Microsoft Foundry (Azure AI Foundry) is Microsoft’s unified AI platform for developing and managing AI applications.

It provides:

Access to foundation models
Agent development tools
Prompt flows
Evaluation tools
Safety and content filtering
Knowledge grounding capabilities
Integration with Azure AI services
Monitoring and governance capabilities

Business Value

Foundry enables organizations to:

Accelerate AI development
Reduce complexity
Standardize AI projects
Improve governance
Support responsible AI practices
Build custom AI solutions without creating infrastructure from scratch

Azure AI Services

Azure AI services are prebuilt AI capabilities that developers can incorporate into applications.

Examples include:

Service	Purpose
Azure AI Vision	Analyze images and visual content
Azure AI Search	Retrieve and index enterprise information
Speech Services	Speech-to-text and text-to-speech
Language Services	Sentiment analysis, summarization, translation
Document Intelligence	Extract information from forms and documents

These services reduce development effort because organizations can use Microsoft’s pretrained models instead of building their own.

Azure AI Vision

Azure AI Vision enables AI systems to understand images and visual information.

Capabilities include:

Image Analysis

The service can identify:

Objects
People
Text
Colors
Scenes

Example:

A retailer can analyze product images automatically.

Optical Character Recognition (OCR)

AI Vision can extract text from:

Invoices
Receipts
Signs
Printed documents
Images

Example:

Insurance companies can process claim documents automatically.

Image Captioning

The service can generate descriptions of images.

Example:

“Two people sitting at a conference table using laptops.”

This improves accessibility and supports content management.

Spatial Analysis

Organizations can monitor movement and occupancy.

Example:

Retail stores can analyze customer traffic patterns.

Face Detection (Limited Scenarios)

AI Vision can locate faces in images, although Microsoft follows responsible AI principles and restricts facial recognition capabilities.

Azure AI Vision Within Foundry Tools

Inside Microsoft Foundry, AI Vision can become part of larger AI workflows.

For example:

Upload an image.
Extract text using OCR.
Store results.
Use generative AI to summarize findings.
Present insights to users.

Business scenarios include:

Manufacturing

Defect detection
Quality control

Healthcare

Medical image support
Document digitization

Retail

Shelf monitoring
Product identification

Finance

Receipt processing
Expense automation

Azure AI Search

Azure AI Search is Microsoft’s enterprise search and retrieval platform.

It helps AI systems locate information from:

Documents
PDFs
Databases
Websites
Knowledge bases
SharePoint repositories

The service indexes content so information can be retrieved quickly.

Key Capabilities of Azure AI Search

1. Full-Text Search

Users can search documents using keywords.

Example:

“Show all contracts mentioning renewal dates.”

2. Semantic Search

Instead of matching only keywords, semantic search understands meaning.

Example:

Searching:

“Vacation rules”

may return documents titled:

“Employee Leave Policy”

3. Vector Search

Vector search finds content based on similarity rather than exact wording.

This capability is especially important for:

Generative AI
Retrieval-Augmented Generation (RAG)
Copilot solutions

4. Hybrid Search

Hybrid search combines:

Keyword search
Semantic search
Vector search

This produces more accurate results.

5. Security Trimming

Search results can respect existing permissions.

Users only see content they are authorized to access.

This is critical for enterprise AI systems.

Azure AI Search and RAG

One of the most important uses of Azure AI Search is supporting Retrieval-Augmented Generation (RAG).

RAG process:

User asks a question.
AI Search retrieves relevant information.
Retrieved documents ground the model.
The LLM generates a response based on company data.

Benefits:

Fewer hallucinations
More accurate responses
Current organizational information
Improved trust

Microsoft Foundry Capabilities

Model Catalog

Organizations can choose from multiple AI models.

Examples include:

OpenAI models
Microsoft models
Third-party models

Agent Development

Foundry supports creation of AI agents that can:

Perform tasks
Access data
Use tools
Execute workflows

Prompt Flow

Prompt Flow enables teams to:

Design prompts
Test prompts
Evaluate outputs
Optimize AI applications

Evaluations

Organizations can measure:

Accuracy
Relevance
Safety
Groundedness

This helps improve AI quality.

Responsible AI Features

Foundry includes:

Content filtering
Safety systems
Monitoring
Governance capabilities

These features help organizations implement responsible AI.

Data Grounding

Foundry integrates with:

Azure AI Search
Databases
Documents
External systems

Grounding improves response quality and reduces hallucinations.

Example End-to-End Scenario

A legal organization builds an AI assistant.

Step 1

Contracts are stored in SharePoint.

Step 2

Azure AI Search indexes documents.

Step 3

A user asks:

“Which contracts expire next quarter?”

Step 4

Relevant documents are retrieved.

Step 5

The language model generates an answer.

Step 6

Foundry applies safety controls and monitoring.

Result:

A secure, enterprise-grade AI assistant.

When to Use Each Service

Need	Recommended Service
Image analysis	Azure AI Vision
OCR and text extraction	Azure AI Vision
Enterprise search	Azure AI Search
RAG applications	Azure AI Search
Model management	Microsoft Foundry
Agent development	Microsoft Foundry
AI governance	Microsoft Foundry
Evaluation and prompt testing	Microsoft Foundry

Key Exam Tips

Remember:

Azure AI Vision analyzes images and extracts text.
Azure AI Search retrieves and indexes enterprise knowledge.
Vector search and semantic search support RAG solutions.
Microsoft Foundry provides a unified AI development environment.
Foundry includes safety, evaluation, monitoring, and governance capabilities.
Azure AI services provide pretrained AI capabilities that reduce development effort.
These services work together to create enterprise AI solutions.

Practice Exam Questions

Question 1

A company wants to extract text from scanned invoices and automate expense processing. Which service should they primarily use?

A. Azure AI Search
B. Azure AI Vision
C. Microsoft Foundry Agent Service
D. Microsoft Fabric

Answer: B

Explanation:
Azure AI Vision provides OCR capabilities that can extract text from receipts and scanned documents.

A is incorrect because Search retrieves information rather than extracting text from images.
C is incorrect because agents use information but do not perform OCR directly.
D is incorrect because Fabric focuses on analytics and data workloads.

Question 2

Which capability of Azure AI Search helps retrieve documents based on meaning rather than exact keywords?

A. Full-text indexing
B. OCR
C. Semantic search
D. Content filtering

Answer: C

Explanation:
Semantic search understands context and intent, allowing related documents to be returned even when exact words differ.

A relies on keywords.
B belongs to Vision services.
D is a safety capability.

Question 3

What is a primary purpose of Microsoft Foundry?

A. Replacing Azure subscriptions
B. Serving as a unified environment for building and managing AI applications
C. Acting as a database engine
D. Providing endpoint security

Answer: B

Explanation:
Microsoft Foundry centralizes model access, prompt engineering, evaluations, governance, and AI application development.

A, C, and D describe unrelated technologies.

Question 4

Which search capability is especially important for Retrieval-Augmented Generation (RAG)?

A. Vector search
B. OCR
C. Batch processing
D. Image captioning

Answer: A

Explanation:
Vector search enables similarity-based retrieval, which is foundational to RAG systems.

B and D are Vision features.
C is unrelated.

Question 5

An organization wants AI responses to respect document permissions so employees only see authorized information. Which capability supports this requirement?

A. Image analysis
B. Prompt Flow
C. Security trimming
D. Caption generation

Answer: C

Explanation:
Security trimming ensures search results honor existing access permissions.

A and D are Vision capabilities.
B manages prompts rather than permissions.

Question 6

Which Microsoft service is primarily responsible for analyzing image content?

A. Azure AI Search
B. Microsoft Purview
C. Microsoft Defender for Cloud
D. Azure AI Vision

Answer: D

Explanation:
Azure AI Vision provides image analysis, OCR, and captioning capabilities.

The other services serve different purposes.

Question 7

What is one benefit of grounding generative AI with Azure AI Search?

A. Eliminates all security requirements
B. Removes the need for prompts
C. Reduces hallucinations and improves answer accuracy
D. Replaces foundation models

Answer: C

Explanation:
Grounding with enterprise data helps AI provide more reliable responses.

A, B, and D are incorrect.

Question 8

Which capability is provided directly by Microsoft Foundry?

A. Road traffic navigation
B. Prompt evaluation and testing
C. Firewall management
D. Email hosting

Answer: B

Explanation:
Foundry includes prompt flow and evaluation tools to improve AI quality.

The remaining options are unrelated.

Question 9

A retailer wants AI to identify products shown in photographs. Which service is most appropriate?

A. Azure AI Vision
B. Azure AI Search
C. Azure Virtual Desktop
D. Microsoft Intune

Answer: A

Explanation:
Image analysis capabilities in Azure AI Vision can recognize objects and visual content.

B retrieves documents.
C and D are endpoint technologies.

Question 10

Which combination best supports an enterprise RAG solution?

A. Azure AI Vision + Microsoft Intune
B. Power BI + Defender for Endpoint
C. Azure Virtual Network + Entra ID
D. Azure AI Search + Microsoft Foundry

Answer: D

Explanation:
Azure AI Search retrieves organizational information, while Microsoft Foundry provides the AI platform, models, and orchestration capabilities required to deliver grounded AI experiences.

The other combinations do not provide complete RAG functionality.

Go to the AB-731 Exam Prep Hub main page

AI, AI-901, Computer Vision, Microsoft Certification May 18, 2026

Build a lightweight application that includes vision capabilities (AI-901 Exam Prep)

This post is a part of the AI-901: Microsoft Azure AI Fundamentals Exam Prep Hub. 
This topic falls under these sections:
Implement AI solutions by using Microsoft Foundry (55–60%)
   --> Implement AI solutions with computer vision and image-generation capabilities by using Foundry
      --> Build a lightweight application that includes vision capabilities

Note that there are 10 practice questions (with answers and explanations) for each section to help you solidify your knowledge of the material. Also, there are 2 practice tests with 60 questions each available on the hub below the exam topics section.

Computer vision enables AI systems to interpret and analyze visual information such as images and videos. Organizations use computer vision solutions for automation, accessibility, security, analytics, and customer experiences.

For the AI-901 certification exam, candidates should understand the foundational concepts behind building lightweight applications that include vision capabilities by using Microsoft Azure AI services and Azure AI Foundry.

This topic falls under the “Implement AI solutions with computer vision and image-generation capabilities by using Foundry” section of the AI-901 exam objectives.

What Is Computer Vision?

Computer vision is a field of AI that enables systems to analyze and understand visual information.

Visual data may include:

Images
Videos
Scanned documents
Camera feeds

Common Computer Vision Tasks

Computer vision systems commonly perform:

Image classification
Object detection
Optical character recognition (OCR)
Facial analysis
Image captioning
Content moderation

Azure AI Vision

Azure AI Vision provides computer vision capabilities through cloud-based AI services.

Features include:

Image analysis
OCR
Object detection
Image captioning
Facial attribute analysis

What Is a Lightweight Application?

A lightweight application is a simple application designed to perform focused tasks with minimal complexity and infrastructure.

Characteristics include:

Simple user interface
Fast deployment
Minimal resource usage
Easy maintenance

Examples of Lightweight Vision Applications

Examples include:

Image analysis tools
Receipt scanning apps
Accessibility assistants
Product recognition apps
Photo-tagging systems

Azure AI Foundry

Azure AI Foundry provides tools for building, testing, and managing AI-powered applications.

Developers can:

Access AI models
Deploy services
Test prompts
Build AI workflows

Image Classification

Image classification identifies the main category or subject of an image.

Example

Image

Photo of a bicycle

Classification

“Bicycle”

Object Detection

Object detection identifies multiple objects and their locations within an image.

Example

Image

Street scene

Detected Objects

Car
Traffic light
Pedestrian
Bicycle

Optical Character Recognition (OCR)

OCR extracts text from images and scanned documents.

Example

Image

Photo of a restaurant menu

Extracted Text

Menu items and prices

Image Captioning

Image captioning generates natural-language descriptions of images.

Example

Image

A dog playing in a park

Caption

“A brown dog running through a grassy park.”

Facial Analysis

Computer vision systems can analyze facial features.

Possible capabilities include:

Face detection
Emotion analysis
Age estimation

For Responsible AI reasons, facial recognition and identification systems require careful consideration.

APIs and Endpoints

Applications communicate with Azure AI services using:

APIs
Endpoints

These allow images to be analyzed programmatically.

Authentication

Applications must securely authenticate before accessing Azure AI services.

Common authentication methods include:

API keys
Azure credentials
Managed identities

User Interface Components

A lightweight vision application may include:

Image upload area
Camera capture button
Results display
Image preview

Real-Time Image Processing

Some applications process images in near real time.

Examples include:

Security monitoring
Live object detection
Accessibility tools

Example Workflow

A common workflow includes:

User uploads image
Application sends image to Azure AI Vision
AI service analyzes image
Results are returned
Application displays findings

Example High-Level Pseudocode

			
image = upload_image()
results = analyze_image(image)
display_results(results)

For AI-901, understanding the workflow is more important than memorizing exact syntax.

Common Real-World Scenarios

Scenario 1: Receipt Scanner

Goal

Extract purchase information from receipts.

Features

OCR
Text extraction
Data organization

Scenario 2: Accessibility Assistant

Goal

Describe images for visually impaired users.

Features

Image captioning
OCR
Spoken descriptions

Scenario 3: Product Recognition

Goal

Identify products from photos.

Features

Object detection
Classification
Product lookup

Scenario 4: Content Moderation

Goal

Identify harmful or inappropriate images.

Features

Image analysis
Safety detection
Automated filtering

Responsible AI Considerations

Vision-enabled applications should follow Responsible AI principles.

Key considerations include:

Fairness
Privacy
Transparency
Inclusiveness
Accountability
Security

Privacy Concerns

Images may contain:

Personal data
Faces
Sensitive documents
Location information

Organizations should protect visual data appropriately.

Bias and Fairness

Computer vision systems may perform unevenly across:

Skin tones
Lighting conditions
Demographics
Environmental conditions

Testing and evaluation are important for fairness.

Transparency

Users should understand:

AI is analyzing images
AI-generated results may contain errors
Images may be processed in the cloud

Hallucinations and Errors

Vision systems may occasionally generate:

Incorrect captions
False detections
Inaccurate classifications

These incorrect outputs are sometimes called hallucinations.

Error Handling

Applications should handle:

Invalid image formats
Poor image quality
Authentication failures
Network interruptions
Rate limits

Image Quality Challenges

Computer vision accuracy can decrease with:

Blurry images
Poor lighting
Low resolution
Obstructed objects

Advantages of Vision Applications

Benefits include:

Automation
Faster analysis
Accessibility improvements
Improved customer experiences
Scalable image processing

Limitations of Vision Applications

Challenges include:

Recognition inaccuracies
Bias
Privacy concerns
Variable image quality
Ethical considerations

High-Level Architecture

A simplified architecture often includes:

User interface
Image upload/capture
Azure AI Vision service
AI analysis
Results display

Generative Vision Capabilities

Some modern systems combine:

Computer vision
Generative AI

These multimodal systems can:

Analyze images
Generate descriptions
Answer visual questions
Create new images

Important AI-901 Exam Tips

For the exam, remember these key points:

Computer vision analyzes visual information.
Azure AI Vision provides computer vision capabilities.
OCR extracts text from images.
Object detection identifies multiple objects in images.
Image captioning generates natural-language image descriptions.
APIs and endpoints connect applications to Azure AI services.
Authentication secures service access.
Responsible AI principles apply to computer vision systems.
Image quality affects AI accuracy.
Hallucinations are inaccurate AI-generated outputs.

Quick Knowledge Check

Question 1

What does OCR do?

Answer

Extracts text from images and scanned documents.

Question 2

What is object detection?

Answer

Identifying and locating objects within an image.

Question 3

Why is authentication important?

Answer

It secures access to Azure AI services.

Question 4

What can reduce computer vision accuracy?

Answer

Poor image quality such as blur or low lighting.

Practice Exam Questions

Question 1

What is the PRIMARY purpose of computer vision?

A. To enable AI systems to analyze and understand visual information
B. To increase internet bandwidth
C. To manage database backups
D. To improve keyboard performance

Correct Answer

A. To enable AI systems to analyze and understand visual information

Explanation

Computer vision allows AI systems to process and interpret images, videos, and other visual data.

Why the Other Answers Are Incorrect

B. To increase internet bandwidth

Computer vision does not affect networking speed.

C. To manage database backups

This is unrelated to computer vision.

D. To improve keyboard performance

This is unrelated to AI vision systems.

Question 2

Which Azure service provides computer vision capabilities such as OCR and image analysis?

A. Azure AI Vision
B. Azure Backup
C. Azure Virtual Machines
D. Azure DNS

Correct Answer

A. Azure AI Vision

Explanation

Azure AI Vision provides cloud-based computer vision capabilities including OCR, object detection, and image captioning.

Why the Other Answers Are Incorrect

B. Azure Backup

This is a backup service.

C. Azure Virtual Machines

This provides compute infrastructure.

D. Azure DNS

This is a networking service.

Question 3

What does OCR stand for?

A. Optical Character Recognition
B. Open Cloud Rendering
C. Object Classification Registry
D. Operational Compute Routing

Correct Answer

A. Optical Character Recognition

Explanation

OCR extracts text from images or scanned documents.

Why the Other Answers Are Incorrect

B. Open Cloud Rendering

This is not the meaning of OCR.

C. Object Classification Registry

This is unrelated to OCR.

D. Operational Compute Routing

This is not a computer vision term.

Question 4

What is the PRIMARY purpose of object detection?

A. To identify and locate objects within an image
B. To translate spoken language
C. To summarize long documents
D. To compress image files

Correct Answer

A. To identify and locate objects within an image

Explanation

Object detection identifies multiple objects and their locations inside an image.

Why the Other Answers Are Incorrect

B. To translate spoken language

This is a speech AI task.

C. To summarize long documents

This is a text analysis task.

D. To compress image files

Object detection does not compress files.

Question 5

What does image captioning do?

A. Generates natural-language descriptions of images
B. Converts speech into text
C. Encrypts image files
D. Creates database tables

Correct Answer

A. Generates natural-language descriptions of images

Explanation

Image captioning creates human-readable descriptions of visual content.

Why the Other Answers Are Incorrect

B. Converts speech into text

This is speech recognition.

C. Encrypts image files

Encryption is unrelated to captioning.

D. Creates database tables

This is unrelated to computer vision.

Question 6

How do lightweight vision applications typically communicate with Azure AI services?

A. Through APIs and endpoints
B. Through printer drivers
C. Through monitor settings
D. Through USB-only connections

Correct Answer

A. Through APIs and endpoints

Explanation

Applications use APIs and cloud endpoints to send images and receive AI-generated analysis results.

Why the Other Answers Are Incorrect

B. Through printer drivers

Printers are unrelated to AI communication.

C. Through monitor settings

This is unrelated to cloud AI services.

D. Through USB-only connections

Cloud services use network communication.

Question 7

Why is authentication important when accessing Azure AI Vision services?

A. To secure access to AI resources
B. To increase image brightness
C. To improve keyboard response time
D. To accelerate internet speeds

Correct Answer

A. To secure access to AI resources

Explanation

Authentication helps ensure that only authorized users and applications can access Azure AI services.

Why the Other Answers Are Incorrect

B. To increase image brightness

Authentication does not affect image quality.

C. To improve keyboard response time

This is unrelated to authentication.

D. To accelerate internet speeds

Authentication does not improve network performance.

Question 8

Which Responsible AI concern is especially important in computer vision systems?

A. Protecting personal and sensitive visual information
B. Increasing monitor resolution
C. Improving printer speed
D. Reducing spreadsheet file sizes

Correct Answer

A. Protecting personal and sensitive visual information

Explanation

Images may contain faces, documents, or other sensitive information that must be protected.

Why the Other Answers Are Incorrect

B. Increasing monitor resolution

This is unrelated to Responsible AI.

C. Improving printer speed

Printers are unrelated to computer vision ethics.

D. Reducing spreadsheet file sizes

This is unrelated to image analysis.

Question 9

What challenge can reduce computer vision accuracy?

A. Poor image quality
B. Spreadsheet formatting
C. Keyboard layout changes
D. Audio playback speed

Correct Answer

A. Poor image quality

Explanation

Blur, low lighting, and low resolution can negatively affect image analysis accuracy.

Why the Other Answers Are Incorrect

B. Spreadsheet formatting

This does not affect vision systems.

C. Keyboard layout changes

This is unrelated to image processing.

D. Audio playback speed

This is unrelated to computer vision.

Question 10

What are hallucinations in AI vision systems?

A. Incorrect or fabricated AI-generated outputs
B. Hardware installation failures
C. Network outages
D. Printer connection problems

Correct Answer

A. Incorrect or fabricated AI-generated outputs

Explanation

Hallucinations occur when AI systems generate inaccurate descriptions or detections.

Why the Other Answers Are Incorrect

B. Hardware installation failures

This is unrelated to AI-generated outputs.

C. Network outages

This is a connectivity issue.

D. Printer connection problems

This is unrelated to AI vision systems.

Final Thoughts

Building lightweight applications with vision capabilities is an important topic for the AI-901 certification exam. Microsoft expects candidates to understand the foundational concepts behind computer vision applications, including image classification, object detection, OCR, APIs, authentication, Responsible AI principles, and real-world implementation workflows.

Azure AI Vision and Azure AI Foundry provide powerful cloud-based tools that make it easier to build intelligent applications capable of analyzing and understanding visual information.

Go to the AI-901 Exam Prep Hub main page

AI, AI-900, Artificial Intelligence (AI), Computer Vision, Microsoft Certification January 31, 2026

Practice Questions: Describe capabilities of the Azure AI Vision service (AI-900 Exam Prep)

Practice Exam Questions

Question 1

A company wants to automatically generate short descriptions such as “A group of people standing on a beach” for images uploaded to its website. No model training is required.

Which Azure service should be used?

A. Azure Machine Learning
B. Azure AI Vision image analysis
C. Azure Custom Vision
D. Azure OpenAI Service

Correct Answer: B

Explanation:
Azure AI Vision image analysis can generate natural language descriptions of images using prebuilt models. Azure Machine Learning and Custom Vision require training, and Azure OpenAI is not designed for image analysis tasks.

Question 2

Which Azure AI Vision capability extracts printed and handwritten text from scanned documents and images?

A. Image tagging
B. Object detection
C. Optical Character Recognition (OCR)
D. Facial analysis

Correct Answer: C

Explanation:
OCR is specifically designed to detect and extract text from images, including scanned documents and handwritten content.

Question 3

A developer needs to identify objects in an image and return their locations using bounding boxes.

Which Azure AI Vision feature should be used?

A. Image classification
B. Image tagging
C. Object detection
D. Image description

Correct Answer: C

Explanation:
Object detection identifies what objects are present and where they are located using bounding boxes and confidence scores.

Question 4

Which capability of Azure AI Vision can detect faces and return attributes such as estimated age and facial expression?

A. Facial recognition
B. Facial detection and facial analysis
C. Image classification
D. Custom Vision

Correct Answer: B

Explanation:
Azure AI Vision supports facial detection and analysis, which provides facial attributes but does not identify individuals.

Question 5

A solution must automatically assign keywords like “outdoor”, “food”, or “animal” to images for search and organization.

Which Azure AI Vision feature meets this requirement?

A. OCR
B. Object detection
C. Image tagging
D. Facial analysis

Correct Answer: C

Explanation:
Image tagging assigns descriptive labels to images to improve categorization and searchability.

Question 6

Which statement best describes Azure AI Vision?

A. It requires training a custom model for each scenario
B. It provides prebuilt computer vision capabilities through APIs
C. It is only used for facial recognition
D. It can only analyze video streams

Correct Answer: B

Explanation:
Azure AI Vision offers prebuilt computer vision models accessed via APIs, requiring no model training.

Question 7

A company wants to analyze images quickly without building or training a machine learning model.

Which Azure service is most appropriate?

A. Azure Machine Learning
B. Azure Custom Vision
C. Azure AI Vision
D. Azure Databricks

Correct Answer: C

Explanation:
Azure AI Vision is designed for quick deployment using prebuilt models, making it ideal when no custom training is required.

Question 8

Which task is NOT a capability of Azure AI Vision?

A. Detecting objects in an image
B. Extracting text from images
C. Identifying specific individuals in photos
D. Generating image descriptions

Correct Answer: C

Explanation:
Azure AI Vision does not identify individuals. Facial recognition and identity verification are restricted and not required for AI-900.

Question 9

A scenario mentions analyzing images while following Microsoft’s Responsible AI principles, particularly around privacy and fairness.

Which Azure AI Vision feature is most closely associated with these considerations?

A. Image tagging
B. Facial detection and analysis
C. OCR
D. Object detection

Correct Answer: B

Explanation:
Facial detection and analysis involve human data and are closely tied to privacy, fairness, and transparency considerations.

Question 10

When should Azure AI Vision be used instead of Azure Custom Vision?

A. When you need a highly specialized image classification model
B. When you want full control over training data
C. When you need prebuilt image analysis without training
D. When labeling thousands of custom images

Correct Answer: C

Explanation:
Azure AI Vision is ideal for prebuilt, general-purpose image analysis scenarios. Custom Vision is used when custom training is required.

Final Exam Tips for This Topic

Think prebuilt vs custom
Azure AI Vision = no training
OCR = text extraction
Object detection = what + where
Facial analysis ≠ facial recognition

Go to the AI-900 Exam Prep Hub main page.

AI, AI-900, Artificial Intelligence (AI), Computer Vision, Microsoft Certification January 31, 2026

Practice Questions: Describe Capabilities of the Azure AI Face Detection Service (AI-900 Exam Prep)

Practice Exam Questions

Question 1

A company wants to detect whether human faces appear in uploaded images and draw bounding boxes around them. The solution must not identify individuals.

Which Azure service should be used?

A. Azure Custom Vision
B. Azure AI Vision image classification
C. Azure AI Face detection
D. Azure OpenAI Service

Correct Answer: C

Explanation:
Azure AI Face detection is designed to detect faces and return their locations without identifying individuals. This aligns with privacy requirements and AI-900 expectations.

Question 2

Which task is supported by Azure AI Face detection?

A. Verifying a person’s identity against a database
B. Detecting the presence of human faces in an image
C. Training a custom facial recognition model
D. Authenticating users using facial biometrics

Correct Answer: B

Explanation:
Azure AI Face detection can detect faces and analyze facial attributes, but it does not perform identity verification or authentication.

Question 3

What type of information can Azure AI Face detection return for each detected face?

A. Person’s name and ID
B. Bounding box and facial attributes
C. Social media profile matches
D. Voice and speech characteristics

Correct Answer: B

Explanation:
The service returns face location (bounding box) and facial attributes such as estimated age or expression, not personal identity data.

Question 4

A scenario requires estimating whether people in an image appear to be smiling.

Which Azure AI Face detection capability supports this requirement?

A. Face identification
B. Facial attribute analysis
C. Image classification
D. Object detection

Correct Answer: B

Explanation:
Facial attribute analysis provides descriptive information such as facial expression, including whether a face appears to be smiling.

Question 5

Which statement best describes Azure AI Face detection for the AI-900 exam?

A. It requires training a custom dataset
B. It identifies known individuals in photos
C. It uses prebuilt models to analyze faces
D. It can only analyze video streams

Correct Answer: C

Explanation:
Azure AI Face detection uses pretrained models and requires no custom training, which is a key exam concept.

Question 6

A developer wants to count how many people appear in a group photo.

Which Azure AI service capability should be used?

A. OCR
B. Image tagging
C. Face detection
D. Image classification

Correct Answer: C

Explanation:
Face detection can identify multiple faces in a single image, making it suitable for counting people.

Question 7

Why is Azure AI Face detection closely associated with Responsible AI principles?

A. It uses unsupervised learning
B. It processes sensitive human biometric data
C. It requires large datasets
D. It supports only public images

Correct Answer: B

Explanation:
Facial data is considered sensitive personal data, so privacy, fairness, and transparency are especially important.

Question 8

Which scenario would be inappropriate for Azure AI Face detection?

A. Detecting faces in event photos
B. Estimating facial expressions
C. Identifying a person by name from an image
D. Drawing bounding boxes around faces

Correct Answer: C

Explanation:
Azure AI Face detection does not identify individuals. Identity recognition is outside the scope of AI-900 and restricted for ethical reasons.

Question 9

Which principle ensures users are informed when facial analysis is being used?

A. Reliability
B. Transparency
C. Inclusiveness
D. Sustainability

Correct Answer: B

Explanation:
Transparency requires that people understand when and how AI systems, such as facial detection, are being used.

Question 10

When comparing Azure AI Face detection with object detection, which statement is correct?

A. Object detection returns facial attributes
B. Face detection identifies any object in an image
C. Face detection focuses specifically on human faces
D. Both services identify individuals

Correct Answer: C

Explanation:
Face detection is specialized for human faces, while object detection identifies general objects like cars, animals, or furniture.

Exam Tip Recap 🔑

Face detection ≠ face recognition
Detects faces, locations, and attributes
Uses prebuilt models
Strong ties to Responsible AI

Go to the AI-900 Exam Prep Hub main page.

AI, AI-900, Artificial Intelligence (AI), Computer Vision, Microsoft Certification January 31, 2026

Describe Capabilities of the Azure AI Vision Service (AI-900 Exam Prep)

Overview

Azure AI Vision is Microsoft’s prebuilt computer vision service that enables applications to analyze images and videos without requiring machine learning expertise or custom model training. It provides REST APIs and SDKs that allow developers to easily extract visual insights such as objects, text, faces, and image descriptions.

For the AI-900 exam, you are expected to understand what Azure AI Vision can do, which problems it solves, and how it differs from custom vision solutions—not how to build or tune models.

What Is Azure AI Vision?

Azure AI Vision is part of Azure AI Services and offers ready-to-use computer vision capabilities, including:

Image analysis
Optical Character Recognition (OCR)
Facial detection and analysis
Object detection
Image tagging and categorization

These capabilities are powered by Microsoft-trained deep learning models and are accessed via APIs.

Core Capabilities of Azure AI Vision

1. Image Analysis

Azure AI Vision can analyze images to extract high-level insights, such as:

Objects present in an image (for example, car, building, person)
Scene descriptions in natural language
Image tags and categories
Visual features such as color distribution

Example use cases:

Auto-generating image captions
Content moderation
Organizing image libraries

👉 Exam tip: Image analysis describes what is in an image, not where every object is located with precision.

2. Object Detection

Object detection identifies specific objects in an image and returns:

Object names
Bounding box coordinates
Confidence scores

Example use cases:

Detecting vehicles in traffic images
Identifying products on store shelves

👉 Exam tip: Object detection includes location + object type, unlike image classification which only labels the image as a whole.

3. Optical Character Recognition (OCR)

OCR extracts printed and handwritten text from images and documents.

Azure AI Vision OCR supports:

Multiple languages
Structured and unstructured text
Images, screenshots, and scanned documents

Example use cases:

Digitizing receipts
Reading license plates
Extracting text from scanned forms

👉 Exam tip: OCR is about reading text, not understanding its meaning.

4. Facial Detection and Facial Analysis

Azure AI Vision can detect human faces in images and analyze non-identifying facial attributes, such as:

Face location (bounding boxes)
Facial landmarks
Estimated age range
Facial expressions
Accessories (glasses, masks)

⚠️ It does NOT identify individuals.

Example use cases:

Blurring faces for privacy
Counting people in images
Analyzing expressions in photos

👉 Exam tip:

Facial detection = where faces are
Facial analysis = attributes of faces
Facial recognition = identity (not required for AI-900)

5. Image Tagging and Categorization

Azure AI Vision automatically assigns tags and categories to images, such as:

“outdoor”
“food”
“animal”

These tags help with searchability and organization.

Example use cases:

Image indexing
Content filtering
Metadata enrichment

👉 Exam tip: Tagging helps describe images at a high level, not detect precise objects.

Azure AI Vision vs Custom Vision

Feature	Azure AI Vision	Azure Custom Vision
Prebuilt models	✅ Yes	❌ No
Requires training	❌ No	✅ Yes
Quick setup	✅ Yes	❌ No
Specialized scenarios	❌ Limited	✅ Strong
AI-900 focus	✅ Yes	⚠️ Limited

👉 Exam takeaway:
If the question mentions no training, quick setup, or prebuilt models, Azure AI Vision is usually the right answer.

Responsible AI Considerations

Because Azure AI Vision can analyze images of people, Microsoft emphasizes:

Privacy and security of image data
Transparency in how visual data is processed
Fairness and bias mitigation
Appropriate use of facial analysis

👉 Exam tip: Facial capabilities often pair with Responsible AI principles in exam questions.

Common AI-900 Exam Scenarios

You should recognize Azure AI Vision when the scenario involves:

Analyzing images without training a model
Extracting text from images
Detecting faces but not identifying people
Automatically tagging or describing images

Key Exam Takeaways

Azure AI Vision is a prebuilt computer vision service
No machine learning expertise required
Supports image analysis, OCR, object detection, and facial analysis
Focuses on insight extraction, not identity
Frequently tested in scenario-based questions

Go to the Practice Exam Questions for this topic.

Go to the AI-900 Exam Prep Hub main page.