Tag: Azure AI Vision Service

Identify capabilities of Azure AI services, including Azure AI Vision in Foundry Tools, Azure AI Search, and Microsoft Foundry (AB-731 Exam Prep)

This post is a part of the AB-731: AI Transformation Leader Exam Prep Hub.
This topic falls under these sections:
Identify benefits, capabilities, and opportunities for Microsoft’s AI apps and services (35–40%)
   --> Identify benefits and capabilities of Foundry Tools
      --> Identify capabilities of Azure AI services, including Azure AI Vision in Foundry Tools, Azure AI Search, and Microsoft Foundry


Note that there are 10 practice questions (with answers) at the end of each section to help you solidify your knowledge of the material. Also, there are 4 practice tests with 30 questions each available from the hub's main page below the exam topics section.

Introduction

One of the objectives in the AB-731: AI Transformation Leader exam is understanding how Microsoft’s AI platform capabilities can be applied to business problems. Leaders are not expected to build these solutions themselves, but they should understand which services are available, what problems they solve, and how they create business value.

This topic focuses on:

  • Azure AI Vision
  • Azure AI Search
  • Microsoft Foundry (Azure AI Foundry)
  • How these services work together to create enterprise AI solutions

Understanding Microsoft’s AI Platform

Microsoft provides a collection of AI services that allow organizations to:

  • Analyze images and documents
  • Search and retrieve organizational knowledge
  • Build generative AI applications
  • Create intelligent agents
  • Ground AI responses with enterprise data
  • Manage AI projects securely and responsibly

These services are available through Microsoft Foundry, which acts as a central environment for building, testing, and managing AI solutions.


Microsoft Foundry Overview

Microsoft Foundry (Azure AI Foundry) is Microsoft’s unified AI platform for developing and managing AI applications.

It provides:

  • Access to foundation models
  • Agent development tools
  • Prompt flows
  • Evaluation tools
  • Safety and content filtering
  • Knowledge grounding capabilities
  • Integration with Azure AI services
  • Monitoring and governance capabilities

Business Value

Foundry enables organizations to:

  • Accelerate AI development
  • Reduce complexity
  • Standardize AI projects
  • Improve governance
  • Support responsible AI practices
  • Build custom AI solutions without creating infrastructure from scratch

Azure AI Services

Azure AI services are prebuilt AI capabilities that developers can incorporate into applications.

Examples include:

ServicePurpose
Azure AI VisionAnalyze images and visual content
Azure AI SearchRetrieve and index enterprise information
Speech ServicesSpeech-to-text and text-to-speech
Language ServicesSentiment analysis, summarization, translation
Document IntelligenceExtract information from forms and documents

These services reduce development effort because organizations can use Microsoft’s pretrained models instead of building their own.


Azure AI Vision

Azure AI Vision enables AI systems to understand images and visual information.

Capabilities include:

Image Analysis

The service can identify:

  • Objects
  • People
  • Text
  • Colors
  • Scenes

Example:

A retailer can analyze product images automatically.


Optical Character Recognition (OCR)

AI Vision can extract text from:

  • Invoices
  • Receipts
  • Signs
  • Printed documents
  • Images

Example:

Insurance companies can process claim documents automatically.


Image Captioning

The service can generate descriptions of images.

Example:

“Two people sitting at a conference table using laptops.”

This improves accessibility and supports content management.


Spatial Analysis

Organizations can monitor movement and occupancy.

Example:

Retail stores can analyze customer traffic patterns.


Face Detection (Limited Scenarios)

AI Vision can locate faces in images, although Microsoft follows responsible AI principles and restricts facial recognition capabilities.


Azure AI Vision Within Foundry Tools

Inside Microsoft Foundry, AI Vision can become part of larger AI workflows.

For example:

  1. Upload an image.
  2. Extract text using OCR.
  3. Store results.
  4. Use generative AI to summarize findings.
  5. Present insights to users.

Business scenarios include:

Manufacturing

  • Defect detection
  • Quality control

Healthcare

  • Medical image support
  • Document digitization

Retail

  • Shelf monitoring
  • Product identification

Finance

  • Receipt processing
  • Expense automation

Azure AI Search

Azure AI Search is Microsoft’s enterprise search and retrieval platform.

It helps AI systems locate information from:

  • Documents
  • PDFs
  • Databases
  • Websites
  • Knowledge bases
  • SharePoint repositories

The service indexes content so information can be retrieved quickly.


Key Capabilities of Azure AI Search

1. Full-Text Search

Users can search documents using keywords.

Example:

“Show all contracts mentioning renewal dates.”


2. Semantic Search

Instead of matching only keywords, semantic search understands meaning.

Example:

Searching:

“Vacation rules”

may return documents titled:

“Employee Leave Policy”


3. Vector Search

Vector search finds content based on similarity rather than exact wording.

This capability is especially important for:

  • Generative AI
  • Retrieval-Augmented Generation (RAG)
  • Copilot solutions

4. Hybrid Search

Hybrid search combines:

  • Keyword search
  • Semantic search
  • Vector search

This produces more accurate results.


5. Security Trimming

Search results can respect existing permissions.

Users only see content they are authorized to access.

This is critical for enterprise AI systems.


Azure AI Search and RAG

One of the most important uses of Azure AI Search is supporting Retrieval-Augmented Generation (RAG).

RAG process:

  1. User asks a question.
  2. AI Search retrieves relevant information.
  3. Retrieved documents ground the model.
  4. The LLM generates a response based on company data.

Benefits:

  • Fewer hallucinations
  • More accurate responses
  • Current organizational information
  • Improved trust

Microsoft Foundry Capabilities

Model Catalog

Organizations can choose from multiple AI models.

Examples include:

  • OpenAI models
  • Microsoft models
  • Third-party models

Agent Development

Foundry supports creation of AI agents that can:

  • Perform tasks
  • Access data
  • Use tools
  • Execute workflows

Prompt Flow

Prompt Flow enables teams to:

  • Design prompts
  • Test prompts
  • Evaluate outputs
  • Optimize AI applications

Evaluations

Organizations can measure:

  • Accuracy
  • Relevance
  • Safety
  • Groundedness

This helps improve AI quality.


Responsible AI Features

Foundry includes:

  • Content filtering
  • Safety systems
  • Monitoring
  • Governance capabilities

These features help organizations implement responsible AI.


Data Grounding

Foundry integrates with:

  • Azure AI Search
  • Databases
  • Documents
  • External systems

Grounding improves response quality and reduces hallucinations.


Example End-to-End Scenario

A legal organization builds an AI assistant.

Step 1

Contracts are stored in SharePoint.

Step 2

Azure AI Search indexes documents.

Step 3

A user asks:

“Which contracts expire next quarter?”

Step 4

Relevant documents are retrieved.

Step 5

The language model generates an answer.

Step 6

Foundry applies safety controls and monitoring.

Result:

A secure, enterprise-grade AI assistant.


When to Use Each Service

NeedRecommended Service
Image analysisAzure AI Vision
OCR and text extractionAzure AI Vision
Enterprise searchAzure AI Search
RAG applicationsAzure AI Search
Model managementMicrosoft Foundry
Agent developmentMicrosoft Foundry
AI governanceMicrosoft Foundry
Evaluation and prompt testingMicrosoft Foundry

Key Exam Tips

Remember:

  • Azure AI Vision analyzes images and extracts text.
  • Azure AI Search retrieves and indexes enterprise knowledge.
  • Vector search and semantic search support RAG solutions.
  • Microsoft Foundry provides a unified AI development environment.
  • Foundry includes safety, evaluation, monitoring, and governance capabilities.
  • Azure AI services provide pretrained AI capabilities that reduce development effort.
  • These services work together to create enterprise AI solutions.

Practice Exam Questions


Question 1

A company wants to extract text from scanned invoices and automate expense processing. Which service should they primarily use?

A. Azure AI Search
B. Azure AI Vision
C. Microsoft Foundry Agent Service
D. Microsoft Fabric

Answer: B

Explanation:
Azure AI Vision provides OCR capabilities that can extract text from receipts and scanned documents.

  • A is incorrect because Search retrieves information rather than extracting text from images.
  • C is incorrect because agents use information but do not perform OCR directly.
  • D is incorrect because Fabric focuses on analytics and data workloads.

Question 2

Which capability of Azure AI Search helps retrieve documents based on meaning rather than exact keywords?

A. Full-text indexing
B. OCR
C. Semantic search
D. Content filtering

Answer: C

Explanation:
Semantic search understands context and intent, allowing related documents to be returned even when exact words differ.

  • A relies on keywords.
  • B belongs to Vision services.
  • D is a safety capability.

Question 3

What is a primary purpose of Microsoft Foundry?

A. Replacing Azure subscriptions
B. Serving as a unified environment for building and managing AI applications
C. Acting as a database engine
D. Providing endpoint security

Answer: B

Explanation:
Microsoft Foundry centralizes model access, prompt engineering, evaluations, governance, and AI application development.

  • A, C, and D describe unrelated technologies.

Question 4

Which search capability is especially important for Retrieval-Augmented Generation (RAG)?

A. Vector search
B. OCR
C. Batch processing
D. Image captioning

Answer: A

Explanation:
Vector search enables similarity-based retrieval, which is foundational to RAG systems.

  • B and D are Vision features.
  • C is unrelated.

Question 5

An organization wants AI responses to respect document permissions so employees only see authorized information. Which capability supports this requirement?

A. Image analysis
B. Prompt Flow
C. Security trimming
D. Caption generation

Answer: C

Explanation:
Security trimming ensures search results honor existing access permissions.

  • A and D are Vision capabilities.
  • B manages prompts rather than permissions.

Question 6

Which Microsoft service is primarily responsible for analyzing image content?

A. Azure AI Search
B. Microsoft Purview
C. Microsoft Defender for Cloud
D. Azure AI Vision

Answer: D

Explanation:
Azure AI Vision provides image analysis, OCR, and captioning capabilities.

  • The other services serve different purposes.

Question 7

What is one benefit of grounding generative AI with Azure AI Search?

A. Eliminates all security requirements
B. Removes the need for prompts
C. Reduces hallucinations and improves answer accuracy
D. Replaces foundation models

Answer: C

Explanation:
Grounding with enterprise data helps AI provide more reliable responses.

  • A, B, and D are incorrect.

Question 8

Which capability is provided directly by Microsoft Foundry?

A. Road traffic navigation
B. Prompt evaluation and testing
C. Firewall management
D. Email hosting

Answer: B

Explanation:
Foundry includes prompt flow and evaluation tools to improve AI quality.

  • The remaining options are unrelated.

Question 9

A retailer wants AI to identify products shown in photographs. Which service is most appropriate?

A. Azure AI Vision
B. Azure AI Search
C. Azure Virtual Desktop
D. Microsoft Intune

Answer: A

Explanation:
Image analysis capabilities in Azure AI Vision can recognize objects and visual content.

  • B retrieves documents.
  • C and D are endpoint technologies.

Question 10

Which combination best supports an enterprise RAG solution?

A. Azure AI Vision + Microsoft Intune
B. Power BI + Defender for Endpoint
C. Azure Virtual Network + Entra ID
D. Azure AI Search + Microsoft Foundry

Answer: D

Explanation:
Azure AI Search retrieves organizational information, while Microsoft Foundry provides the AI platform, models, and orchestration capabilities required to deliver grounded AI experiences.

  • The other combinations do not provide complete RAG functionality.

Go to the AB-731 Exam Prep Hub main page

Build a lightweight application that includes vision capabilities (AI-901 Exam Prep)

This post is a part of the AI-901: Microsoft Azure AI Fundamentals Exam Prep Hub. 
This topic falls under these sections:
Implement AI solutions by using Microsoft Foundry (55–60%)
--> Implement AI solutions with computer vision and image-generation capabilities by using Foundry
--> Build a lightweight application that includes vision capabilities


Note that there are 10 practice questions (with answers and explanations) for each section to help you solidify your knowledge of the material. Also, there are 2 practice tests with 60 questions each available on the hub below the exam topics section.

Computer vision enables AI systems to interpret and analyze visual information such as images and videos. Organizations use computer vision solutions for automation, accessibility, security, analytics, and customer experiences.

For the AI-901 certification exam, candidates should understand the foundational concepts behind building lightweight applications that include vision capabilities by using Microsoft Azure AI services and Azure AI Foundry.

This topic falls under the “Implement AI solutions with computer vision and image-generation capabilities by using Foundry” section of the AI-901 exam objectives.


What Is Computer Vision?

Computer vision is a field of AI that enables systems to analyze and understand visual information.

Visual data may include:

  • Images
  • Videos
  • Scanned documents
  • Camera feeds

Common Computer Vision Tasks

Computer vision systems commonly perform:

  • Image classification
  • Object detection
  • Optical character recognition (OCR)
  • Facial analysis
  • Image captioning
  • Content moderation

Azure AI Vision

Azure AI Vision provides computer vision capabilities through cloud-based AI services.

Features include:

  • Image analysis
  • OCR
  • Object detection
  • Image captioning
  • Facial attribute analysis

What Is a Lightweight Application?

A lightweight application is a simple application designed to perform focused tasks with minimal complexity and infrastructure.

Characteristics include:

  • Simple user interface
  • Fast deployment
  • Minimal resource usage
  • Easy maintenance

Examples of Lightweight Vision Applications

Examples include:

  • Image analysis tools
  • Receipt scanning apps
  • Accessibility assistants
  • Product recognition apps
  • Photo-tagging systems

Azure AI Foundry

Azure AI Foundry provides tools for building, testing, and managing AI-powered applications.

Developers can:

  • Access AI models
  • Deploy services
  • Test prompts
  • Build AI workflows

Image Classification

Image classification identifies the main category or subject of an image.


Example

Image

Photo of a bicycle

Classification

“Bicycle”


Object Detection

Object detection identifies multiple objects and their locations within an image.


Example

Image

Street scene

Detected Objects

  • Car
  • Traffic light
  • Pedestrian
  • Bicycle

Optical Character Recognition (OCR)

OCR extracts text from images and scanned documents.


Example

Image

Photo of a restaurant menu

Extracted Text

Menu items and prices


Image Captioning

Image captioning generates natural-language descriptions of images.


Example

Image

A dog playing in a park

Caption

“A brown dog running through a grassy park.”


Facial Analysis

Computer vision systems can analyze facial features.

Possible capabilities include:

  • Face detection
  • Emotion analysis
  • Age estimation

For Responsible AI reasons, facial recognition and identification systems require careful consideration.


APIs and Endpoints

Applications communicate with Azure AI services using:

  • APIs
  • Endpoints

These allow images to be analyzed programmatically.


Authentication

Applications must securely authenticate before accessing Azure AI services.

Common authentication methods include:

  • API keys
  • Azure credentials
  • Managed identities

User Interface Components

A lightweight vision application may include:

  • Image upload area
  • Camera capture button
  • Results display
  • Image preview

Real-Time Image Processing

Some applications process images in near real time.

Examples include:

  • Security monitoring
  • Live object detection
  • Accessibility tools

Example Workflow

A common workflow includes:

  1. User uploads image
  2. Application sends image to Azure AI Vision
  3. AI service analyzes image
  4. Results are returned
  5. Application displays findings

Example High-Level Pseudocode

image = upload_image()
results = analyze_image(image)
display_results(results)

For AI-901, understanding the workflow is more important than memorizing exact syntax.


Common Real-World Scenarios


Scenario 1: Receipt Scanner

Goal

Extract purchase information from receipts.

Features

  • OCR
  • Text extraction
  • Data organization

Scenario 2: Accessibility Assistant

Goal

Describe images for visually impaired users.

Features

  • Image captioning
  • OCR
  • Spoken descriptions

Scenario 3: Product Recognition

Goal

Identify products from photos.

Features

  • Object detection
  • Classification
  • Product lookup

Scenario 4: Content Moderation

Goal

Identify harmful or inappropriate images.

Features

  • Image analysis
  • Safety detection
  • Automated filtering

Responsible AI Considerations

Vision-enabled applications should follow Responsible AI principles.

Key considerations include:

  • Fairness
  • Privacy
  • Transparency
  • Inclusiveness
  • Accountability
  • Security

Privacy Concerns

Images may contain:

  • Personal data
  • Faces
  • Sensitive documents
  • Location information

Organizations should protect visual data appropriately.


Bias and Fairness

Computer vision systems may perform unevenly across:

  • Skin tones
  • Lighting conditions
  • Demographics
  • Environmental conditions

Testing and evaluation are important for fairness.


Transparency

Users should understand:

  • AI is analyzing images
  • AI-generated results may contain errors
  • Images may be processed in the cloud

Hallucinations and Errors

Vision systems may occasionally generate:

  • Incorrect captions
  • False detections
  • Inaccurate classifications

These incorrect outputs are sometimes called hallucinations.


Error Handling

Applications should handle:

  • Invalid image formats
  • Poor image quality
  • Authentication failures
  • Network interruptions
  • Rate limits

Image Quality Challenges

Computer vision accuracy can decrease with:

  • Blurry images
  • Poor lighting
  • Low resolution
  • Obstructed objects

Advantages of Vision Applications

Benefits include:

  • Automation
  • Faster analysis
  • Accessibility improvements
  • Improved customer experiences
  • Scalable image processing

Limitations of Vision Applications

Challenges include:

  • Recognition inaccuracies
  • Bias
  • Privacy concerns
  • Variable image quality
  • Ethical considerations

High-Level Architecture

A simplified architecture often includes:

  1. User interface
  2. Image upload/capture
  3. Azure AI Vision service
  4. AI analysis
  5. Results display

Generative Vision Capabilities

Some modern systems combine:

  • Computer vision
  • Generative AI

These multimodal systems can:

  • Analyze images
  • Generate descriptions
  • Answer visual questions
  • Create new images

Important AI-901 Exam Tips

For the exam, remember these key points:

  • Computer vision analyzes visual information.
  • Azure AI Vision provides computer vision capabilities.
  • OCR extracts text from images.
  • Object detection identifies multiple objects in images.
  • Image captioning generates natural-language image descriptions.
  • APIs and endpoints connect applications to Azure AI services.
  • Authentication secures service access.
  • Responsible AI principles apply to computer vision systems.
  • Image quality affects AI accuracy.
  • Hallucinations are inaccurate AI-generated outputs.

Quick Knowledge Check

Question 1

What does OCR do?

Answer

Extracts text from images and scanned documents.


Question 2

What is object detection?

Answer

Identifying and locating objects within an image.


Question 3

Why is authentication important?

Answer

It secures access to Azure AI services.


Question 4

What can reduce computer vision accuracy?

Answer

Poor image quality such as blur or low lighting.


Practice Exam Questions

Question 1

What is the PRIMARY purpose of computer vision?

A. To enable AI systems to analyze and understand visual information
B. To increase internet bandwidth
C. To manage database backups
D. To improve keyboard performance


Correct Answer

A. To enable AI systems to analyze and understand visual information


Explanation

Computer vision allows AI systems to process and interpret images, videos, and other visual data.


Why the Other Answers Are Incorrect

B. To increase internet bandwidth

Computer vision does not affect networking speed.

C. To manage database backups

This is unrelated to computer vision.

D. To improve keyboard performance

This is unrelated to AI vision systems.


Question 2

Which Azure service provides computer vision capabilities such as OCR and image analysis?

A. Azure AI Vision
B. Azure Backup
C. Azure Virtual Machines
D. Azure DNS


Correct Answer

A. Azure AI Vision


Explanation

Azure AI Vision provides cloud-based computer vision capabilities including OCR, object detection, and image captioning.


Why the Other Answers Are Incorrect

B. Azure Backup

This is a backup service.

C. Azure Virtual Machines

This provides compute infrastructure.

D. Azure DNS

This is a networking service.


Question 3

What does OCR stand for?

A. Optical Character Recognition
B. Open Cloud Rendering
C. Object Classification Registry
D. Operational Compute Routing


Correct Answer

A. Optical Character Recognition


Explanation

OCR extracts text from images or scanned documents.


Why the Other Answers Are Incorrect

B. Open Cloud Rendering

This is not the meaning of OCR.

C. Object Classification Registry

This is unrelated to OCR.

D. Operational Compute Routing

This is not a computer vision term.


Question 4

What is the PRIMARY purpose of object detection?

A. To identify and locate objects within an image
B. To translate spoken language
C. To summarize long documents
D. To compress image files


Correct Answer

A. To identify and locate objects within an image


Explanation

Object detection identifies multiple objects and their locations inside an image.


Why the Other Answers Are Incorrect

B. To translate spoken language

This is a speech AI task.

C. To summarize long documents

This is a text analysis task.

D. To compress image files

Object detection does not compress files.


Question 5

What does image captioning do?

A. Generates natural-language descriptions of images
B. Converts speech into text
C. Encrypts image files
D. Creates database tables


Correct Answer

A. Generates natural-language descriptions of images


Explanation

Image captioning creates human-readable descriptions of visual content.


Why the Other Answers Are Incorrect

B. Converts speech into text

This is speech recognition.

C. Encrypts image files

Encryption is unrelated to captioning.

D. Creates database tables

This is unrelated to computer vision.


Question 6

How do lightweight vision applications typically communicate with Azure AI services?

A. Through APIs and endpoints
B. Through printer drivers
C. Through monitor settings
D. Through USB-only connections


Correct Answer

A. Through APIs and endpoints


Explanation

Applications use APIs and cloud endpoints to send images and receive AI-generated analysis results.


Why the Other Answers Are Incorrect

B. Through printer drivers

Printers are unrelated to AI communication.

C. Through monitor settings

This is unrelated to cloud AI services.

D. Through USB-only connections

Cloud services use network communication.


Question 7

Why is authentication important when accessing Azure AI Vision services?

A. To secure access to AI resources
B. To increase image brightness
C. To improve keyboard response time
D. To accelerate internet speeds


Correct Answer

A. To secure access to AI resources


Explanation

Authentication helps ensure that only authorized users and applications can access Azure AI services.


Why the Other Answers Are Incorrect

B. To increase image brightness

Authentication does not affect image quality.

C. To improve keyboard response time

This is unrelated to authentication.

D. To accelerate internet speeds

Authentication does not improve network performance.


Question 8

Which Responsible AI concern is especially important in computer vision systems?

A. Protecting personal and sensitive visual information
B. Increasing monitor resolution
C. Improving printer speed
D. Reducing spreadsheet file sizes


Correct Answer

A. Protecting personal and sensitive visual information


Explanation

Images may contain faces, documents, or other sensitive information that must be protected.


Why the Other Answers Are Incorrect

B. Increasing monitor resolution

This is unrelated to Responsible AI.

C. Improving printer speed

Printers are unrelated to computer vision ethics.

D. Reducing spreadsheet file sizes

This is unrelated to image analysis.


Question 9

What challenge can reduce computer vision accuracy?

A. Poor image quality
B. Spreadsheet formatting
C. Keyboard layout changes
D. Audio playback speed


Correct Answer

A. Poor image quality


Explanation

Blur, low lighting, and low resolution can negatively affect image analysis accuracy.


Why the Other Answers Are Incorrect

B. Spreadsheet formatting

This does not affect vision systems.

C. Keyboard layout changes

This is unrelated to image processing.

D. Audio playback speed

This is unrelated to computer vision.


Question 10

What are hallucinations in AI vision systems?

A. Incorrect or fabricated AI-generated outputs
B. Hardware installation failures
C. Network outages
D. Printer connection problems


Correct Answer

A. Incorrect or fabricated AI-generated outputs


Explanation

Hallucinations occur when AI systems generate inaccurate descriptions or detections.


Why the Other Answers Are Incorrect

B. Hardware installation failures

This is unrelated to AI-generated outputs.

C. Network outages

This is a connectivity issue.

D. Printer connection problems

This is unrelated to AI vision systems.


Final Thoughts

Building lightweight applications with vision capabilities is an important topic for the AI-901 certification exam. Microsoft expects candidates to understand the foundational concepts behind computer vision applications, including image classification, object detection, OCR, APIs, authentication, Responsible AI principles, and real-world implementation workflows.

Azure AI Vision and Azure AI Foundry provide powerful cloud-based tools that make it easier to build intelligent applications capable of analyzing and understanding visual information.


Go to the AI-901 Exam Prep Hub main page

Practice Questions: Describe capabilities of the Azure AI Vision service (AI-900 Exam Prep)

Practice Exam Questions


Question 1

A company wants to automatically generate short descriptions such as “A group of people standing on a beach” for images uploaded to its website. No model training is required.

Which Azure service should be used?

A. Azure Machine Learning
B. Azure AI Vision image analysis
C. Azure Custom Vision
D. Azure OpenAI Service

Correct Answer: B

Explanation:
Azure AI Vision image analysis can generate natural language descriptions of images using prebuilt models. Azure Machine Learning and Custom Vision require training, and Azure OpenAI is not designed for image analysis tasks.


Question 2

Which Azure AI Vision capability extracts printed and handwritten text from scanned documents and images?

A. Image tagging
B. Object detection
C. Optical Character Recognition (OCR)
D. Facial analysis

Correct Answer: C

Explanation:
OCR is specifically designed to detect and extract text from images, including scanned documents and handwritten content.


Question 3

A developer needs to identify objects in an image and return their locations using bounding boxes.

Which Azure AI Vision feature should be used?

A. Image classification
B. Image tagging
C. Object detection
D. Image description

Correct Answer: C

Explanation:
Object detection identifies what objects are present and where they are located using bounding boxes and confidence scores.


Question 4

Which capability of Azure AI Vision can detect faces and return attributes such as estimated age and facial expression?

A. Facial recognition
B. Facial detection and facial analysis
C. Image classification
D. Custom Vision

Correct Answer: B

Explanation:
Azure AI Vision supports facial detection and analysis, which provides facial attributes but does not identify individuals.


Question 5

A solution must automatically assign keywords like “outdoor”, “food”, or “animal” to images for search and organization.

Which Azure AI Vision feature meets this requirement?

A. OCR
B. Object detection
C. Image tagging
D. Facial analysis

Correct Answer: C

Explanation:
Image tagging assigns descriptive labels to images to improve categorization and searchability.


Question 6

Which statement best describes Azure AI Vision?

A. It requires training a custom model for each scenario
B. It provides prebuilt computer vision capabilities through APIs
C. It is only used for facial recognition
D. It can only analyze video streams

Correct Answer: B

Explanation:
Azure AI Vision offers prebuilt computer vision models accessed via APIs, requiring no model training.


Question 7

A company wants to analyze images quickly without building or training a machine learning model.

Which Azure service is most appropriate?

A. Azure Machine Learning
B. Azure Custom Vision
C. Azure AI Vision
D. Azure Databricks

Correct Answer: C

Explanation:
Azure AI Vision is designed for quick deployment using prebuilt models, making it ideal when no custom training is required.


Question 8

Which task is NOT a capability of Azure AI Vision?

A. Detecting objects in an image
B. Extracting text from images
C. Identifying specific individuals in photos
D. Generating image descriptions

Correct Answer: C

Explanation:
Azure AI Vision does not identify individuals. Facial recognition and identity verification are restricted and not required for AI-900.


Question 9

A scenario mentions analyzing images while following Microsoft’s Responsible AI principles, particularly around privacy and fairness.

Which Azure AI Vision feature is most closely associated with these considerations?

A. Image tagging
B. Facial detection and analysis
C. OCR
D. Object detection

Correct Answer: B

Explanation:
Facial detection and analysis involve human data and are closely tied to privacy, fairness, and transparency considerations.


Question 10

When should Azure AI Vision be used instead of Azure Custom Vision?

A. When you need a highly specialized image classification model
B. When you want full control over training data
C. When you need prebuilt image analysis without training
D. When labeling thousands of custom images

Correct Answer: C

Explanation:
Azure AI Vision is ideal for prebuilt, general-purpose image analysis scenarios. Custom Vision is used when custom training is required.


Final Exam Tips for This Topic

  • Think prebuilt vs custom
  • Azure AI Vision = no training
  • OCR = text extraction
  • Object detection = what + where
  • Facial analysis ≠ facial recognition

Go to the AI-900 Exam Prep Hub main page.

Practice Questions: Describe Capabilities of the Azure AI Face Detection Service (AI-900 Exam Prep)

Practice Exam Questions


Question 1

A company wants to detect whether human faces appear in uploaded images and draw bounding boxes around them. The solution must not identify individuals.

Which Azure service should be used?

A. Azure Custom Vision
B. Azure AI Vision image classification
C. Azure AI Face detection
D. Azure OpenAI Service

Correct Answer: C

Explanation:
Azure AI Face detection is designed to detect faces and return their locations without identifying individuals. This aligns with privacy requirements and AI-900 expectations.


Question 2

Which task is supported by Azure AI Face detection?

A. Verifying a person’s identity against a database
B. Detecting the presence of human faces in an image
C. Training a custom facial recognition model
D. Authenticating users using facial biometrics

Correct Answer: B

Explanation:
Azure AI Face detection can detect faces and analyze facial attributes, but it does not perform identity verification or authentication.


Question 3

What type of information can Azure AI Face detection return for each detected face?

A. Person’s name and ID
B. Bounding box and facial attributes
C. Social media profile matches
D. Voice and speech characteristics

Correct Answer: B

Explanation:
The service returns face location (bounding box) and facial attributes such as estimated age or expression, not personal identity data.


Question 4

A scenario requires estimating whether people in an image appear to be smiling.

Which Azure AI Face detection capability supports this requirement?

A. Face identification
B. Facial attribute analysis
C. Image classification
D. Object detection

Correct Answer: B

Explanation:
Facial attribute analysis provides descriptive information such as facial expression, including whether a face appears to be smiling.


Question 5

Which statement best describes Azure AI Face detection for the AI-900 exam?

A. It requires training a custom dataset
B. It identifies known individuals in photos
C. It uses prebuilt models to analyze faces
D. It can only analyze video streams

Correct Answer: C

Explanation:
Azure AI Face detection uses pretrained models and requires no custom training, which is a key exam concept.


Question 6

A developer wants to count how many people appear in a group photo.

Which Azure AI service capability should be used?

A. OCR
B. Image tagging
C. Face detection
D. Image classification

Correct Answer: C

Explanation:
Face detection can identify multiple faces in a single image, making it suitable for counting people.


Question 7

Why is Azure AI Face detection closely associated with Responsible AI principles?

A. It uses unsupervised learning
B. It processes sensitive human biometric data
C. It requires large datasets
D. It supports only public images

Correct Answer: B

Explanation:
Facial data is considered sensitive personal data, so privacy, fairness, and transparency are especially important.


Question 8

Which scenario would be inappropriate for Azure AI Face detection?

A. Detecting faces in event photos
B. Estimating facial expressions
C. Identifying a person by name from an image
D. Drawing bounding boxes around faces

Correct Answer: C

Explanation:
Azure AI Face detection does not identify individuals. Identity recognition is outside the scope of AI-900 and restricted for ethical reasons.


Question 9

Which principle ensures users are informed when facial analysis is being used?

A. Reliability
B. Transparency
C. Inclusiveness
D. Sustainability

Correct Answer: B

Explanation:
Transparency requires that people understand when and how AI systems, such as facial detection, are being used.


Question 10

When comparing Azure AI Face detection with object detection, which statement is correct?

A. Object detection returns facial attributes
B. Face detection identifies any object in an image
C. Face detection focuses specifically on human faces
D. Both services identify individuals

Correct Answer: C

Explanation:
Face detection is specialized for human faces, while object detection identifies general objects like cars, animals, or furniture.


Exam Tip Recap 🔑

  • Face detection ≠ face recognition
  • Detects faces, locations, and attributes
  • Uses prebuilt models
  • Strong ties to Responsible AI

Go to the AI-900 Exam Prep Hub main page.

Describe Capabilities of the Azure AI Vision Service (AI-900 Exam Prep)

Overview

Azure AI Vision is Microsoft’s prebuilt computer vision service that enables applications to analyze images and videos without requiring machine learning expertise or custom model training. It provides REST APIs and SDKs that allow developers to easily extract visual insights such as objects, text, faces, and image descriptions.

For the AI-900 exam, you are expected to understand what Azure AI Vision can do, which problems it solves, and how it differs from custom vision solutions—not how to build or tune models.


What Is Azure AI Vision?

Azure AI Vision is part of Azure AI Services and offers ready-to-use computer vision capabilities, including:

  • Image analysis
  • Optical Character Recognition (OCR)
  • Facial detection and analysis
  • Object detection
  • Image tagging and categorization

These capabilities are powered by Microsoft-trained deep learning models and are accessed via APIs.


Core Capabilities of Azure AI Vision

1. Image Analysis

Azure AI Vision can analyze images to extract high-level insights, such as:

  • Objects present in an image (for example, car, building, person)
  • Scene descriptions in natural language
  • Image tags and categories
  • Visual features such as color distribution

Example use cases:

  • Auto-generating image captions
  • Content moderation
  • Organizing image libraries

👉 Exam tip: Image analysis describes what is in an image, not where every object is located with precision.


2. Object Detection

Object detection identifies specific objects in an image and returns:

  • Object names
  • Bounding box coordinates
  • Confidence scores

Example use cases:

  • Detecting vehicles in traffic images
  • Identifying products on store shelves

👉 Exam tip: Object detection includes location + object type, unlike image classification which only labels the image as a whole.


3. Optical Character Recognition (OCR)

OCR extracts printed and handwritten text from images and documents.

Azure AI Vision OCR supports:

  • Multiple languages
  • Structured and unstructured text
  • Images, screenshots, and scanned documents

Example use cases:

  • Digitizing receipts
  • Reading license plates
  • Extracting text from scanned forms

👉 Exam tip: OCR is about reading text, not understanding its meaning.


4. Facial Detection and Facial Analysis

Azure AI Vision can detect human faces in images and analyze non-identifying facial attributes, such as:

  • Face location (bounding boxes)
  • Facial landmarks
  • Estimated age range
  • Facial expressions
  • Accessories (glasses, masks)

⚠️ It does NOT identify individuals.

Example use cases:

  • Blurring faces for privacy
  • Counting people in images
  • Analyzing expressions in photos

👉 Exam tip:

  • Facial detection = where faces are
  • Facial analysis = attributes of faces
  • Facial recognition = identity (not required for AI-900)

5. Image Tagging and Categorization

Azure AI Vision automatically assigns tags and categories to images, such as:

  • “outdoor”
  • “food”
  • “animal”

These tags help with searchability and organization.

Example use cases:

  • Image indexing
  • Content filtering
  • Metadata enrichment

👉 Exam tip: Tagging helps describe images at a high level, not detect precise objects.


Azure AI Vision vs Custom Vision

FeatureAzure AI VisionAzure Custom Vision
Prebuilt models✅ Yes❌ No
Requires training❌ No✅ Yes
Quick setup✅ Yes❌ No
Specialized scenarios❌ Limited✅ Strong
AI-900 focus✅ Yes⚠️ Limited

👉 Exam takeaway:
If the question mentions no training, quick setup, or prebuilt models, Azure AI Vision is usually the right answer.


Responsible AI Considerations

Because Azure AI Vision can analyze images of people, Microsoft emphasizes:

  • Privacy and security of image data
  • Transparency in how visual data is processed
  • Fairness and bias mitigation
  • Appropriate use of facial analysis

👉 Exam tip: Facial capabilities often pair with Responsible AI principles in exam questions.


Common AI-900 Exam Scenarios

You should recognize Azure AI Vision when the scenario involves:

  • Analyzing images without training a model
  • Extracting text from images
  • Detecting faces but not identifying people
  • Automatically tagging or describing images

Key Exam Takeaways

  • Azure AI Vision is a prebuilt computer vision service
  • No machine learning expertise required
  • Supports image analysis, OCR, object detection, and facial analysis
  • Focuses on insight extraction, not identity
  • Frequently tested in scenario-based questions

Go to the Practice Exam Questions for this topic.

Go to the AI-900 Exam Prep Hub main page.