This post is a part of the AI-103: Develop AI Apps and Agents on Azure Exam Prep Hub. 
This topic falls under these sections:
Plan and manage an Azure AI solution (25–30%)
   --> Choose the appropriate Foundry services for generative AI and agents
      --> Choose an appropriate method for retrieval and indexing

Note that there are 10 practice questions (with answers and explanations) at the end of each section to help you solidify your knowledge of the material. Also, there are 2 practice tests with 60 questions each available from the hub's main page below the exam topics section.

Introduction

One of the most important concepts in modern AI applications is the ability to retrieve the correct information efficiently and accurately.

The AI-103: Develop AI Apps and Agents on Azure certification exam heavily tests knowledge related to:

Retrieval methods
Indexing strategies
Vector search
Semantic search
Retrieval-Augmented Generation (RAG)
Hybrid search
Embeddings
Knowledge grounding

Modern AI systems are often only as effective as their retrieval systems.

Even highly advanced Large Language Models (LLMs) can:

Hallucinate
Provide outdated information
Miss relevant context

Retrieval and indexing systems solve these problems by providing grounded, relevant, and searchable information to AI applications.

For the AI-103 exam, you should understand:

Different retrieval methods
Different indexing approaches
When to use vector search
When keyword search is appropriate
When hybrid search is preferred
How embeddings support retrieval
How Azure AI Search supports enterprise AI systems
How RAG architectures work

What Is Retrieval?

Retrieval is the process of locating and returning relevant information from a data source.

Examples include:

Searching documents
Finding relevant knowledge articles
Retrieving product descriptions
Returning similar documents
Finding semantically related content

Retrieval is essential for:

AI copilots
Enterprise chatbots
Knowledge assistants
Search applications
Recommendation systems
AI agents

What Is Indexing?

Indexing is the process of organizing data to make retrieval efficient.

An index acts like a searchable map of content.

Without indexing:

Searches are slower
Retrieval is inefficient
AI systems scale poorly

Indexes may include:

Keywords
Metadata
Embeddings
Semantic relationships
Document structure

Why Retrieval and Indexing Matter in AI

Modern generative AI applications often use Retrieval-Augmented Generation (RAG).

RAG combines:

Retrieval systems
Search indexes
Embeddings
LLMs

This allows AI systems to:

Access current information
Use enterprise knowledge
Reduce hallucinations
Provide grounded answers
Improve accuracy

Azure Services for Retrieval and Indexing

The primary Azure service for retrieval and indexing is:

Azure AI Search

Additional supporting services include:

Azure OpenAI
Embedding models
Azure Cosmos DB
Azure SQL Database
Azure Blob Storage

Azure AI Search

Azure AI Search is Microsoft’s enterprise search platform.

It supports:

Full-text search
Semantic search
Vector search
Hybrid search
AI enrichment
Indexing pipelines

Azure AI Search is a core AI-103 exam topic.

Retrieval Methods

There are several major retrieval methods you must understand for AI-103.

Keyword Search

What Is Keyword Search?

Keyword search retrieves documents based on exact word matches.

Example:

Searching for:

“cloud security”

Returns documents containing those exact terms.

Advantages of Keyword Search

Fast
Simple
Efficient for exact matches
Mature technology
Works well for structured terminology

Limitations of Keyword Search

Keyword search struggles with:

Synonyms
Contextual meaning
Natural language understanding
Conceptual similarity

Example:

A search for:

“car”

May not return documents containing:

“vehicle”

When to Use Keyword Search

Use keyword search when:

Exact term matching is important
Queries are highly structured
Performance and simplicity matter
Semantic understanding is unnecessary

Semantic Search

What Is Semantic Search?

Semantic search understands meaning and context rather than relying only on exact words.

It uses AI to interpret:

Intent
Context
Relationships between concepts

Example of Semantic Search

A query for:

“How do I secure cloud infrastructure?”

May retrieve documents about:

Azure security
Network protection
Cloud compliance

Even if the exact words differ.

Advantages of Semantic Search

Better contextual understanding
Improved relevance
More natural interactions
Better user experience

Limitations of Semantic Search

More computationally expensive
May increase latency
Requires more advanced indexing

When to Use Semantic Search

Use semantic search when:

Natural language queries are common
Relevance is important
Users may not know exact terminology
Context matters

Vector Search

What Is Vector Search?

Vector search retrieves information using embeddings.

Embeddings are numerical vector representations of content.

Documents with similar meaning have vectors that are mathematically close.

How Vector Search Works

Documents are converted into embeddings
Embeddings are stored in a vector index
User queries are converted into embeddings
Similarity algorithms identify related vectors
Relevant documents are returned

Advantages of Vector Search

Excellent semantic similarity matching
Supports RAG architectures
Finds conceptually related content
Works well with natural language queries

Limitations of Vector Search

Higher storage requirements
More computational overhead
Requires embedding generation
More complex implementation

When to Use Vector Search

Use vector search when:

Building RAG systems
Implementing AI copilots
Performing semantic retrieval
Supporting conversational AI
Searching unstructured content

Hybrid Search

What Is Hybrid Search?

Hybrid search combines:

Keyword search
Semantic search
Vector search

This approach often produces the best retrieval quality.

Why Hybrid Search Matters

Hybrid search combines the strengths of multiple retrieval approaches.

Benefits include:

Exact keyword matching
Semantic understanding
Contextual similarity
Improved ranking quality

When to Use Hybrid Search

Use hybrid search when:

High retrieval quality is required
Enterprise search is needed
AI copilots require strong grounding
Search relevance is critical

Hybrid search is commonly used in production RAG systems.

Embeddings

What Are Embeddings?

Embeddings are numerical representations of data.

Embedding models transform:

Text
Images
Documents

Into vectors.

Embeddings capture semantic meaning.

Embedding Models

Azure OpenAI provides embedding models used for:

Vector search
Similarity matching
RAG systems
Recommendation systems

Chunking Strategies

What Is Chunking?

Chunking is the process of breaking large documents into smaller sections before indexing.

Chunking improves retrieval quality because:

Smaller chunks are easier to match
Context becomes more precise
Retrieval relevance improves

Common Chunking Methods

Fixed-Size Chunking

Documents are split into equal-sized chunks.

Advantages:

Simple
Easy to implement

Disadvantages:

May split important context

Semantic Chunking

Documents are split based on meaning or structure.

Advantages:

Better contextual integrity
Improved retrieval quality

Disadvantages:

More complex

Overlapping Chunks

Adjacent chunks share some content.

Advantages:

Preserves context continuity
Improves retrieval accuracy

Disadvantages:

Increased storage usage

Choosing a Chunking Strategy

Use Fixed-Size Chunking When:

Simplicity is important
Documents are uniform
Rapid implementation is needed

Use Semantic Chunking When:

Context preservation matters
Documents contain sections/topics
Retrieval quality is critical

Use Overlapping Chunks When:

Context continuity is important
Long-form content is indexed

Metadata Filtering

Indexes may include metadata such as:

Author
Date
Department
Category
Security level

Metadata filtering improves:

Precision
Security
Retrieval efficiency

Example Metadata Filtering Scenario

An enterprise chatbot retrieves only documents:

From HR
Created within the last year
Approved for employee access

Metadata filters help enforce these constraints.

Retrieval-Augmented Generation (RAG)

What Is RAG?

Retrieval-Augmented Generation combines retrieval systems with LLMs.

The workflow:

User submits a query
Query becomes an embedding
Vector search retrieves relevant documents
Retrieved content is added to the prompt
LLM generates grounded response

Benefits of RAG

RAG helps:

Reduce hallucinations
Use current enterprise data
Avoid retraining models
Improve factual accuracy
Support enterprise AI assistants

Choosing Retrieval Methods for RAG

Keyword Search

Best for:

Exact terminology
Compliance searches
Structured queries

Vector Search

Best for:

Semantic similarity
Natural language queries
Conversational AI

Hybrid Search

Best for:

Enterprise copilots
High-quality retrieval
Production RAG systems

Indexing Pipelines

What Is an Indexing Pipeline?

An indexing pipeline automates:

Data ingestion
Document parsing
Chunking
Embedding generation
Metadata extraction
Index updates

AI Enrichment

Azure AI Search supports AI enrichment during indexing.

AI enrichment may include:

OCR
Entity extraction
Key phrase extraction
Language detection
Image analysis

Incremental Indexing

Incremental indexing updates only changed documents.

Benefits:

Faster indexing
Lower compute costs
Better scalability

Full Reindexing

Full reindexing rebuilds the entire index.

Use when:

Schema changes occur
Embedding models change
Large structural updates are required

Choosing an Indexing Strategy

Use Incremental Indexing When:

Data changes frequently
Efficiency matters
Large datasets exist

Use Full Reindexing When:

Major schema updates occur
Embedding strategy changes
Large-scale restructuring is required

Security and Access Control

Retrieval systems often include:

Role-based access control
Document-level security
Metadata-based filtering

This ensures users retrieve only authorized content.

Common AI-103 Scenarios

Scenario 1: Enterprise Knowledge Assistant

Requirements:

Conversational search
Semantic retrieval
Enterprise grounding

Recommended Approach:

Azure AI Search
Embeddings
Hybrid search
RAG

Scenario 2: Compliance Document Search

Requirements:

Exact terminology
Legal references
Precision retrieval

Recommended Approach:

Keyword search
Metadata filtering

Scenario 3: AI Copilot

Requirements:

Natural language queries
Contextual retrieval
Strong relevance

Recommended Approach:

Hybrid search
Vector search
Embeddings

Scenario 4: Product Recommendation System

Requirements:

Similarity matching
Semantic relationships

Recommended Approach:

Embeddings
Vector search

Common AI-103 Exam Tips

Understand Retrieval Tradeoffs

Keyword Search

Fast
Exact matching
Weak semantic understanding

Semantic Search

Better contextual understanding
More advanced relevance

Vector Search

Best for semantic similarity
Requires embeddings

Hybrid Search

Often best overall retrieval quality

Know the Relationship Between Embeddings and Vector Search

Embeddings enable vector search.

Without embeddings, vector search cannot function.

Understand RAG Architectures

RAG combines:

Retrieval
Indexing
Vector search
LLMs

This is one of the MOST important AI-103 topics.

Learn Chunking Concepts

Chunking affects:

Retrieval quality
Context preservation
Index efficiency

Chunking questions commonly appear in scenario-based exam questions.

Summary

Retrieval and indexing are foundational components of modern AI systems.

For the AI-103 exam, you should understand:

Keyword search
Semantic search
Vector search
Hybrid search
Embeddings
Chunking strategies
Metadata filtering
Indexing pipelines
Incremental indexing
RAG architectures
Azure AI Search capabilities

Choosing the correct retrieval and indexing approach directly affects:

AI accuracy
Groundedness
Scalability
Cost
Performance
User experience

Strong retrieval systems are essential for enterprise AI copilots, chatbots, and AI agents.

Hybrid search combines multiple retrieval methods for improved relevance.

Go to the AI-103 Exam Prep Hub main page