Choose an appropriate method for retrieval and indexing (AI-103 Exam Prep)

This post is a part of the AI-103: Develop AI Apps and Agents on Azure Exam Prep Hub. 
This topic falls under these sections:
Plan and manage an Azure AI solution (25–30%)
--> Choose the appropriate Foundry services for generative AI and agents
--> Choose an appropriate method for retrieval and indexing


Note that there are 10 practice questions (with answers and explanations) at the end of each section to help you solidify your knowledge of the material. Also, there are 2 practice tests with 60 questions each available from the hub's main page below the exam topics section.

Introduction

One of the most important concepts in modern AI applications is the ability to retrieve the correct information efficiently and accurately.

The AI-103: Develop AI Apps and Agents on Azure certification exam heavily tests knowledge related to:

  • Retrieval methods
  • Indexing strategies
  • Vector search
  • Semantic search
  • Retrieval-Augmented Generation (RAG)
  • Hybrid search
  • Embeddings
  • Knowledge grounding

Modern AI systems are often only as effective as their retrieval systems.

Even highly advanced Large Language Models (LLMs) can:

  • Hallucinate
  • Provide outdated information
  • Miss relevant context

Retrieval and indexing systems solve these problems by providing grounded, relevant, and searchable information to AI applications.

For the AI-103 exam, you should understand:

  • Different retrieval methods
  • Different indexing approaches
  • When to use vector search
  • When keyword search is appropriate
  • When hybrid search is preferred
  • How embeddings support retrieval
  • How Azure AI Search supports enterprise AI systems
  • How RAG architectures work

What Is Retrieval?

Retrieval is the process of locating and returning relevant information from a data source.

Examples include:

  • Searching documents
  • Finding relevant knowledge articles
  • Retrieving product descriptions
  • Returning similar documents
  • Finding semantically related content

Retrieval is essential for:

  • AI copilots
  • Enterprise chatbots
  • Knowledge assistants
  • Search applications
  • Recommendation systems
  • AI agents

What Is Indexing?

Indexing is the process of organizing data to make retrieval efficient.

An index acts like a searchable map of content.

Without indexing:

  • Searches are slower
  • Retrieval is inefficient
  • AI systems scale poorly

Indexes may include:

  • Keywords
  • Metadata
  • Embeddings
  • Semantic relationships
  • Document structure

Why Retrieval and Indexing Matter in AI

Modern generative AI applications often use Retrieval-Augmented Generation (RAG).

RAG combines:

  • Retrieval systems
  • Search indexes
  • Embeddings
  • LLMs

This allows AI systems to:

  • Access current information
  • Use enterprise knowledge
  • Reduce hallucinations
  • Provide grounded answers
  • Improve accuracy

Azure Services for Retrieval and Indexing

The primary Azure service for retrieval and indexing is:

  • Azure AI Search

Additional supporting services include:

  • Azure OpenAI
  • Embedding models
  • Azure Cosmos DB
  • Azure SQL Database
  • Azure Blob Storage

Azure AI Search

Azure AI Search is Microsoft’s enterprise search platform.

It supports:

  • Full-text search
  • Semantic search
  • Vector search
  • Hybrid search
  • AI enrichment
  • Indexing pipelines

Azure AI Search is a core AI-103 exam topic.


Retrieval Methods

There are several major retrieval methods you must understand for AI-103.


Keyword Search

What Is Keyword Search?

Keyword search retrieves documents based on exact word matches.

Example:

Searching for:

“cloud security”

Returns documents containing those exact terms.


Advantages of Keyword Search

  • Fast
  • Simple
  • Efficient for exact matches
  • Mature technology
  • Works well for structured terminology

Limitations of Keyword Search

Keyword search struggles with:

  • Synonyms
  • Contextual meaning
  • Natural language understanding
  • Conceptual similarity

Example:

A search for:

“car”

May not return documents containing:

“vehicle”


When to Use Keyword Search

Use keyword search when:

  • Exact term matching is important
  • Queries are highly structured
  • Performance and simplicity matter
  • Semantic understanding is unnecessary

Semantic Search

What Is Semantic Search?

Semantic search understands meaning and context rather than relying only on exact words.

It uses AI to interpret:

  • Intent
  • Context
  • Relationships between concepts

Example of Semantic Search

A query for:

“How do I secure cloud infrastructure?”

May retrieve documents about:

  • Azure security
  • Network protection
  • Cloud compliance

Even if the exact words differ.


Advantages of Semantic Search

  • Better contextual understanding
  • Improved relevance
  • More natural interactions
  • Better user experience

Limitations of Semantic Search

  • More computationally expensive
  • May increase latency
  • Requires more advanced indexing

When to Use Semantic Search

Use semantic search when:

  • Natural language queries are common
  • Relevance is important
  • Users may not know exact terminology
  • Context matters

Vector Search

What Is Vector Search?

Vector search retrieves information using embeddings.

Embeddings are numerical vector representations of content.

Documents with similar meaning have vectors that are mathematically close.


How Vector Search Works

  1. Documents are converted into embeddings
  2. Embeddings are stored in a vector index
  3. User queries are converted into embeddings
  4. Similarity algorithms identify related vectors
  5. Relevant documents are returned

Advantages of Vector Search

  • Excellent semantic similarity matching
  • Supports RAG architectures
  • Finds conceptually related content
  • Works well with natural language queries

Limitations of Vector Search

  • Higher storage requirements
  • More computational overhead
  • Requires embedding generation
  • More complex implementation

When to Use Vector Search

Use vector search when:

  • Building RAG systems
  • Implementing AI copilots
  • Performing semantic retrieval
  • Supporting conversational AI
  • Searching unstructured content

Hybrid Search

What Is Hybrid Search?

Hybrid search combines:

  • Keyword search
  • Semantic search
  • Vector search

This approach often produces the best retrieval quality.


Why Hybrid Search Matters

Hybrid search combines the strengths of multiple retrieval approaches.

Benefits include:

  • Exact keyword matching
  • Semantic understanding
  • Contextual similarity
  • Improved ranking quality

When to Use Hybrid Search

Use hybrid search when:

  • High retrieval quality is required
  • Enterprise search is needed
  • AI copilots require strong grounding
  • Search relevance is critical

Hybrid search is commonly used in production RAG systems.


Embeddings

What Are Embeddings?

Embeddings are numerical representations of data.

Embedding models transform:

  • Text
  • Images
  • Documents

Into vectors.

Embeddings capture semantic meaning.


Embedding Models

Azure OpenAI provides embedding models used for:

  • Vector search
  • Similarity matching
  • RAG systems
  • Recommendation systems

Chunking Strategies

What Is Chunking?

Chunking is the process of breaking large documents into smaller sections before indexing.

Chunking improves retrieval quality because:

  • Smaller chunks are easier to match
  • Context becomes more precise
  • Retrieval relevance improves

Common Chunking Methods

Fixed-Size Chunking

Documents are split into equal-sized chunks.

Advantages:

  • Simple
  • Easy to implement

Disadvantages:

  • May split important context

Semantic Chunking

Documents are split based on meaning or structure.

Advantages:

  • Better contextual integrity
  • Improved retrieval quality

Disadvantages:

  • More complex

Overlapping Chunks

Adjacent chunks share some content.

Advantages:

  • Preserves context continuity
  • Improves retrieval accuracy

Disadvantages:

  • Increased storage usage

Choosing a Chunking Strategy

Use Fixed-Size Chunking When:

  • Simplicity is important
  • Documents are uniform
  • Rapid implementation is needed

Use Semantic Chunking When:

  • Context preservation matters
  • Documents contain sections/topics
  • Retrieval quality is critical

Use Overlapping Chunks When:

  • Context continuity is important
  • Long-form content is indexed

Metadata Filtering

Indexes may include metadata such as:

  • Author
  • Date
  • Department
  • Category
  • Security level

Metadata filtering improves:

  • Precision
  • Security
  • Retrieval efficiency

Example Metadata Filtering Scenario

An enterprise chatbot retrieves only documents:

  • From HR
  • Created within the last year
  • Approved for employee access

Metadata filters help enforce these constraints.


Retrieval-Augmented Generation (RAG)

What Is RAG?

Retrieval-Augmented Generation combines retrieval systems with LLMs.

The workflow:

  1. User submits a query
  2. Query becomes an embedding
  3. Vector search retrieves relevant documents
  4. Retrieved content is added to the prompt
  5. LLM generates grounded response

Benefits of RAG

RAG helps:

  • Reduce hallucinations
  • Use current enterprise data
  • Avoid retraining models
  • Improve factual accuracy
  • Support enterprise AI assistants

Choosing Retrieval Methods for RAG

Keyword Search

Best for:

  • Exact terminology
  • Compliance searches
  • Structured queries

Vector Search

Best for:

  • Semantic similarity
  • Natural language queries
  • Conversational AI

Hybrid Search

Best for:

  • Enterprise copilots
  • High-quality retrieval
  • Production RAG systems

Indexing Pipelines

What Is an Indexing Pipeline?

An indexing pipeline automates:

  • Data ingestion
  • Document parsing
  • Chunking
  • Embedding generation
  • Metadata extraction
  • Index updates

AI Enrichment

Azure AI Search supports AI enrichment during indexing.

AI enrichment may include:

  • OCR
  • Entity extraction
  • Key phrase extraction
  • Language detection
  • Image analysis

Incremental Indexing

Incremental indexing updates only changed documents.

Benefits:

  • Faster indexing
  • Lower compute costs
  • Better scalability

Full Reindexing

Full reindexing rebuilds the entire index.

Use when:

  • Schema changes occur
  • Embedding models change
  • Large structural updates are required

Choosing an Indexing Strategy

Use Incremental Indexing When:

  • Data changes frequently
  • Efficiency matters
  • Large datasets exist

Use Full Reindexing When:

  • Major schema updates occur
  • Embedding strategy changes
  • Large-scale restructuring is required

Security and Access Control

Retrieval systems often include:

  • Role-based access control
  • Document-level security
  • Metadata-based filtering

This ensures users retrieve only authorized content.


Common AI-103 Scenarios

Scenario 1: Enterprise Knowledge Assistant

Requirements:

  • Conversational search
  • Semantic retrieval
  • Enterprise grounding

Recommended Approach:

  • Azure AI Search
  • Embeddings
  • Hybrid search
  • RAG

Scenario 2: Compliance Document Search

Requirements:

  • Exact terminology
  • Legal references
  • Precision retrieval

Recommended Approach:

  • Keyword search
  • Metadata filtering

Scenario 3: AI Copilot

Requirements:

  • Natural language queries
  • Contextual retrieval
  • Strong relevance

Recommended Approach:

  • Hybrid search
  • Vector search
  • Embeddings

Scenario 4: Product Recommendation System

Requirements:

  • Similarity matching
  • Semantic relationships

Recommended Approach:

  • Embeddings
  • Vector search

Common AI-103 Exam Tips

Understand Retrieval Tradeoffs

Keyword Search

  • Fast
  • Exact matching
  • Weak semantic understanding

Semantic Search

  • Better contextual understanding
  • More advanced relevance

Vector Search

  • Best for semantic similarity
  • Requires embeddings

Hybrid Search

  • Often best overall retrieval quality

Know the Relationship Between Embeddings and Vector Search

Embeddings enable vector search.

Without embeddings, vector search cannot function.


Understand RAG Architectures

RAG combines:

  • Retrieval
  • Indexing
  • Vector search
  • LLMs

This is one of the MOST important AI-103 topics.


Learn Chunking Concepts

Chunking affects:

  • Retrieval quality
  • Context preservation
  • Index efficiency

Chunking questions commonly appear in scenario-based exam questions.


Summary

Retrieval and indexing are foundational components of modern AI systems.

For the AI-103 exam, you should understand:

  • Keyword search
  • Semantic search
  • Vector search
  • Hybrid search
  • Embeddings
  • Chunking strategies
  • Metadata filtering
  • Indexing pipelines
  • Incremental indexing
  • RAG architectures
  • Azure AI Search capabilities

Choosing the correct retrieval and indexing approach directly affects:

  • AI accuracy
  • Groundedness
  • Scalability
  • Cost
  • Performance
  • User experience

Strong retrieval systems are essential for enterprise AI copilots, chatbots, and AI agents.


Practice Exam Questions

Question 1

Which retrieval method relies primarily on exact word matching?

A. Vector search
B. Semantic search
C. Keyword search
D. Hybrid search

Answer

C. Keyword search

Explanation

Keyword search retrieves content using exact lexical matches.


Question 2

Which retrieval method uses embeddings to identify semantically similar content?

A. Keyword search
B. Vector search
C. Lexical search
D. Metadata search

Answer

B. Vector search

Explanation

Vector search uses embeddings to perform similarity matching.


Question 3

What is the primary benefit of Retrieval-Augmented Generation (RAG)?

A. Eliminates embeddings
B. Improves groundedness using retrieved information
C. Removes the need for indexing
D. Replaces semantic search

Answer

B. Improves groundedness using retrieved information

Explanation

RAG improves factual accuracy by grounding responses with retrieved data.


Question 4

Which Azure service is MOST commonly used for enterprise vector search?

A. Azure AI Search
B. Azure DNS
C. Azure Backup
D. Azure Load Balancer

Answer

A. Azure AI Search

Explanation

Azure AI Search provides vector indexing and retrieval capabilities.


Question 5

What is the purpose of chunking during indexing?

A. Encrypt documents
B. Break documents into smaller searchable sections
C. Compress embeddings
D. Eliminate metadata

Answer

B. Break documents into smaller searchable sections

Explanation

Chunking improves retrieval quality and contextual matching.


Question 6

Which search method combines vector search, semantic ranking, and keyword matching?

A. Binary search
B. Metadata search
C. Hybrid search
D. OCR search

Answer

C. Hybrid search

Explanation

Hybrid search combines multiple retrieval methods.


Question 7

What is the primary purpose of embeddings?

A. Encrypt data
B. Create semantic vector representations
C. Compress images
D. Improve OCR quality

Answer

B. Create semantic vector representations

Explanation

Embeddings convert content into vectors representing semantic meaning.


Question 8

Which chunking strategy helps preserve context continuity between adjacent chunks?

A. Fixed chunking
B. Metadata chunking
C. Overlapping chunks
D. Compression chunking

Answer

C. Overlapping chunks

Explanation

Overlapping chunks preserve continuity across document sections.


Question 9

When is incremental indexing MOST appropriate?

A. When rebuilding the entire schema
B. When documents change frequently
C. When changing embedding models
D. When deleting the index

Answer

B. When documents change frequently

Explanation

Incremental indexing updates only modified documents.


Question 10

Which retrieval approach is MOST appropriate for enterprise AI copilots requiring high-quality relevance?

A. Keyword search only
B. Hybrid search
C. Metadata filtering only
D. OCR search

Answer

B. Hybrid search

Explanation

Hybrid search combines multiple retrieval methods for improved relevance.


Go to the AI-103 Exam Prep Hub main page

Leave a comment