This post is a part of the AI-103: Develop AI Apps and Agents on Azure Exam Prep Hub.
This topic falls under these sections:
Plan and manage an Azure AI solution (25–30%)
--> Choose the appropriate Foundry services for generative AI and agents
--> Choose an appropriate method for retrieval and indexing
Note that there are 10 practice questions (with answers and explanations) at the end of each section to help you solidify your knowledge of the material. Also, there are 2 practice tests with 60 questions each available from the hub's main page below the exam topics section.
Introduction
One of the most important concepts in modern AI applications is the ability to retrieve the correct information efficiently and accurately.
The AI-103: Develop AI Apps and Agents on Azure certification exam heavily tests knowledge related to:
- Retrieval methods
- Indexing strategies
- Vector search
- Semantic search
- Retrieval-Augmented Generation (RAG)
- Hybrid search
- Embeddings
- Knowledge grounding
Modern AI systems are often only as effective as their retrieval systems.
Even highly advanced Large Language Models (LLMs) can:
- Hallucinate
- Provide outdated information
- Miss relevant context
Retrieval and indexing systems solve these problems by providing grounded, relevant, and searchable information to AI applications.
For the AI-103 exam, you should understand:
- Different retrieval methods
- Different indexing approaches
- When to use vector search
- When keyword search is appropriate
- When hybrid search is preferred
- How embeddings support retrieval
- How Azure AI Search supports enterprise AI systems
- How RAG architectures work
What Is Retrieval?
Retrieval is the process of locating and returning relevant information from a data source.
Examples include:
- Searching documents
- Finding relevant knowledge articles
- Retrieving product descriptions
- Returning similar documents
- Finding semantically related content
Retrieval is essential for:
- AI copilots
- Enterprise chatbots
- Knowledge assistants
- Search applications
- Recommendation systems
- AI agents
What Is Indexing?
Indexing is the process of organizing data to make retrieval efficient.
An index acts like a searchable map of content.
Without indexing:
- Searches are slower
- Retrieval is inefficient
- AI systems scale poorly
Indexes may include:
- Keywords
- Metadata
- Embeddings
- Semantic relationships
- Document structure
Why Retrieval and Indexing Matter in AI
Modern generative AI applications often use Retrieval-Augmented Generation (RAG).
RAG combines:
- Retrieval systems
- Search indexes
- Embeddings
- LLMs
This allows AI systems to:
- Access current information
- Use enterprise knowledge
- Reduce hallucinations
- Provide grounded answers
- Improve accuracy
Azure Services for Retrieval and Indexing
The primary Azure service for retrieval and indexing is:
- Azure AI Search
Additional supporting services include:
- Azure OpenAI
- Embedding models
- Azure Cosmos DB
- Azure SQL Database
- Azure Blob Storage
Azure AI Search
Azure AI Search is Microsoft’s enterprise search platform.
It supports:
- Full-text search
- Semantic search
- Vector search
- Hybrid search
- AI enrichment
- Indexing pipelines
Azure AI Search is a core AI-103 exam topic.
Retrieval Methods
There are several major retrieval methods you must understand for AI-103.
Keyword Search
What Is Keyword Search?
Keyword search retrieves documents based on exact word matches.
Example:
Searching for:
“cloud security”
Returns documents containing those exact terms.
Advantages of Keyword Search
- Fast
- Simple
- Efficient for exact matches
- Mature technology
- Works well for structured terminology
Limitations of Keyword Search
Keyword search struggles with:
- Synonyms
- Contextual meaning
- Natural language understanding
- Conceptual similarity
Example:
A search for:
“car”
May not return documents containing:
“vehicle”
When to Use Keyword Search
Use keyword search when:
- Exact term matching is important
- Queries are highly structured
- Performance and simplicity matter
- Semantic understanding is unnecessary
Semantic Search
What Is Semantic Search?
Semantic search understands meaning and context rather than relying only on exact words.
It uses AI to interpret:
- Intent
- Context
- Relationships between concepts
Example of Semantic Search
A query for:
“How do I secure cloud infrastructure?”
May retrieve documents about:
- Azure security
- Network protection
- Cloud compliance
Even if the exact words differ.
Advantages of Semantic Search
- Better contextual understanding
- Improved relevance
- More natural interactions
- Better user experience
Limitations of Semantic Search
- More computationally expensive
- May increase latency
- Requires more advanced indexing
When to Use Semantic Search
Use semantic search when:
- Natural language queries are common
- Relevance is important
- Users may not know exact terminology
- Context matters
Vector Search
What Is Vector Search?
Vector search retrieves information using embeddings.
Embeddings are numerical vector representations of content.
Documents with similar meaning have vectors that are mathematically close.
How Vector Search Works
- Documents are converted into embeddings
- Embeddings are stored in a vector index
- User queries are converted into embeddings
- Similarity algorithms identify related vectors
- Relevant documents are returned
Advantages of Vector Search
- Excellent semantic similarity matching
- Supports RAG architectures
- Finds conceptually related content
- Works well with natural language queries
Limitations of Vector Search
- Higher storage requirements
- More computational overhead
- Requires embedding generation
- More complex implementation
When to Use Vector Search
Use vector search when:
- Building RAG systems
- Implementing AI copilots
- Performing semantic retrieval
- Supporting conversational AI
- Searching unstructured content
Hybrid Search
What Is Hybrid Search?
Hybrid search combines:
- Keyword search
- Semantic search
- Vector search
This approach often produces the best retrieval quality.
Why Hybrid Search Matters
Hybrid search combines the strengths of multiple retrieval approaches.
Benefits include:
- Exact keyword matching
- Semantic understanding
- Contextual similarity
- Improved ranking quality
When to Use Hybrid Search
Use hybrid search when:
- High retrieval quality is required
- Enterprise search is needed
- AI copilots require strong grounding
- Search relevance is critical
Hybrid search is commonly used in production RAG systems.
Embeddings
What Are Embeddings?
Embeddings are numerical representations of data.
Embedding models transform:
- Text
- Images
- Documents
Into vectors.
Embeddings capture semantic meaning.
Embedding Models
Azure OpenAI provides embedding models used for:
- Vector search
- Similarity matching
- RAG systems
- Recommendation systems
Chunking Strategies
What Is Chunking?
Chunking is the process of breaking large documents into smaller sections before indexing.
Chunking improves retrieval quality because:
- Smaller chunks are easier to match
- Context becomes more precise
- Retrieval relevance improves
Common Chunking Methods
Fixed-Size Chunking
Documents are split into equal-sized chunks.
Advantages:
- Simple
- Easy to implement
Disadvantages:
- May split important context
Semantic Chunking
Documents are split based on meaning or structure.
Advantages:
- Better contextual integrity
- Improved retrieval quality
Disadvantages:
- More complex
Overlapping Chunks
Adjacent chunks share some content.
Advantages:
- Preserves context continuity
- Improves retrieval accuracy
Disadvantages:
- Increased storage usage
Choosing a Chunking Strategy
Use Fixed-Size Chunking When:
- Simplicity is important
- Documents are uniform
- Rapid implementation is needed
Use Semantic Chunking When:
- Context preservation matters
- Documents contain sections/topics
- Retrieval quality is critical
Use Overlapping Chunks When:
- Context continuity is important
- Long-form content is indexed
Metadata Filtering
Indexes may include metadata such as:
- Author
- Date
- Department
- Category
- Security level
Metadata filtering improves:
- Precision
- Security
- Retrieval efficiency
Example Metadata Filtering Scenario
An enterprise chatbot retrieves only documents:
- From HR
- Created within the last year
- Approved for employee access
Metadata filters help enforce these constraints.
Retrieval-Augmented Generation (RAG)
What Is RAG?
Retrieval-Augmented Generation combines retrieval systems with LLMs.
The workflow:
- User submits a query
- Query becomes an embedding
- Vector search retrieves relevant documents
- Retrieved content is added to the prompt
- LLM generates grounded response
Benefits of RAG
RAG helps:
- Reduce hallucinations
- Use current enterprise data
- Avoid retraining models
- Improve factual accuracy
- Support enterprise AI assistants
Choosing Retrieval Methods for RAG
Keyword Search
Best for:
- Exact terminology
- Compliance searches
- Structured queries
Vector Search
Best for:
- Semantic similarity
- Natural language queries
- Conversational AI
Hybrid Search
Best for:
- Enterprise copilots
- High-quality retrieval
- Production RAG systems
Indexing Pipelines
What Is an Indexing Pipeline?
An indexing pipeline automates:
- Data ingestion
- Document parsing
- Chunking
- Embedding generation
- Metadata extraction
- Index updates
AI Enrichment
Azure AI Search supports AI enrichment during indexing.
AI enrichment may include:
- OCR
- Entity extraction
- Key phrase extraction
- Language detection
- Image analysis
Incremental Indexing
Incremental indexing updates only changed documents.
Benefits:
- Faster indexing
- Lower compute costs
- Better scalability
Full Reindexing
Full reindexing rebuilds the entire index.
Use when:
- Schema changes occur
- Embedding models change
- Large structural updates are required
Choosing an Indexing Strategy
Use Incremental Indexing When:
- Data changes frequently
- Efficiency matters
- Large datasets exist
Use Full Reindexing When:
- Major schema updates occur
- Embedding strategy changes
- Large-scale restructuring is required
Security and Access Control
Retrieval systems often include:
- Role-based access control
- Document-level security
- Metadata-based filtering
This ensures users retrieve only authorized content.
Common AI-103 Scenarios
Scenario 1: Enterprise Knowledge Assistant
Requirements:
- Conversational search
- Semantic retrieval
- Enterprise grounding
Recommended Approach:
- Azure AI Search
- Embeddings
- Hybrid search
- RAG
Scenario 2: Compliance Document Search
Requirements:
- Exact terminology
- Legal references
- Precision retrieval
Recommended Approach:
- Keyword search
- Metadata filtering
Scenario 3: AI Copilot
Requirements:
- Natural language queries
- Contextual retrieval
- Strong relevance
Recommended Approach:
- Hybrid search
- Vector search
- Embeddings
Scenario 4: Product Recommendation System
Requirements:
- Similarity matching
- Semantic relationships
Recommended Approach:
- Embeddings
- Vector search
Common AI-103 Exam Tips
Understand Retrieval Tradeoffs
Keyword Search
- Fast
- Exact matching
- Weak semantic understanding
Semantic Search
- Better contextual understanding
- More advanced relevance
Vector Search
- Best for semantic similarity
- Requires embeddings
Hybrid Search
- Often best overall retrieval quality
Know the Relationship Between Embeddings and Vector Search
Embeddings enable vector search.
Without embeddings, vector search cannot function.
Understand RAG Architectures
RAG combines:
- Retrieval
- Indexing
- Vector search
- LLMs
This is one of the MOST important AI-103 topics.
Learn Chunking Concepts
Chunking affects:
- Retrieval quality
- Context preservation
- Index efficiency
Chunking questions commonly appear in scenario-based exam questions.
Summary
Retrieval and indexing are foundational components of modern AI systems.
For the AI-103 exam, you should understand:
- Keyword search
- Semantic search
- Vector search
- Hybrid search
- Embeddings
- Chunking strategies
- Metadata filtering
- Indexing pipelines
- Incremental indexing
- RAG architectures
- Azure AI Search capabilities
Choosing the correct retrieval and indexing approach directly affects:
- AI accuracy
- Groundedness
- Scalability
- Cost
- Performance
- User experience
Strong retrieval systems are essential for enterprise AI copilots, chatbots, and AI agents.
Practice Exam Questions
Question 1
Which retrieval method relies primarily on exact word matching?
A. Vector search
B. Semantic search
C. Keyword search
D. Hybrid search
Answer
C. Keyword search
Explanation
Keyword search retrieves content using exact lexical matches.
Question 2
Which retrieval method uses embeddings to identify semantically similar content?
A. Keyword search
B. Vector search
C. Lexical search
D. Metadata search
Answer
B. Vector search
Explanation
Vector search uses embeddings to perform similarity matching.
Question 3
What is the primary benefit of Retrieval-Augmented Generation (RAG)?
A. Eliminates embeddings
B. Improves groundedness using retrieved information
C. Removes the need for indexing
D. Replaces semantic search
Answer
B. Improves groundedness using retrieved information
Explanation
RAG improves factual accuracy by grounding responses with retrieved data.
Question 4
Which Azure service is MOST commonly used for enterprise vector search?
A. Azure AI Search
B. Azure DNS
C. Azure Backup
D. Azure Load Balancer
Answer
A. Azure AI Search
Explanation
Azure AI Search provides vector indexing and retrieval capabilities.
Question 5
What is the purpose of chunking during indexing?
A. Encrypt documents
B. Break documents into smaller searchable sections
C. Compress embeddings
D. Eliminate metadata
Answer
B. Break documents into smaller searchable sections
Explanation
Chunking improves retrieval quality and contextual matching.
Question 6
Which search method combines vector search, semantic ranking, and keyword matching?
A. Binary search
B. Metadata search
C. Hybrid search
D. OCR search
Answer
C. Hybrid search
Explanation
Hybrid search combines multiple retrieval methods.
Question 7
What is the primary purpose of embeddings?
A. Encrypt data
B. Create semantic vector representations
C. Compress images
D. Improve OCR quality
Answer
B. Create semantic vector representations
Explanation
Embeddings convert content into vectors representing semantic meaning.
Question 8
Which chunking strategy helps preserve context continuity between adjacent chunks?
A. Fixed chunking
B. Metadata chunking
C. Overlapping chunks
D. Compression chunking
Answer
C. Overlapping chunks
Explanation
Overlapping chunks preserve continuity across document sections.
Question 9
When is incremental indexing MOST appropriate?
A. When rebuilding the entire schema
B. When documents change frequently
C. When changing embedding models
D. When deleting the index
Answer
B. When documents change frequently
Explanation
Incremental indexing updates only modified documents.
Question 10
Which retrieval approach is MOST appropriate for enterprise AI copilots requiring high-quality relevance?
A. Keyword search only
B. Hybrid search
C. Metadata filtering only
D. OCR search
Answer
B. Hybrid search
Explanation
Hybrid search combines multiple retrieval methods for improved relevance.
Go to the AI-103 Exam Prep Hub main page
