This post is a part of the AI-103: Develop AI Apps and Agents on Azure Exam Prep Hub.
This topic falls under these sections:
Implement information extraction solutions (10–15%)
--> Build retrieval and grounding pipelines
--> Configure semantic search, hybrid search, and vector search for Grounding
Note that there are 10 practice questions (with answers and explanations) at the end of each section to help you solidify your knowledge of the material. Also, there are 2 practice tests with 60 questions each available from the hub's main page below the exam topics section.
Introduction
For the AI-103: Develop AI Apps and Agents on Azure certification exam, one of the most important modern AI concepts is understanding how to configure and use:
- Semantic search
- Vector search
- Hybrid search
These technologies are foundational to:
- Retrieval-Augmented Generation (RAG)
- AI agents
- Enterprise copilots
- Knowledge mining systems
- Grounded AI applications
In modern Azure AI architectures, these search methods help Large Language Models (LLMs) retrieve relevant enterprise content so responses are accurate, current, and grounded in trusted data.
Why Grounding Matters
LLMs such as those used through Azure OpenAI Service are powerful, but they have limitations:
- They may hallucinate
- Their training data may be outdated
- They do not automatically know private organizational data
- They cannot inherently access enterprise documents
Grounding solves this problem.
What Is Grounding?
Grounding means providing an AI model with relevant external data during inference.
Example:
User Question:"What is our company travel reimbursement policy?"AI Workflow:1. Retrieve policy document chunks2. Provide chunks to LLM3. Generate grounded answer
Without grounding, the model might invent an answer.
With grounding, the response is based on actual company documentation.
Core Azure Services Used
Several Azure services commonly appear in grounding architectures.
| Service | Purpose |
|---|---|
| Azure AI Search | Search indexes, vector search, semantic ranking |
| Azure OpenAI Service | Embeddings generation and LLM responses |
| Azure Blob Storage | Store source documents |
| Azure AI Document Intelligence | Extract document content |
| Azure AI Foundry | Build AI agents and orchestration workflows |
Understanding Search Types
There are three major search approaches you must understand for AI-103:
| Search Type | Main Purpose |
|---|---|
| Keyword Search | Exact text matching |
| Semantic Search | Meaning-based ranking |
| Vector Search | Embedding similarity |
| Hybrid Search | Combines keyword + semantic + vector |
Traditional Keyword Search
Traditional search relies on:
- Exact matches
- Tokens
- Lexical analysis
Example:
Search Query:"reset password"
Documents containing:
"reset password"
will rank highly.
However, keyword search struggles with:
- Synonyms
- Context
- Natural language intent
Example:
"change account credentials"
may not match well.
Semantic Search
What Is Semantic Search?
Semantic search improves retrieval by understanding:
- Context
- Meaning
- Intent
- Relationships between words
Instead of only exact keywords, semantic search uses language understanding to improve ranking quality.
How Semantic Search Works
Semantic search:
- Interprets user intent
- Understands relationships between phrases
- Re-ranks search results
- Produces more relevant answers
Example:
User Query:"How do I update my login information?"
Semantic search may retrieve:
"Instructions for changing account credentials"
even without exact keyword matches.
Semantic Ranking
In Azure AI Search, semantic ranking:
- Reorders results based on relevance
- Uses deep language models
- Improves natural language search experiences
Important AI-103 point:
Semantic search enhances ranking, but it does not replace vector search.
Semantic Captions and Answers
Azure AI Search semantic search can generate:
- Semantic captions
- Semantic answers
Semantic Captions
Short highlighted summaries from documents.
Semantic Answers
Direct answers extracted from indexed content.
Example:
Question:"What is the vacation accrual policy?"Semantic answer:"Employees accrue 10 vacation days annually."
Vector Search
What Is Vector Search?
Vector search uses embeddings to retrieve semantically similar content.
Instead of matching keywords, vector search compares numerical vectors.
What Are Embeddings?
Embeddings are numerical representations of content.
Words or concepts with similar meanings are placed near each other in vector space.
Example:
"car""automobile""vehicle"
These concepts become mathematically similar vectors.
Embedding Generation
Embeddings are commonly generated using models in:
- Azure OpenAI Service
- Azure AI Foundry models
Typical embedding workflow:
- Chunk documents
- Generate embeddings
- Store vectors in search index
- Generate embedding for user query
- Retrieve nearest vectors
Vector Search Workflow
Document Chunk ↓Embedding Model ↓Vector Embedding ↓Stored in Search Index
Query workflow:
User Query ↓Embedding Model ↓Query Vector ↓Nearest Neighbor Search
Nearest Neighbor Search
Vector databases use similarity calculations such as:
- Cosine similarity
- Euclidean distance
The system retrieves content with the closest vectors.
Important exam concept:
Vector similarity measures semantic closeness.
Configuring Vector Search in Azure AI Search
To configure vector search, you typically:
- Create vector-enabled fields
- Generate embeddings
- Store embeddings in index
- Configure vector search profiles
- Execute vector queries
Example Vector Index Structure
Example fields:
| Field | Type |
|---|---|
| id | String |
| content | String |
| contentVector | Collection(Float) |
| title | String |
The vector field stores embeddings.
Vector Dimensions
Embedding models produce vectors with fixed dimensions.
Example:
1536 dimensions
Important:
The vector field dimension must match the embedding model output.
Hybrid Search
What Is Hybrid Search?
Hybrid search combines:
- Keyword search
- Semantic ranking
- Vector similarity
This is one of the most important AI-103 topics.
Why Hybrid Search Matters
Each search method has strengths and weaknesses.
| Method | Strength |
|---|---|
| Keyword search | Exact matching |
| Semantic search | Better ranking/context |
| Vector search | Conceptual similarity |
Hybrid search combines all three for optimal retrieval quality.
Hybrid Search Architecture
User Query ↓Keyword Search +Vector Search ↓Combined Results ↓Semantic Re-ranking ↓Top Grounding Results
This architecture is extremely common in enterprise RAG systems.
Why Hybrid Search Is Recommended
Hybrid search improves:
- Recall
- Precision
- Relevance
- Context matching
- Grounding quality
This reduces hallucinations and improves AI responses.
Retrieval-Augmented Generation (RAG)
What Is RAG?
RAG combines:
- Retrieval systems
- External knowledge
- Generative AI
Workflow:
User Query ↓Search Retrieval ↓Relevant Chunks ↓LLM Prompt ↓Grounded Response
Grounding Pipeline Example
Documents in Blob Storage ↓Azure AI Search Indexer ↓Chunking ↓Embedding Generation ↓Vector Index ↓Hybrid Search Retrieval ↓Azure OpenAI Prompt ↓Grounded Response
This pipeline appears frequently in AI-103 scenarios.
Chunking and Retrieval Quality
Chunking directly affects search quality.
Good chunks:
- Preserve meaning
- Fit token limits
- Improve embedding relevance
Poor chunking causes:
- Incomplete answers
- Lost context
- Lower retrieval accuracy
Semantic vs Vector Search
| Semantic Search | Vector Search |
|---|---|
| Improves ranking | Retrieves by embedding similarity |
| Language understanding | Numerical vector comparison |
| Works with textual relevance | Works with semantic proximity |
| Re-ranking layer | Retrieval mechanism |
Important:
These technologies complement each other.
Filtering in Grounding Pipelines
Metadata filtering improves retrieval quality.
Common filters:
- Department
- Security level
- Document type
- Date
- Language
Example:
department = Finance
This limits retrieval scope.
Security Trimming
Enterprise grounding systems often require:
- RBAC
- Document-level security
- Identity-aware retrieval
Important exam concept:
Users should retrieve only authorized content.
Performance Optimization
Key optimization techniques:
- Proper chunk sizes
- Embedding caching
- Hybrid search
- Metadata filtering
- Incremental indexing
- Semantic ranking
Common AI-103 Scenarios
Scenario 1
You need a chatbot that answers using internal PDFs.
Solution:
- Azure AI Search
- Embeddings
- Vector search
- Hybrid search
- Azure OpenAI
Scenario 2
You need better ranking for natural language queries.
Solution:
- Semantic search
- Semantic ranking
Scenario 3
You need concept-based retrieval rather than keyword matching.
Solution:
- Vector search
Scenario 4
You need maximum retrieval accuracy.
Solution:
- Hybrid search
Important AI-103 Exam Tips
Know These Core Concepts
| Concept | Key Purpose |
|---|---|
| Embeddings | Vector representation |
| Vector search | Semantic retrieval |
| Semantic ranking | Better result ordering |
| Hybrid search | Combined retrieval |
| Grounding | Providing trusted context |
| Chunking | Breaking documents into manageable pieces |
Frequently Tested Knowledge Areas
Expect questions involving:
- RAG architectures
- Embedding generation
- Vector-enabled indexes
- Hybrid retrieval
- Semantic ranking
- Grounding pipelines
- Azure AI Search configuration
- Chunking strategies
Final Thoughts
Semantic search, vector search, and hybrid search are foundational technologies for modern AI systems on Azure.
For AI-103, focus heavily on:
- How embeddings work
- When to use vector search
- Why hybrid search is recommended
- How semantic ranking improves results
- How grounding reduces hallucinations
- How Azure AI Search integrates with Azure OpenAI
These concepts are central to enterprise AI agents, copilots, and generative AI applications.
Practice Exam Questions
Question 1
What is the primary purpose of grounding in a generative AI solution?
A. Reduce storage costs
B. Train foundation models
C. Provide trusted external context to the LLM
D. Encrypt embeddings
Answer
C. Provide trusted external context to the LLM
Question 2
Which Azure service commonly provides vector search capabilities?
A. Azure Monitor
B. Azure AI Search
C. Azure Virtual Machines
D. Azure Backup
Answer
B. Azure AI Search
Question 3
What are embeddings used for in vector search?
A. Encryption
B. Data compression
C. Numerical semantic representations
D. OCR processing
Answer
C. Numerical semantic representations
Question 4
Which search type is best at retrieving semantically similar concepts even when keywords differ?
A. Boolean search
B. Lexical search
C. Metadata search
D. Vector search
Answer
D. Vector search
Question 5
What does hybrid search combine?
A. OCR and translation
B. Keyword and vector search
C. SQL and NoSQL databases
D. Blob storage and Cosmos DB
Answer
B. Keyword and vector search
Question 6
What is the role of semantic ranking in Azure AI Search?
A. Improve relevance ordering of results
B. Encrypt search indexes
C. Generate embeddings
D. Compress vectors
Answer
A. Improve relevance ordering of results
Question 7
Which process converts text into numerical vectors?
A. OCR
B. Tokenization
C. Embedding generation
D. Semantic ranking
Answer
C. Embedding generation
Question 8
Why is chunking important in grounding pipelines?
A. It removes duplicate users
B. It reduces RBAC complexity
C. It improves retrieval relevance and token management
D. It encrypts documents
Answer
C. It improves retrieval relevance and token management
Question 9
Which search approach generally provides the best retrieval quality for enterprise RAG applications?
A. Keyword search only
B. Vector search only
C. SQL full-text search
D. Hybrid search
Answer
D. Hybrid search
Question 10
Which statement best describes semantic search?
A. It only retrieves exact keyword matches
B. It uses language understanding to improve relevance
C. It replaces embeddings entirely
D. It only works on structured databases
Answer
B. It uses language understanding to improve relevance
Go to the AI-103 Exam Prep Hub main page
