Tag: AI Models

Identify an appropriate AI model, based on capabilities (AI-901 Exam Prep)

This post is a part of the AI-901: Microsoft Azure AI Fundamentals Exam Prep Hub. 
This topic falls under these sections:
Identify AI concepts and capabilities (40–45%)
--> Identify AI model components and configurations
--> Identify an appropriate AI model, based on capabilities


Note that there are 10 practice questions (with answers and explanations) for each section to help you solidify your knowledge of the material. Also, there are 2 practice tests with 60 questions each available on the hub below the exam topics section.

Selecting the correct AI model for a specific business problem is an important skill and a key topic for the AI-901 certification exam. Microsoft expects candidates to understand the capabilities of common AI model types and recognize which model is appropriate for different scenarios.

This topic falls under the “Identify AI model components and configurations” section of the exam objectives.


Why Choosing the Right AI Model Matters

Different AI models are designed for different types of tasks.

Choosing the wrong model may lead to:

  • Poor accuracy
  • Inefficient processing
  • Increased costs
  • Unusable results
  • Poor user experiences

Understanding model capabilities helps organizations build effective AI solutions.


Major Categories of AI Models

For AI-901, you should understand the capabilities of several major AI model categories:

  • Classification models
  • Regression models
  • Clustering models
  • Computer vision models
  • Natural language processing (NLP) models
  • Generative AI models
  • Recommendation systems
  • Anomaly detection models

Classification Models

Classification models predict categories or labels.

They answer questions such as:

  • “What type is this?”
  • “Which category does this belong to?”

Common Use Cases

  • Spam email detection
  • Fraud detection
  • Sentiment analysis
  • Medical diagnosis classification
  • Image categorization

Example

A model predicts whether an email is:

  • Spam
  • Not spam

This is a classification problem.


Binary Classification

Binary classification predicts one of two possible outcomes.

Examples

  • Fraud or not fraud
  • Approved or denied
  • Positive or negative sentiment

Multiclass Classification

Multiclass classification predicts one of several categories.

Example

An AI model identifies whether an image contains:

  • A dog
  • A cat
  • A bird
  • A horse

Regression Models

Regression models predict numeric values.

They answer questions such as:

  • “How much?”
  • “How many?”
  • “What value?”

Common Use Cases

  • House price prediction
  • Sales forecasting
  • Temperature prediction
  • Demand estimation

Example

Predicting the selling price of a house based on:

  • Size
  • Location
  • Number of bedrooms

This is a regression problem.


Clustering Models

Clustering models group similar items together without predefined labels.

Clustering is a type of unsupervised learning.

Common Use Cases

  • Customer segmentation
  • Market analysis
  • Pattern discovery
  • Grouping similar documents

Example

A retailer groups customers based on purchasing behavior.

The model discovers patterns automatically.


Computer Vision Models

Computer vision models analyze images and video.

Common Capabilities

  • Object detection
  • Facial recognition
  • Image classification
  • Optical Character Recognition (OCR)
  • Image tagging

Example Use Cases

  • Self-driving cars
  • Security systems
  • Medical imaging
  • Product identification

Image Classification

Image classification identifies what appears in an image.

Example

Determining whether an image contains:

  • A cat
  • A dog
  • A car

Object Detection

Object detection identifies and locates objects within an image.

Example

A traffic monitoring system detects:

  • Cars
  • Pedestrians
  • Traffic lights

and determines their positions.


Optical Character Recognition (OCR)

OCR extracts text from images or scanned documents.

Example

Reading text from:

  • Receipts
  • Invoices
  • Forms
  • License plates

Natural Language Processing (NLP) Models

NLP models work with human language.

Common Capabilities

  • Sentiment analysis
  • Translation
  • Text summarization
  • Chatbots
  • Speech recognition
  • Named entity recognition

Example Use Cases

  • Customer support chatbots
  • Language translation apps
  • Voice assistants

Sentiment Analysis

Sentiment analysis identifies emotional tone in text.

Example

Determining whether a product review is:

  • Positive
  • Negative
  • Neutral

Translation Models

Translation models convert text between languages.

Example

Converting English text into Spanish.


Speech Recognition

Speech recognition converts spoken language into text.

Example

Voice assistants converting speech commands into written text.


Generative AI Models

Generative AI models create new content.

Common Outputs

  • Text
  • Images
  • Audio
  • Video
  • Code

Example Use Cases

  • AI chatbots
  • Content generation
  • Image creation
  • Coding assistants

Large Language Models (LLMs)

LLMs are generative AI models focused on language tasks.

Capabilities

  • Conversations
  • Summarization
  • Question answering
  • Content generation
  • Code generation

Example

An AI assistant answering user questions in natural language.


Recommendation Systems

Recommendation systems suggest items users may prefer.

Common Use Cases

  • Product recommendations
  • Movie recommendations
  • Music recommendations
  • Online advertising

Example

An online retailer recommends products based on browsing history.


Anomaly Detection Models

Anomaly detection models identify unusual patterns or behaviors.

Common Use Cases

  • Fraud detection
  • Cybersecurity monitoring
  • Equipment failure prediction
  • Network intrusion detection

Example

A bank identifies suspicious credit card transactions.


Supervised vs. Unsupervised Learning

Understanding learning types helps identify appropriate models.

Learning TypeDescription
Supervised LearningUses labeled data
Unsupervised LearningFinds patterns without labels

Supervised Examples

  • Classification
  • Regression

Unsupervised Examples

  • Clustering
  • Some anomaly detection systems

Choosing the Right AI Model

To select an appropriate AI model, ask:


What Type of Output Is Needed?

GoalModel Type
Predict categoriesClassification
Predict numbersRegression
Group similar itemsClustering
Generate contentGenerative AI
Analyze imagesComputer Vision
Process languageNLP

Is the Data Labeled?

Data TypeAppropriate Learning Type
Labeled dataSupervised learning
Unlabeled dataUnsupervised learning

What Content Is Being Processed?

Content TypeAppropriate Model
TextNLP or LLM
ImagesComputer Vision
AudioSpeech models
Numerical dataRegression or classification

Real-World Examples


Scenario 1: Email Spam Detection

Goal

Identify whether emails are spam.

Best Model

Classification model


Scenario 2: Predicting House Prices

Goal

Estimate home values.

Best Model

Regression model


Scenario 3: Grouping Customers by Buying Behavior

Goal

Identify customer segments.

Best Model

Clustering model


Scenario 4: AI Chatbot

Goal

Generate conversational responses.

Best Model

Large Language Model (LLM)


Scenario 5: Reading Text from Scanned Documents

Goal

Extract printed text.

Best Model

OCR computer vision model


Scenario 6: Detecting Fraudulent Transactions

Goal

Identify suspicious activity.

Best Model

Anomaly detection model


Azure AI Services and Model Types

Microsoft Azure AI Services provide many prebuilt AI capabilities, including:

  • Vision services
  • Speech services
  • Language services
  • Generative AI tools
  • Document intelligence
  • Recommendation capabilities

Microsoft Azure helps organizations apply the correct AI models to different business scenarios.


Responsible AI Considerations

When selecting AI models, organizations should also consider:

  • Fairness
  • Transparency
  • Privacy
  • Reliability
  • Inclusiveness
  • Accountability

A technically accurate model may still create ethical or operational concerns if deployed improperly.


Important AI-901 Exam Tips

For the exam, remember these key points:

  • Classification predicts categories.
  • Regression predicts numeric values.
  • Clustering groups similar items.
  • NLP models process language.
  • Computer vision models process images and video.
  • Generative AI creates new content.
  • Recommendation systems suggest relevant items.
  • Anomaly detection identifies unusual behavior.
  • LLMs are generative AI models for language tasks.
  • OCR extracts text from images or documents.

Quick Knowledge Check

Question 1

Which model type is best for predicting numeric values?

Answer

Regression models.


Question 2

Which AI capability is used to extract text from scanned documents?

Answer

Optical Character Recognition (OCR).


Question 3

What type of model is typically used for chatbots that generate responses?

Answer

Large Language Models (LLMs).


Question 4

Which learning type uses unlabeled data?

Answer

Unsupervised learning.


Practice Exam Questions

Question 1

A company wants to predict future monthly sales revenue based on historical sales data.

Which type of AI model is MOST appropriate?

A. Classification
B. Regression
C. Clustering
D. Computer vision


Correct Answer

B. Regression


Explanation

Regression models are used to predict numeric values such as revenue, prices, or temperatures.


Why the Other Answers Are Incorrect

A. Classification

Classification predicts categories, not numeric values.

C. Clustering

Clustering groups similar items.

D. Computer vision

Computer vision processes images and video.


Question 2

An organization wants to identify whether emails are spam or not spam.

Which type of AI model should be used?

A. Regression
B. Clustering
C. Classification
D. OCR


Correct Answer

C. Classification


Explanation

Spam detection is a classification problem because the output belongs to predefined categories: spam or not spam.


Why the Other Answers Are Incorrect

A. Regression

Regression predicts numeric values.

B. Clustering

Clustering groups unlabeled data.

D. OCR

OCR extracts text from images.


Question 3

Which AI capability is MOST appropriate for extracting text from scanned documents?

A. Object detection
B. OCR
C. Regression
D. Recommendation system


Correct Answer

B. OCR


Explanation

Optical Character Recognition (OCR) extracts printed or handwritten text from images or scanned documents.


Why the Other Answers Are Incorrect

A. Object detection

Object detection identifies objects within images.

C. Regression

Regression predicts numeric values.

D. Recommendation system

Recommendation systems suggest items to users.


Question 4

A retailer wants to group customers based on purchasing behavior without predefined labels.

Which type of AI model is MOST appropriate?

A. Classification
B. Regression
C. Clustering
D. Translation


Correct Answer

C. Clustering


Explanation

Clustering models group similar data points together without labeled categories.


Why the Other Answers Are Incorrect

A. Classification

Classification requires labeled categories.

B. Regression

Regression predicts numbers.

D. Translation

Translation converts text between languages.


Question 5

Which type of AI model is BEST suited for generating natural language responses in a chatbot?

A. Large Language Model (LLM)
B. Regression model
C. Clustering model
D. Decision tree only


Correct Answer

A. Large Language Model (LLM)


Explanation

LLMs are generative AI models designed for language tasks such as conversation, summarization, and question answering.


Why the Other Answers Are Incorrect

B. Regression model

Regression predicts numeric values.

C. Clustering model

Clustering groups similar data.

D. Decision tree only

Decision trees are not specialized for conversational text generation.


Question 6

A bank wants to identify suspicious credit card transactions that differ from normal spending patterns.

Which AI capability is MOST appropriate?

A. Sentiment analysis
B. Anomaly detection
C. OCR
D. Image classification


Correct Answer

B. Anomaly detection


Explanation

Anomaly detection models identify unusual or abnormal behavior that may indicate fraud or security issues.


Why the Other Answers Are Incorrect

A. Sentiment analysis

Sentiment analysis evaluates emotional tone in text.

C. OCR

OCR extracts text from images.

D. Image classification

Image classification categorizes images.


Question 7

What is the PRIMARY capability of a computer vision model?

A. Predicting stock prices
B. Processing and analyzing visual content such as images and video
C. Translating text between languages
D. Generating database queries


Correct Answer

B. Processing and analyzing visual content such as images and video


Explanation

Computer vision models work with images and video to identify objects, text, faces, and other visual information.


Why the Other Answers Are Incorrect

A. Predicting stock prices

This is typically a regression problem.

C. Translating text between languages

Translation is an NLP task.

D. Generating database queries

This is not the primary role of computer vision.


Question 8

A streaming service suggests movies based on a user’s viewing history.

Which AI capability is being used?

A. Recommendation system
B. OCR
C. Regression
D. Object detection


Correct Answer

A. Recommendation system


Explanation

Recommendation systems suggest products, movies, music, or other items based on user behavior and preferences.


Why the Other Answers Are Incorrect

B. OCR

OCR extracts text from images.

C. Regression

Regression predicts numeric values.

D. Object detection

Object detection identifies objects in images.


Question 9

Which type of AI model would MOST likely be used for language translation?

A. NLP model
B. Clustering model
C. Regression model
D. Computer vision model


Correct Answer

A. NLP model


Explanation

Natural Language Processing (NLP) models are designed to process and understand human language, including translation tasks.


Why the Other Answers Are Incorrect

B. Clustering model

Clustering groups similar items.

C. Regression model

Regression predicts numeric outputs.

D. Computer vision model

Computer vision analyzes images and video.


Question 10

Which statement BEST describes the difference between classification and regression models?

A. Classification predicts categories, while regression predicts numeric values
B. Classification uses images, while regression uses text only
C. Regression groups data, while classification predicts prices
D. Regression and classification are identical


Correct Answer

A. Classification predicts categories, while regression predicts numeric values


Explanation

Classification models predict labels or categories, while regression models predict continuous numeric values.


Why the Other Answers Are Incorrect

B. Classification uses images, while regression uses text only

Both models can work with many data types.

C. Regression groups data, while classification predicts prices

Grouping data is clustering, not regression.

D. Regression and classification are identical

They solve different types of problems.


Final Thoughts

Understanding AI model capabilities is a critical foundational skill for the AI-901 certification exam. Microsoft expects candidates to recognize which AI model types are appropriate for different business scenarios and understand the strengths of common AI approaches.

Knowing how to match business problems to the correct AI capabilities is essential for designing effective AI solutions on Azure and beyond.


Go to the AI-901 Exam Prep Hub main page

Identify appropriate model deployment options and configuration parameters (AI-901 Exam Prep)

This post is a part of the AI-901: Microsoft Azure AI Fundamentals Exam Prep Hub. 
This topic falls under these sections:
Identify AI concepts and capabilities (40–45%)
--> Identify AI model components and configurations
--> Identify appropriate model deployment options and configuration parameters


Note that there are 10 practice questions (with answers and explanations) for each section to help you solidify your knowledge of the material. Also, there are 2 practice tests with 60 questions each available on the hub below the exam topics section.

Deploying AI models effectively is an important part of building real-world AI solutions and a key topic for the AI-901 certification exam. Microsoft expects candidates to understand common deployment options, model hosting approaches, and basic configuration parameters used in AI systems.

This topic falls under the “Identify AI model components and configurations” section of the exam objectives.


What Is AI Model Deployment?

Model deployment is the process of making a trained AI model available for real-world use.

After a model is trained and tested, it must be deployed so applications and users can interact with it.

Examples

  • A chatbot answering customer questions
  • A fraud detection model analyzing transactions
  • An image recognition system processing uploaded photos
  • A recommendation engine suggesting products

Deployment connects the AI model to users and applications.


Common AI Model Deployment Options

AI models can be deployed in different environments depending on business needs.

Common deployment options include:

  • Cloud deployment
  • Edge deployment
  • On-premises deployment
  • Containerized deployment
  • Real-time inference
  • Batch inference

Cloud Deployment

Cloud deployment hosts AI models in cloud platforms such as Microsoft Azure.

Benefits

  • Scalability
  • High availability
  • Managed infrastructure
  • Easier updates
  • Flexible resource allocation

Common Use Cases

  • Web applications
  • Chatbots
  • APIs
  • Enterprise AI services

Example

A customer support chatbot hosted in Azure and accessed through a website.


Edge Deployment

Edge deployment runs AI models on local devices near the data source.

Examples of Edge Devices

  • Smartphones
  • IoT devices
  • Cameras
  • Manufacturing equipment
  • Vehicles

Benefits

  • Reduced latency
  • Offline operation
  • Faster response times
  • Reduced bandwidth usage

Example

A factory camera performing real-time defect detection directly on the device.


On-Premises Deployment

On-premises deployment hosts AI models within an organization’s own data center.

Benefits

  • Greater control over data
  • Compliance support
  • Internal network security
  • Reduced external data sharing

Common Use Cases

  • Highly regulated industries
  • Sensitive data environments

Example

A hospital deploying AI systems within its internal infrastructure for patient privacy reasons.


Containerized Deployment

Containers package AI models and their dependencies into portable units.

Common container technologies include:

  • Docker
  • Kubernetes

Benefits

  • Portability
  • Consistent environments
  • Easier scaling
  • Simplified deployment

Example

Deploying an AI API inside a Docker container across multiple servers.


Real-Time Inference

Real-time inference provides immediate AI predictions or responses.

Characteristics

  • Low latency
  • Fast responses
  • Interactive applications

Example Use Cases

  • Chatbots
  • Fraud detection during transactions
  • Live recommendation systems
  • Voice assistants

Example

A chatbot generating responses instantly during a conversation.


Batch Inference

Batch inference processes large amounts of data at scheduled intervals.

Characteristics

  • High-volume processing
  • Non-interactive
  • Scheduled operations

Example Use Cases

  • Overnight report generation
  • Bulk image processing
  • Customer segmentation updates

Example

A retailer analyzing all sales data nightly to update recommendations.


APIs and Endpoints

Deployed AI models are often accessed through APIs (Application Programming Interfaces).

An endpoint is a network location where applications send requests to the AI model.

Example

A mobile app sends an image to an AI vision API endpoint for analysis.


Scalability

Scalability refers to the ability of a deployment to handle increasing workloads.

Cloud deployments often scale automatically based on:

  • Number of requests
  • CPU usage
  • Memory usage

Example

An AI chatbot automatically adds more computing resources during peak business hours.


Latency

Latency refers to response time.

Some applications require very low latency.

Low-Latency Examples

  • Autonomous vehicles
  • Fraud detection
  • Real-time translation
  • Voice assistants

Edge deployment is often used to reduce latency.


Availability and Reliability

AI systems should remain available and reliable.

High availability helps ensure systems continue functioning even during failures.

Common techniques include:

  • Redundant servers
  • Load balancing
  • Failover systems
  • Monitoring

Model Monitoring

After deployment, AI systems should be monitored continuously.

Monitoring helps identify:

  • Performance degradation
  • Bias
  • Security issues
  • Reliability problems
  • Model drift

Example

A fraud detection model becomes less accurate as customer behavior changes over time.


Model Drift

Model drift occurs when real-world data changes over time, causing reduced model accuracy.

Example

A recommendation system trained on older shopping trends may become less effective as customer preferences change.

Monitoring helps detect model drift.


AI Model Configuration Parameters

AI systems often include configurable settings that affect behavior and performance.

For AI-901, important parameters include:

  • Temperature
  • Max tokens
  • Top-p
  • Frequency penalty
  • Presence penalty

These are especially important for generative AI systems.


Temperature

Temperature controls randomness and creativity in generated responses.

TemperatureBehavior
LowMore predictable and focused
HighMore creative and varied

Example

A customer support chatbot may use a lower temperature for consistent answers.


Max Tokens

Max tokens controls the maximum length of generated output.

Example

A summarization system may limit responses to 200 tokens.


Top-p (Nucleus Sampling)

Top-p controls how many likely next-token choices the model considers.

Lower values create more focused responses.

Higher values allow greater variety.


Frequency Penalty

Frequency penalty reduces repeated words or phrases in generated text.

Example

Helps prevent repetitive chatbot responses.


Presence Penalty

Presence penalty encourages the model to introduce new topics or ideas.

This can increase response diversity.


Choosing Deployment Options

Selecting the correct deployment approach depends on:

RequirementPossible Deployment Choice
Low latencyEdge deployment
Large scalabilityCloud deployment
Sensitive dataOn-premises deployment
PortabilityContainers
Instant responsesReal-time inference
Large scheduled jobsBatch inference

Real-World Examples


Scenario 1: AI Chatbot

Requirements

  • Instant responses
  • Large user base
  • Internet access

Best Deployment

Cloud-based real-time deployment

Useful Parameters

  • Low temperature
  • Moderate max tokens

Scenario 2: Factory Defect Detection

Requirements

  • Very low latency
  • Works without internet

Best Deployment

Edge deployment


Scenario 3: Monthly Sales Forecasting

Requirements

  • Analyze large historical datasets
  • No immediate response needed

Best Deployment

Batch inference


Scenario 4: Healthcare AI System

Requirements

  • Strict privacy controls
  • Sensitive patient data

Best Deployment

On-premises deployment


Azure AI Deployment Options

Microsoft Azure AI Services provide multiple deployment approaches for AI solutions, including:

  • Cloud-hosted AI APIs
  • Container support
  • Edge deployment support
  • Managed AI services
  • Scalable inference endpoints

Azure simplifies deployment, scaling, and management of AI systems.


Responsible AI Considerations

When deploying AI models, organizations should also consider:

  • Security
  • Privacy
  • Reliability
  • Monitoring
  • Transparency
  • Accountability

Poor deployment practices can create operational or ethical risks.


Important AI-901 Exam Tips

For the exam, remember these key points:

  • Deployment makes AI models available for use.
  • Cloud deployment offers scalability and flexibility.
  • Edge deployment reduces latency and supports offline operation.
  • On-premises deployment provides greater internal control.
  • Real-time inference supports immediate responses.
  • Batch inference processes large datasets on schedules.
  • APIs and endpoints connect applications to AI models.
  • Model drift occurs when real-world data changes over time.
  • Temperature controls creativity in generative AI responses.
  • Max tokens controls output length.

Quick Knowledge Check

Question 1

What deployment option is best for very low-latency AI processing on local devices?

Answer

Edge deployment.


Question 2

What does temperature control in generative AI?

Answer

The randomness and creativity of generated responses.


Question 3

What is batch inference?

Answer

Processing large amounts of data at scheduled intervals rather than in real time.


Question 4

What is model drift?

Answer

Reduced model performance caused by changes in real-world data over time.


Practice Exam Questions

Question 1

A company needs an AI-powered chatbot that can instantly respond to customer questions on its website.

Which deployment type is MOST appropriate?

A. Batch inference
B. Real-time inference
C. Offline archival storage
D. Manual processing


Correct Answer

B. Real-time inference


Explanation

Real-time inference provides immediate responses and is commonly used for interactive applications such as chatbots.


Why the Other Answers Are Incorrect

A. Batch inference

Batch inference processes data on schedules rather than instantly.

C. Offline archival storage

Archival storage does not provide live AI responses.

D. Manual processing

Manual processing is not an AI deployment method.


Question 2

What is the PRIMARY benefit of edge deployment for AI models?

A. Unlimited cloud scalability
B. Reduced latency and local processing
C. Increased internet bandwidth usage
D. Automatic model retraining


Correct Answer

B. Reduced latency and local processing


Explanation

Edge deployment places AI models close to the data source, reducing response time and allowing operation even with limited internet connectivity.


Why the Other Answers Are Incorrect

A. Unlimited cloud scalability

This is more associated with cloud deployment.

C. Increased internet bandwidth usage

Edge deployment often reduces bandwidth usage.

D. Automatic model retraining

Edge deployment does not automatically retrain models.


Question 3

Which deployment option provides the MOST control over sensitive organizational data?

A. Public social media deployment
B. On-premises deployment
C. Edge gaming deployment
D. Anonymous deployment


Correct Answer

B. On-premises deployment


Explanation

On-premises deployment keeps systems and data within an organization’s internal infrastructure, supporting security and compliance needs.


Why the Other Answers Are Incorrect

A. Public social media deployment

This is not a standard deployment option.

C. Edge gaming deployment

This is not a recognized AI deployment category.

D. Anonymous deployment

This is not a deployment model.


Question 4

What does the temperature parameter control in many generative AI models?

A. The physical temperature of the servers
B. The creativity and randomness of generated responses
C. The storage capacity of the model
D. The speed of internet connections


Correct Answer

B. The creativity and randomness of generated responses


Explanation

Temperature controls how predictable or creative AI-generated outputs are.

Lower values create more focused responses, while higher values create more varied responses.


Why the Other Answers Are Incorrect

A. The physical temperature of the servers

Temperature is a model setting, not a hardware measurement.

C. The storage capacity of the model

Temperature does not affect storage.

D. The speed of internet connections

Temperature is unrelated to networking.


Question 5

A company processes millions of sales records every night to generate forecasts for the next day.

Which inference type is MOST appropriate?

A. Real-time inference
B. Batch inference
C. Edge inference
D. Interactive inference only


Correct Answer

B. Batch inference


Explanation

Batch inference is designed for large-scale scheduled processing rather than immediate responses.


Why the Other Answers Are Incorrect

A. Real-time inference

Real-time inference is intended for immediate responses.

C. Edge inference

Edge inference focuses on local device processing.

D. Interactive inference only

This is not a standard inference category.


Question 6

What is model drift?

A. A networking issue in cloud deployments
B. Reduced model performance caused by changes in real-world data over time
C. A method for encrypting AI outputs
D. A hardware failure in GPU systems


Correct Answer

B. Reduced model performance caused by changes in real-world data over time


Explanation

Model drift occurs when data patterns change after deployment, causing model accuracy to decline.


Why the Other Answers Are Incorrect

A. A networking issue in cloud deployments

Drift relates to data and performance, not networking.

C. A method for encrypting AI outputs

Drift is unrelated to encryption.

D. A hardware failure in GPU systems

Hardware failures are separate operational issues.


Question 7

Which deployment approach is MOST suitable for AI systems that must continue operating without internet access?

A. Cloud-only deployment
B. Edge deployment
C. Browser caching
D. Remote archival deployment


Correct Answer

B. Edge deployment


Explanation

Edge deployment allows AI models to run locally on devices, enabling offline functionality.


Why the Other Answers Are Incorrect

A. Cloud-only deployment

Cloud-only systems usually require internet connectivity.

C. Browser caching

Caching is not an AI deployment strategy.

D. Remote archival deployment

This is not a standard deployment model.


Question 8

What is the purpose of the max tokens parameter in generative AI?

A. To control the maximum response length
B. To encrypt generated text
C. To increase hardware memory
D. To reduce internet latency


Correct Answer

A. To control the maximum response length


Explanation

Max tokens limits how much text the model can generate in a response.


Why the Other Answers Are Incorrect

B. To encrypt generated text

Max tokens does not affect encryption.

C. To increase hardware memory

It does not change hardware capacity.

D. To reduce internet latency

It is unrelated to network speed.


Question 9

What is an AI endpoint?

A. A backup storage device
B. A network location where applications send requests to an AI model
C. A hardware cooling system
D. A type of training dataset


Correct Answer

B. A network location where applications send requests to an AI model


Explanation

Endpoints allow applications and users to interact with deployed AI models through APIs.


Why the Other Answers Are Incorrect

A. A backup storage device

Endpoints are not storage systems.

C. A hardware cooling system

Cooling systems are unrelated.

D. A type of training dataset

Endpoints are deployment interfaces.


Question 10

Which deployment option is MOST associated with automatic scalability and managed infrastructure?

A. Cloud deployment
B. Manual deployment
C. Printed deployment
D. Standalone spreadsheet deployment


Correct Answer

A. Cloud deployment


Explanation

Cloud deployment platforms such as Microsoft Azure provide scalable infrastructure and managed services for AI workloads.


Why the Other Answers Are Incorrect

B. Manual deployment

Manual deployment does not provide automatic scalability.

C. Printed deployment

This is not a valid deployment option.

D. Standalone spreadsheet deployment

Spreadsheets are not scalable AI deployment platforms.


Final Thoughts

Understanding AI deployment options and configuration parameters is an important foundational skill for the AI-901 certification exam. Microsoft expects candidates to recognize when different deployment strategies and model settings are appropriate for business and technical requirements.

These concepts help organizations deploy scalable, reliable, and effective AI solutions using Azure AI technologies.


Go to the AI-901 Exam Prep Hub main page