This post is a part of the AI-103: Develop AI Apps and Agents on Azure Exam Prep Hub. 
This topic falls under these sections:
Implement computer vision solutions (10–15%)
   --> Design and implement image- and video-generation solutions
      --> Configure image-editing workflows, including inpainting, mask-based edits, and prompt-driven modifications

Note that there are 10 practice questions (with answers and explanations) at the end of each section to help you solidify your knowledge of the material. Also, there are 2 practice tests with 60 questions each available from the hub's main page below the exam topics section.

Introduction

Modern generative AI systems are capable of much more than simply generating images from scratch. Organizations increasingly use AI-powered image editing workflows to:

Modify existing images
Replace objects
Edit backgrounds
Improve image quality
Apply artistic styles
Perform targeted visual changes

For the AI-103 certification exam, you should understand how to configure and implement image-editing workflows using:

Inpainting
Mask-based editing
Prompt-driven modifications
Reference images
Multi-modal editing pipelines

You should also understand:

Workflow orchestration
Prompt engineering
Responsible AI considerations
Content safety
Storage and processing workflows
Azure services commonly used in image editing systems

This topic falls under:

“Design and implement image- and video-generation solutions”

What Is AI Image Editing?

AI image editing uses generative AI models to modify existing images based on:

Text prompts
Masks
Reference media
Style instructions

Unlike text-to-image generation, image editing starts with an existing image and selectively changes portions of it.

Common Image Editing Use Cases

Marketing and Advertising

Modify:

Product backgrounds
Seasonal themes
Promotional imagery

E-Commerce

Generate:

Product variations
Lifestyle scenes
Background replacements

Photography

Enhance:

Lighting
Resolution
Object cleanup
Scene composition

Entertainment and Media

Create:

Visual effects
Character edits
Stylized artwork

Enterprise Applications

Support:

Brand-compliant imagery
AI-assisted design workflows
Automated content generation

Core Components of AI Image Editing

AI image-editing workflows commonly include:

Source image
Editing instructions
Masks
Generative model
Safety validation
Output rendering

What Is Inpainting?

Definition

Inpainting is an AI editing technique that modifies selected portions of an image while preserving the rest of the image.

The system uses:

An original image
A mask identifying editable regions
A text prompt describing desired changes

How Inpainting Works

The workflow typically includes:

Upload original image
Define editable region using a mask
Provide prompt instructions
AI model generates replacement content
Blend generated content into original image

Example Inpainting Scenario

Original image:

Person standing in a park

Mask:

Covers the person’s jacket

Prompt:

Replace the jacket with a red leather jacket

Result:

Only the jacket changes
Background and other elements remain intact

Common Inpainting Use Cases

Object Removal

Remove:

Watermarks
Background clutter
Unwanted objects

Object Replacement

Replace:

Clothing
Furniture
Products
Signs

Background Editing

Modify scenery while preserving foreground subjects.

Image Restoration

Repair:

Damaged photographs
Missing sections
Visual defects

What Is a Mask?

A mask defines which parts of an image may be modified.

Mask-Based Editing

Purpose of Masks

Masks allow precise control over edits.

White or highlighted regions typically indicate:

Editable areas

Unmasked regions remain unchanged.

Types of Masks

Binary Masks

Simple editable/non-editable regions.

Soft Masks

Allow gradual blending between edited and preserved areas.

Semantic Masks

Generated automatically using object detection or segmentation.

Examples:

Person segmentation
Background segmentation
Sky detection

Manual vs Automated Mask Creation

Manual Masks

Users draw editable areas manually.

Advantages:

Precise control
Flexible editing

Automated Masks

AI identifies objects automatically.

Advantages:

Faster workflows
Reduced manual effort

Prompt-Driven Modifications

What Are Prompt-Driven Modifications?

Prompt-driven editing uses natural language instructions to guide image modifications.

The prompt describes:

Desired changes
Style
Color
Objects
Mood
Lighting

Example Prompt-Driven Edits

Style Modification

Transform this image into a watercolor painting

Background Replacement

Replace the background with a snowy mountain landscape

Object Addition

Add a golden retriever sitting beside the person

Lighting Adjustments

Convert the scene to nighttime with neon lighting

Prompt Engineering for Image Editing

Why Prompt Engineering Matters

Clear prompts improve:

Editing accuracy
Consistency
Style control
Realism

Effective Prompt Components

Component	Example
Object	“A wooden table”
Style	“minimalist design”
Environment	“modern office”
Lighting	“soft warm lighting”
Quality	“highly detailed”

Negative Prompts

Negative prompts specify unwanted characteristics.

Example:

blurry, distorted, extra limbs, low quality

These help improve output quality.

Multi-Step Editing Workflows

Enterprise systems often use multiple editing stages.

Example Workflow

Upload image
Detect editable objects
Generate masks
Apply prompt-driven edits
Run safety validation
Generate variations
Store approved outputs

Image Segmentation in Editing Workflows

What Is Image Segmentation?

Segmentation identifies objects or regions within images.

Segmentation helps:

Create masks automatically
Improve editing precision
Enable object-aware workflows

Types of Segmentation

Semantic Segmentation

Groups pixels by category.

Example:

Sky
Road
Person

Instance Segmentation

Separates individual objects.

Example:

Person 1
Person 2
Car 1

Style Transfer

What Is Style Transfer?

Style transfer applies the artistic style of one image to another.

Examples:

Oil painting style
Anime style
Sketch style
Watercolor style

Image Variations

Generative editing systems can produce:

Multiple alternate edits
Different styles
Different lighting conditions
Multiple compositions

This helps users compare outputs.

Outpainting

What Is Outpainting?

Outpainting extends an image beyond its original boundaries.

Use cases:

Expanding landscapes
Creating panoramic scenes
Extending backgrounds

Workflow Automation

Image-editing pipelines are commonly automated using:

APIs
Serverless workflows
Event-driven orchestration

Example Automated Workflow

User uploads product image
Azure Function triggers workflow
AI model removes background
New background generated
Safety checks run
Final image stored

Responsible AI Considerations

Image editing introduces several Responsible AI concerns.

Deepfake Risks

Image editing can alter:

Faces
Identities
Appearances

Improper use may create misleading content.

Harmful Content Generation

Edits may unintentionally create:

Violent imagery
Hate content
Explicit material

Copyright Concerns

Generated edits may resemble copyrighted works.

Organizations should ensure proper licensing.

Bias and Fairness

Editing systems may unintentionally reinforce:

Stereotypes
Representation imbalance
Cultural bias

Azure AI Content Safety

Microsoft provides:
Azure AI Content Safety

to help detect:

Harmful prompts
Unsafe outputs
Policy violations

Moderation Workflows

Enterprise systems may:

Block unsafe edits
Flag outputs for review
Require human approval

Human-in-the-Loop Validation

Organizations often require manual review for:

Brand-sensitive content
Regulated industries
Public-facing media

Performance Considerations

Image editing can require substantial compute resources.

Factors affecting performance include:

Image resolution
Mask complexity
Model size
Number of variations
GPU availability

GPU Acceleration

Generative image editing heavily relies on GPUs because of:

Parallel computation
Matrix operations
Rendering efficiency

Optimization Techniques

Lower Resolution Drafts

Preview edits before full rendering.

Progressive Upscaling

Generate smaller images first, then upscale.

Cached Assets

Reuse commonly edited assets.

Parallel Variation Generation

Create multiple outputs simultaneously.

Azure Services for Image Editing Workflows

Azure OpenAI Service

Supports:

Multi-modal AI workflows
Prompt-driven editing
Image generation pipelines

Azure AI Foundry

Used for:

Prompt orchestration
Workflow development
Model evaluation
AI pipeline management

Azure AI Vision

Can support:

Segmentation
Object detection
Image analysis
Automated mask generation

Azure Blob Storage

Frequently used for:

Storing source images
Managing edited outputs
Workflow integration

Azure Functions

Often used for:

Workflow orchestration
Trigger-based processing
Automation pipelines

Observability for Image Editing Systems

Production systems should monitor:

Editing latency
Failed requests
GPU utilization
Safety violations
Prompt trends
Storage usage
Operational costs

Best Practices for Image Editing Solutions

Use Precise Masks

Improves editing accuracy.

Write Detailed Prompts

Clear prompts produce better results.

Validate Inputs and Outputs

Apply safety filtering consistently.

Maintain Audit Logs

Track prompts, edits, and approvals.

Use Human Review for Sensitive Content

Especially important for regulated industries.

Optimize for Cost and Latency

Balance rendering quality with operational efficiency.

Protect User Privacy

Secure uploaded images appropriately.

Real-World Example

An e-commerce retailer may implement an image-editing workflow that:

Accepts a clothing product image
Automatically segments the background
Uses prompt:

Replace the background with a luxury fashion studio setting

Generates multiple styled variations
Runs safety validation
Stores approved outputs in Blob Storage

This demonstrates:

Mask-based editing
Prompt-driven modification
Automated workflows
Safety enforcement

Exam Tips for AI-103

For the AI-103 exam, remember these important concepts:

Inpainting edits selected portions of an image.
Masks define editable regions.
Prompt-driven editing uses natural language instructions.
Segmentation can automate mask generation.
Negative prompts help avoid undesirable outputs.
Outpainting expands image boundaries.
Style transfer changes artistic appearance.
Azure AI Content Safety helps moderate unsafe content.
Azure Blob Storage commonly stores source and edited images.
GPU acceleration is important for performance.
Human review may be required for sensitive content.

AI image editing can potentially be used to create misleading or impersonated content.

Go to the AI-103 Exam Prep Hub main page