Create new visual outputs by using generative models (AI-901 Exam Prep)

This post is a part of the AI-901: Microsoft Azure AI Fundamentals Exam Prep Hub. 
This topic falls under these sections:
Implement AI solutions by using Microsoft Foundry (55–60%)
--> Implement AI solutions with computer vision and image-generation capabilities by using Foundry
--> Create new visual outputs by using generative models


Note that there are 10 practice questions (with answers and explanations) for each section to help you solidify your knowledge of the material. Also, there are 2 practice tests with 60 questions each available on the hub below the exam topics section.

Generative AI models are capable of creating entirely new content based on patterns learned during training. One important category of generative AI focuses on producing visual outputs such as images, artwork, diagrams, and design concepts.

For the AI-901 certification exam, candidates should understand the foundational concepts behind creating new visual outputs by using generative AI models through Microsoft Azure AI Foundry and related Azure AI services.

This topic falls under the “Implement AI solutions with computer vision and image-generation capabilities by using Foundry” section of the AI-901 exam objectives.


What Is Generative AI?

Generative AI refers to AI systems capable of creating new content rather than simply analyzing existing data.

Generative AI can produce:

  • Text
  • Images
  • Audio
  • Video
  • Code

What Are Generative Image Models?

Generative image models create new visual content from prompts or instructions.

These models can generate:

  • Artwork
  • Illustrations
  • Photorealistic images
  • Concept designs
  • Marketing graphics

Example Prompt

“Create an image of a futuristic city at sunset.”

The model generates a new image based on the description.


Azure AI Foundry

Azure AI Foundry provides tools for building and deploying AI-powered applications, including generative AI solutions.

Developers can:

  • Access generative models
  • Test prompts
  • Deploy models
  • Build AI applications

Image Generation Workflow

A common image-generation workflow includes:

  1. User enters prompt
  2. Application sends prompt to model
  3. Generative model creates image
  4. Application displays generated output

Text-to-Image Generation

Text-to-image models generate images from natural-language prompts.


Example

Prompt

“A golden retriever wearing sunglasses on a beach.”

Result

A newly generated image matching the description.


Image Editing

Some generative models can modify existing images.

Capabilities may include:

  • Removing objects
  • Replacing backgrounds
  • Extending images
  • Applying artistic styles

Example

Original Image

Photo of a park

Prompt

“Add snow to the scene.”

The model generates an updated version of the image.


Style Transfer

Style transfer applies artistic styles to images.


Example

Prompt

“Make this image look like a watercolor painting.”

The AI transforms the image style.


Inpainting

Inpainting fills missing or selected portions of images.


Example

A damaged image has missing areas that the AI reconstructs.


Outpainting

Outpainting expands images beyond their original boundaries.


Example

A cropped landscape image is extended to show more scenery.


Prompt Engineering

Prompt engineering involves crafting prompts that improve AI-generated results.

Good prompts are:

  • Clear
  • Detailed
  • Specific

Weak Prompt Example

“Create a dog.”


Better Prompt Example

“Create a realistic golden retriever sitting beside a lake during sunset.”


System Prompts

System prompts guide the overall behavior of the AI model.

They may define:

  • Safety rules
  • Content restrictions
  • Tone
  • Style preferences

Model Parameters

Generative AI models may use parameters that influence output behavior.

Common concepts include:

  • Creativity/randomness
  • Response length
  • Style guidance

For AI-901, conceptual understanding is more important than memorizing exact parameter names.


APIs and Endpoints

Applications communicate with deployed generative models using:

  • APIs
  • Endpoints

These allow prompts and images to be processed programmatically.


Authentication

Applications must securely authenticate before using Azure AI services.

Common authentication methods include:

  • API keys
  • Azure credentials
  • Managed identities

User Interface Components

A lightweight image-generation application may include:

  • Prompt text box
  • Image upload option
  • Generate button
  • Image display area

Real-Time Generation

Some applications generate images interactively in near real time.

This improves user experience and experimentation.


Common Real-World Scenarios


Scenario 1: Marketing Content Creation

Goal

Generate promotional graphics.

Features

  • Text-to-image generation
  • Brand-aligned designs
  • Rapid content creation

Scenario 2: Product Concept Design

Goal

Visualize product ideas.

Features

  • Prototype generation
  • Style experimentation
  • Rapid iteration

Scenario 3: Educational Content

Goal

Generate learning visuals and illustrations.

Features

  • Diagram generation
  • Visual storytelling
  • Accessibility support

Scenario 4: Entertainment and Gaming

Goal

Create concept art and environments.

Features

  • Character design
  • Landscape generation
  • Artistic experimentation

Responsible AI Considerations

Generative image applications should follow Responsible AI principles.

Key considerations include:

  • Fairness
  • Privacy
  • Transparency
  • Inclusiveness
  • Accountability
  • Security

Copyright and Intellectual Property

Organizations should consider:

  • Ownership rights
  • Licensing concerns
  • Use of copyrighted material

Generated content may still raise legal and ethical questions.


Harmful Content Risks

Generative AI systems may create:

  • Offensive content
  • Misleading images
  • Unsafe material

Content filtering and moderation are important safeguards.


Deepfakes

AI-generated images or videos designed to imitate real people are called deepfakes.

Deepfakes can create ethical and security concerns.


Hallucinations

Generative models may produce inaccurate or unrealistic outputs.

These incorrect outputs are called hallucinations.


Bias and Fairness

Generated images may unintentionally reflect societal biases.

Examples include:

  • Stereotypical portrayals
  • Uneven representation
  • Cultural bias

Transparency

Users should understand:

  • AI generated the image
  • Outputs may contain inaccuracies
  • Images may be synthetic rather than real

Error Handling

Applications should handle:

  • Invalid prompts
  • Unsupported file types
  • Network interruptions
  • Authentication failures
  • Rate limits

Advantages of Generative Image Models

Benefits include:

  • Faster content creation
  • Creative assistance
  • Rapid prototyping
  • Automation
  • Enhanced user engagement

Limitations of Generative Models

Challenges include:

  • Hallucinations
  • Bias
  • Ethical concerns
  • Copyright uncertainty
  • Variable output quality

High-Level Workflow

A simplified workflow includes:

  1. User enters prompt
  2. Application sends request
  3. Model generates image
  4. Application displays output

Example High-Level Pseudocode

prompt = get_prompt()
image = generate_image(prompt)
display_image(image)

For AI-901, understanding the workflow is more important than memorizing exact syntax.


Important AI-901 Exam Tips

For the exam, remember these key points:

  • Generative AI creates new content.
  • Text-to-image models generate images from prompts.
  • Azure AI Foundry supports generative AI development.
  • Prompt engineering improves output quality.
  • APIs and endpoints connect applications to AI services.
  • Authentication secures access to Azure AI resources.
  • Deepfakes are synthetic media designed to imitate real people.
  • Hallucinations are inaccurate AI-generated outputs.
  • Responsible AI principles apply to generative image systems.
  • Transparency is important when presenting AI-generated content.

Quick Knowledge Check

Question 1

What does a text-to-image model do?

Answer

Generates images from natural-language prompts.


Question 2

What is prompt engineering?

Answer

Designing prompts to improve AI-generated results.


Question 3

What are deepfakes?

Answer

AI-generated media designed to imitate real people.


Question 4

Why is transparency important in generative AI?

Answer

Users should understand that AI generated the content and that inaccuracies may exist.


Practice Exam Questions

Question 1

What is the PRIMARY purpose of a generative AI model?

A. To create new content based on learned patterns
B. To replace computer hardware
C. To increase internet bandwidth
D. To manage operating systems


Correct Answer

A. To create new content based on learned patterns


Explanation

Generative AI models create new outputs such as images, text, audio, or video using patterns learned during training.


Why the Other Answers Are Incorrect

B. To replace computer hardware

Generative AI is software-based and does not replace hardware.

C. To increase internet bandwidth

AI models do not improve network speeds.

D. To manage operating systems

Operating system management is unrelated to generative AI.


Question 2

What does a text-to-image model do?

A. Generates images from text prompts
B. Converts images into spreadsheets
C. Detects malware in files
D. Compresses image files automatically


Correct Answer

A. Generates images from text prompts


Explanation

Text-to-image models create images based on natural-language descriptions provided by users.


Why the Other Answers Are Incorrect

B. Converts images into spreadsheets

This is unrelated to generative AI.

C. Detects malware in files

This is a cybersecurity task.

D. Compresses image files automatically

Compression is unrelated to image generation.


Question 3

Which Microsoft platform provides tools for building and deploying generative AI applications?

A. Azure AI Foundry
B. Microsoft Paint
C. Windows File Explorer
D. Microsoft Notepad


Correct Answer

A. Azure AI Foundry


Explanation

Azure AI Foundry provides tools for deploying, testing, and managing AI-powered applications.


Why the Other Answers Are Incorrect

B. Microsoft Paint

Paint is a graphics editor, not an AI platform.

C. Windows File Explorer

This is a file management tool.

D. Microsoft Notepad

Notepad is a text editor.


Question 4

What is prompt engineering?

A. Designing prompts to improve AI-generated results
B. Repairing damaged computer hardware
C. Compressing images into smaller files
D. Monitoring internet traffic


Correct Answer

A. Designing prompts to improve AI-generated results


Explanation

Prompt engineering involves creating clear and specific prompts to guide AI systems toward better outputs.


Why the Other Answers Are Incorrect

B. Repairing damaged computer hardware

This is unrelated to AI prompting.

C. Compressing images into smaller files

Compression is unrelated to prompts.

D. Monitoring internet traffic

This is a networking task.


Question 5

Which prompt is MOST likely to generate a detailed image?

A. “Create a dog.”
B. “Generate.”
C. “Create a realistic golden retriever sitting beside a lake during sunset.”
D. “Image.”


Correct Answer

C. “Create a realistic golden retriever sitting beside a lake during sunset.”


Explanation

Detailed prompts generally produce more accurate and useful AI-generated images.


Why the Other Answers Are Incorrect

A. “Create a dog.”

This prompt is too vague.

B. “Generate.”

This provides almost no guidance.

D. “Image.”

This prompt is incomplete and unclear.


Question 6

What is inpainting?

A. Filling or reconstructing parts of an image
B. Converting speech into text
C. Detecting objects in video streams
D. Encrypting image files


Correct Answer

A. Filling or reconstructing parts of an image


Explanation

Inpainting allows AI to fill in missing or selected regions within an image.


Why the Other Answers Are Incorrect

B. Converting speech into text

This is speech recognition.

C. Detecting objects in video streams

This is a computer vision task.

D. Encrypting image files

Encryption is unrelated to inpainting.


Question 7

What are deepfakes?

A. AI-generated media designed to imitate real people
B. Hardware failures in AI systems
C. Encrypted image storage systems
D. High-speed networking protocols


Correct Answer

A. AI-generated media designed to imitate real people


Explanation

Deepfakes use generative AI to create realistic but synthetic media that imitates real individuals.


Why the Other Answers Are Incorrect

B. Hardware failures in AI systems

This is unrelated to generated media.

C. Encrypted image storage systems

This is unrelated to deepfakes.

D. High-speed networking protocols

Networking is unrelated to deepfake technology.


Question 8

How do applications typically communicate with deployed generative AI models?

A. Through APIs and endpoints
B. Through printer drivers
C. Through monitor calibration settings
D. Through USB-only connections


Correct Answer

A. Through APIs and endpoints


Explanation

Applications use APIs and endpoints to send prompts and receive generated outputs from AI services.


Why the Other Answers Are Incorrect

B. Through printer drivers

Printers are unrelated to AI communication.

C. Through monitor calibration settings

This is unrelated to cloud AI services.

D. Through USB-only connections

Cloud AI services use network communication.


Question 9

Which Responsible AI concern is especially important for generative image models?

A. Preventing harmful or misleading content generation
B. Increasing keyboard typing speed
C. Improving spreadsheet formulas
D. Reducing monitor power consumption


Correct Answer

A. Preventing harmful or misleading content generation


Explanation

Generative AI systems can potentially create unsafe, offensive, or misleading content, making moderation and safeguards important.


Why the Other Answers Are Incorrect

B. Increasing keyboard typing speed

This is unrelated to Responsible AI.

C. Improving spreadsheet formulas

This is unrelated to image generation.

D. Reducing monitor power consumption

This is unrelated to AI ethics.


Question 10

What are hallucinations in generative AI systems?

A. Inaccurate or fabricated AI-generated outputs
B. Hardware installation errors
C. Network outages
D. Audio playback failures


Correct Answer

A. Inaccurate or fabricated AI-generated outputs


Explanation

Hallucinations occur when generative AI produces incorrect, unrealistic, or invented outputs.


Why the Other Answers Are Incorrect

B. Hardware installation errors

This is unrelated to AI-generated content.

C. Network outages

This is a connectivity issue.

D. Audio playback failures

This is unrelated to generative image models.


Final Thoughts

Creating new visual outputs by using generative models is an important AI-901 certification topic. Microsoft expects candidates to understand the foundational concepts behind generative image AI, including text-to-image generation, prompt engineering, APIs, deployment, Responsible AI principles, hallucinations, and ethical considerations.

Azure AI Foundry provides powerful tools for building intelligent applications capable of generating creative visual content for business, education, accessibility, and entertainment scenarios.


Go to the AI-901 Exam Prep Hub main page

One thought on “Create new visual outputs by using generative models (AI-901 Exam Prep)”

Leave a comment