Tag: Azure AI Speech

Practice Questions: Describe Capabilities of the Azure AI Speech Service (AI-900 Exam Prep)

Practice Exam Questions


Question 1

A company wants to automatically convert recorded customer support calls into written transcripts for analysis.
Which Azure service should they use?

A. Azure AI Language
B. Azure AI Vision
C. Azure AI Speech
D. Azure Translator

Correct Answer: C

Explanation:
Azure AI Speech provides Speech to Text, which converts spoken audio into written text. Azure AI Language analyzes existing text but does not process audio.


Question 2

An application needs to read written instructions aloud to users using natural-sounding voices.
Which Azure AI Speech capability is required?

A. Speech to Text
B. Text to Speech
C. Speaker Recognition
D. Speech Translation

Correct Answer: B

Explanation:
Text to Speech converts written text into spoken audio. This is commonly used for accessibility and voice assistants.


Question 3

A global company wants users to speak in Spanish and hear an English audio response in real time.
Which Azure AI Speech feature supports this scenario?

A. Text Analytics
B. Azure Translator
C. Speech Translation
D. Speaker Identification

Correct Answer: C

Explanation:
Speech Translation enables real-time translation of spoken language and can output translated speech or text.


Question 4

Which scenario is best suited for Azure AI Speech instead of Azure AI Language?

A. Extracting key phrases from emails
B. Detecting sentiment in product reviews
C. Transcribing audio from meetings
D. Identifying entities in documents

Correct Answer: C

Explanation:
Azure AI Speech handles audio-based workloads such as transcribing meetings. Azure AI Language is used for written text analysis.


Question 5

A banking app needs to verify a user’s identity based on their voice.
Which Azure AI Speech capability should be used?

A. Speech to Text
B. Speaker Recognition
C. Text to Speech
D. Language Detection

Correct Answer: B

Explanation:
Speaker Recognition is used to verify or identify individuals based on voice characteristics.


Question 6

Which Azure AI Speech capability converts spoken language into written text in real time?

A. Speech Translation
B. Text to Speech
C. Speech to Text
D. Speaker Identification

Correct Answer: C

Explanation:
Speech to Text converts audio input into text and supports real-time transcription.


Question 7

A developer wants to generate lifelike, human-sounding voices for a virtual assistant.
Which feature of Azure AI Speech makes this possible?

A. Optical character recognition
B. Neural voices
C. Language modeling
D. Sentiment analysis

Correct Answer: B

Explanation:
Azure AI Speech uses neural voices to produce natural-sounding speech output.


Question 8

Which input type is primarily required when using the Azure AI Speech service?

A. Images
B. Video streams
C. Audio data
D. Structured tables

Correct Answer: C

Explanation:
Azure AI Speech is designed to process audio input, such as spoken language or sound recordings.


Question 9

Which scenario would require combining multiple Azure AI Speech capabilities?

A. Detecting faces in images
B. Translating written documents
C. Speaking in one language and hearing a translated spoken response
D. Analyzing sentiment in customer feedback

Correct Answer: C

Explanation:
This scenario combines Speech to Text, Translation, and Text to Speech to deliver a speech-to-speech experience.


Question 10

Which statement best describes Azure AI Speech?

A. It analyzes written documents for meaning
B. It processes images and videos
C. It enables spoken language understanding and generation
D. It is used only for chatbots

Correct Answer: C

Explanation:
Azure AI Speech focuses on spoken language, including recognition, synthesis, translation, and speaker identification.


Final Exam Tips 🧠

  • If the question mentions audio, voice, or speech, think Azure AI Speech
  • Know the difference between:
    • Speech to Text
    • Text to Speech
    • Speech Translation
    • Speaker Recognition
  • AI-900 questions are conceptual and scenario-based, not technical

Go to the AI-900 Exam Prep Hub main page.

Describe Capabilities of the Azure AI Speech Service (AI-900 Exam Prep)

Where This Fits in the Exam

  • Exam: AI-900 – Microsoft Azure AI Fundamentals
  • Domain: Describe features of Natural Language Processing (NLP) workloads on Azure (15–20%)
  • Sub-area: Identify Azure tools and services for NLP workloads

For AI-900, Microsoft expects you to understand what the Azure AI Speech service does, when to use it, and how it differs from other AI services — not how to code it.


What Is the Azure AI Speech Service?

The Azure AI Speech service is a cloud-based service that enables applications to process spoken language. It allows systems to:

  • Convert speech into text
  • Convert text into natural-sounding speech
  • Translate spoken language
  • Recognize speakers and voices

It is part of Azure AI Services and focuses on audio and voice-based NLP workloads.


Core Capabilities of Azure AI Speech

1. Speech to Text

Speech to Text converts spoken audio into written text.

Key features:

  • Real-time transcription
  • Batch transcription of audio files
  • Support for multiple languages
  • Automatic punctuation and formatting

Common use cases:

  • Transcribing meetings or calls
  • Voice-controlled applications
  • Call center analytics
  • Accessibility tools (captions and subtitles)

📌 AI-900 exam tip:
If the question mentions converting spoken words into text, the answer is Azure AI Speech (Speech to Text).


2. Text to Speech

Text to Speech converts written text into natural-sounding spoken audio.

Key features:

  • Neural voices that sound human-like
  • Multiple languages and accents
  • Adjustable pitch, speed, and tone
  • Support for voice styles (e.g., cheerful, calm)

Common use cases:

  • Voice assistants
  • Read-aloud applications
  • Accessibility for visually impaired users
  • Automated announcements

📌 AI-900 exam tip:
If the scenario describes reading text out loud, think Text to Speech.


3. Speech Translation

Speech Translation converts spoken language into another language, either as text or synthesized speech.

Key features:

  • Real-time speech translation
  • Multi-language support
  • Can output translated speech or text

Common use cases:

  • Multilingual meetings
  • Travel and tourism apps
  • International customer support

📌 AI-900 exam tip:
Speech translation handles spoken language, while Azure Translator handles written text.


4. Speaker Recognition

Speaker Recognition identifies or verifies who is speaking based on their voice.

Capabilities include:

  • Speaker verification (confirming identity)
  • Speaker identification (determining who is speaking)

Common use cases:

  • Secure voice authentication
  • Call center speaker tracking
  • Personalized voice experiences

📌 AI-900 note:
You only need to understand what it does, not how voice models are trained.


5. Speech-to-Speech Scenarios

By combining Speech to Text, Translation, and Text to Speech, Azure AI Speech supports end-to-end voice experiences, such as:

  • Speaking in one language and hearing a response in another
  • Voice-based chatbots
  • Smart devices and assistants

How Azure AI Speech Differs from Other Azure AI Services

ServicePrimary Purpose
Azure AI SpeechSpoken language (audio)
Azure AI LanguageWritten text analysis
Azure TranslatorText translation
Azure AI VisionImages and video

📌 Exam pattern to watch for:
Microsoft often tests whether you can choose the right service based on the input type (audio vs text vs image).


Typical AI-900 Scenarios Involving Azure AI Speech

You should choose Azure AI Speech when a scenario involves:

  • Audio recordings
  • Live speech
  • Voice input or output
  • Real-time transcription
  • Spoken translation

Key Takeaways for the AI-900 Exam

  • Azure AI Speech focuses on spoken language, not written text
  • Core capabilities:
    • Speech to Text
    • Text to Speech
    • Speech Translation
    • Speaker Recognition
  • Exam questions are scenario-based, not technical
  • If the question mentions audio, voice, or speech, Azure AI Speech is usually the answer

Go to the Practice Exam Questions for this topic.

Go to the AI-900 Exam Prep Hub main page.