Tag: Speech Recognition

Practice Questions: Identify Features and Uses for Speech Recognition and Synthesis (AI-900 Exam Prep)

Practice Questions


Question 1

A company wants to convert recorded customer support calls into written transcripts for analysis.
Which NLP workload is required?

A. Speech synthesis
B. Language modeling
C. Speech recognition
D. Text translation

Correct Answer: C

Explanation:
Speech recognition converts spoken audio into text. Transcribing recorded calls is a classic speech recognition scenario.


Question 2

An application reads incoming emails aloud to visually impaired users.
Which capability does this require?

A. Speech recognition
B. Speech synthesis
C. Key phrase extraction
D. Sentiment analysis

Correct Answer: B

Explanation:
Speech synthesis converts text into spoken audio, making it ideal for reading text aloud.


Question 3

Which Azure service provides both speech-to-text and text-to-speech capabilities?

A. Azure AI Language
B. Azure AI Vision
C. Azure AI Speech
D. Azure Machine Learning

Correct Answer: C

Explanation:
Azure AI Speech supports both speech recognition (speech-to-text) and speech synthesis (text-to-speech).


Question 4

A voice-controlled virtual assistant must understand spoken commands from users.
Which NLP workload does this scenario require?

A. Text analytics
B. Speech synthesis
C. Speech recognition
D. Language translation

Correct Answer: C

Explanation:
Understanding spoken commands requires converting speech into text, which is speech recognition.


Question 5

A chatbot responds verbally to users after processing their requests.
Which capability enables the chatbot to speak its responses?

A. Speech recognition
B. Speech synthesis
C. Entity recognition
D. Language detection

Correct Answer: B

Explanation:
Speech synthesis generates spoken audio from text, enabling verbal responses.


Question 6

Which input and output combination correctly describes speech recognition?

A. Text input → Audio output
B. Audio input → Text output
C. Text input → Text output
D. Audio input → Audio output

Correct Answer: B

Explanation:
Speech recognition takes audio input and produces text output.


Question 7

Which scenario uses both speech recognition and speech synthesis?

A. Extracting key phrases from a document
B. Translating text from English to Spanish
C. A voice assistant that listens and responds verbally
D. Analyzing customer sentiment in reviews

Correct Answer: C

Explanation:
A voice assistant listens (speech recognition) and speaks back (speech synthesis), using both capabilities together.


Question 8

A system generates natural-sounding voices with adjustable pitch and speed.
Which technology is being used?

A. Speech recognition
B. Language modeling
C. Speech synthesis
D. Optical character recognition

Correct Answer: C

Explanation:
Speech synthesis creates spoken audio and can adjust voice characteristics such as pitch and speed.


Question 9

Which phrase in a question most strongly indicates a speech recognition workload?

A. “Identify important terms in a document”
B. “Analyze the emotional tone of text”
C. “Convert spoken instructions into written commands”
D. “Generate audio from text responses”

Correct Answer: C

Explanation:
Converting spoken instructions into text is speech recognition.


Question 10

Which Azure NLP workload is most appropriate for real-time meeting transcription?

A. Speech synthesis
B. Speech recognition
C. Entity recognition
D. Language detection

Correct Answer: B

Explanation:
Real-time transcription requires converting live audio into text, which is speech recognition.


Final Exam Tips

  • Speech → Text = Speech recognition
  • Text → Speech = Speech synthesis
  • Voice assistants usually require both
  • Azure service to remember: Azure AI Speech
  • Watch for keywords like:
    • Transcribe, dictate, spoken commands → Recognition
    • Read aloud, generate voice, spoken response → Synthesis

Go to the AI-900 Exam Prep Hub main page.