Tag: DP-900 Exam Prep

Practice Questions: Describe features of semi-structured data (DP-900 Exam Prep)

Practice Questions


Question 1

Which statement best describes semi-structured data?

A. Data stored strictly in rows and columns
B. Data with no identifiable organization
C. Data that uses a flexible structure with self-describing elements
D. Data that can only be stored in relational databases

Answer: C

Explanation:
Semi-structured data does not use rigid tables but contains self-describing elements (such as key–value pairs or tags) that provide partial structure.


Question 2

Which of the following is a common format for semi-structured data?

A. CSV
B. JSON
C. JPEG
D. MP4

Answer: B

Explanation:
JSON is one of the most common semi-structured formats used in APIs, applications, and document databases.


Question 3

Semi-structured data typically uses which schema approach?

A. Schema-on-write
B. Schema-on-delete
C. Schema-on-read
D. Fixed schema

Answer: C

Explanation:
Semi-structured data uses schema-on-read, meaning structure is applied when the data is queried, not when it is stored.


Question 4

Which Azure service is commonly used to store JSON-based semi-structured data?

A. Azure SQL Database
B. Azure Blob Storage only
C. Azure Cosmos DB
D. Azure Files

Answer: C

Explanation:
Azure Cosmos DB is a globally distributed NoSQL service designed to store semi-structured JSON documents.


Question 5

Which characteristic differentiates semi-structured data from structured data?

A. It cannot be queried
B. It requires primary keys
C. It allows records with different fields
D. It must be stored in spreadsheets

Answer: C

Explanation:
In semi-structured data, individual records can have different attributes, unlike structured data which enforces uniform columns.


Question 6

What does it mean when semi-structured data is described as self-describing?

A. It automatically documents itself
B. It contains embedded field names or tags
C. It always includes metadata files
D. It uses SQL syntax

Answer: B

Explanation:
Semi-structured data includes keys or tags (like JSON property names) that describe the values they contain.


Question 7

Which scenario best represents semi-structured data?

A. A customer table with fixed columns
B. A collection of images
C. Application logs stored as JSON documents
D. Audio recordings

Answer: C

Explanation:
JSON-based application logs are classic examples of semi-structured data.


Question 8

Why is semi-structured data well suited for rapidly changing applications?

A. It enforces strict schemas
B. It supports schema-on-read and flexible structures
C. It requires fewer storage resources
D. It must be normalized

Answer: B

Explanation:
Semi-structured data allows flexible schemas, making it ideal when data models evolve frequently.


Question 9

Which feature allows nested objects in semi-structured data?

A. Tabular organization
B. Hierarchical structure
C. Index clustering
D. Column constraints

Answer: B

Explanation:
Semi-structured data supports hierarchical and nested structures, such as JSON objects inside other objects.


Question 10

Which workload most commonly produces semi-structured data?

A. Financial ledger systems
B. Payroll databases
C. Web APIs and application telemetry
D. Spreadsheet reporting

Answer: C

Explanation:
Web services, application telemetry, and IoT systems frequently generate JSON or similar semi-structured formats.


✅ Quick Exam Takeaways

For DP-900, remember:

  • Semi-structured data uses schema-on-read
  • Records can have different fields
  • Supports nested / hierarchical structures
  • Common formats: JSON, XML
  • Common Azure service: Azure Cosmos DB
  • Ideal for applications, telemetry, logs, and IoT

Go to the DP-900 Exam Prep Hub main page.

Describe Features of Structured Data (DP-900 Exam Prep)

This post is a part of the DP-900: Microsoft Azure Data Fundamentals Exam Prep Hub. 
This topic falls under these sections:
Describe core data concepts (25–30%)
--> Describe ways to represent data
--> Describe features of structured data


Note that there are 10 practice questions (with answers and explanations) for each section to help you solidify your knowledge of the material. Also, there are 2 practice tests with 60 questions each available on the hub below the exam topics section.

Introduction

In the DP-900 exam, you’ll need to understand what structured data is, how it’s organized, and why its characteristics matter in the context of data storage, querying, and analytics.


What Is Structured Data?

Structured data refers to information that is organized in a well-defined format and schema, typically into tables with rows and columns. Each row represents a record (an instance of an entity), and each column represents an attribute of that entity — with a specific data type (like integer, string, date). Because the structure is defined up-front, systems know exactly how to store, validate, and query the data.

In practical terms, structured data is the type of data you find in:

  • Relational databases (e.g., Azure SQL Database)
  • Spreadsheets (e.g., Excel)
  • Data warehouses

In spreadsheets for example, each row is a record (e.g., a customer, an employee, a sale), and each column is a specific attribute (e.g., customer name, hire date, transaction amount). The schema tells the system exactly how the data is arranged. This predictably organized format makes structured data highly efficient for storage, retrieval, and analysis.


Key Features of Structured Data

1. Predefined Schema

Structured data has a fixed schema — a blueprint that defines how data is organized before it is stored or queried.

  • Every table has a set number of columns.
  • Each column has a defined name and data type (such as integer, decimal, date/time, text).
  • Attempts to insert data that does not conform to the schema are typically rejected.

Example:
An “Employee” table might have columns:
| EmployeeID (int) | Name (varchar) | HireDate (date) | Email (varchar) | IsActive (boolean) |

The database enforces rules that each column only accepts (and contains) compatible values. For example, the HireDate column, which has a “date” datatype, would not allow an entry of the employee’s job title (which is of the varchar or string datatype).


2. Tabular Organization

Structured data is most often stored in tables – and a table is made up of rows and columns:

  • Rows represent individual records (e.g., a customer, a sale, an order).
  • Columns represent data attributes (e.g., customer name, transaction amount, order date).

This row/column model is familiar from relational databases and spreadsheets. This tabular layout makes the data predictable, easy to view, understand, and ingest into analytical tools.


3. Strong Data Typing

Each column has a specific datatype and validation rules, such as:

  • Integer
  • String or text
  • Date/time
  • Decimal or numeric
  • Boolean

This data typing, which prevents invalid values from being stored, helps maintain data integrity, reduce errors, and ensure consistent interpretation of values. For example:

  • A “DateOfBirth” column only accepts dates.
  • A “Price” column only accepts numeric values.

Strong typing also allows database engines to optimize storage and querying. For example:

  • Numbers can be indexed for fast lookup
  • Dates can be naturally compared and sorted

4. Easy Querying and Analysis

Because structured data adheres to a strict model, with the structure being fixed and known, it can be easily accessed and analyzed using query languages like SQL (Structured Query Language). SQL enables operations such as filtering, aggregating, joining data across tables, sorting, and more. It can be used to generate reports from the data quickly and consistently.

This is why structured data is ideal for business reporting, dashboards, and operational systems.

Database systems like Azure SQL Database use SQL to let users retrieve specific records and perform analytics efficiently.

As an example, a SQL query like the one below – which retrieves the name and hire date of employees who were hired after 01/01/2024 – is simple and efficient when run against structured data:

SELECT Name, HireDate
FROM Employees
WHERE HireDate > '2024-01-01';

5. Enforced Data Integrity and Rules

Relational databases that store structured data use rules and constraints to preserve data integrity, such as:

  • Primary keys to uniquely identify records
  • Foreign keys to express and enforce relationships between tables
  • Constraints like NOT NULL, UNIQUE, and CHECK, prevent invalid/unwanted data

These rules, along with the data typing, ensure data remains accurate, consistent, and meaningful across the entire dataset. Every row follows the same structure. This makes the data predictable, reliable and trustworthy for business reporting and analytics.

Because of this consistency:

  • Data validation is easier.
  • Automated processes function reliably.
  • Analytical and reporting tools deliver accurate results.

6. Indexing and Optimization

Although this is not a core feature, structured data systems often support indexing, which speeds up querying of data by creating optimized paths to specific values. This makes search and retrieval faster and is very important, and necessary in many cases, when working with large datasets.


Where Structured Data Is Used

Structured data is foundational in many classic applications, including:

  • Relational databases such as Azure SQL Database
    • Many business applications use SQL databases (from a variety of vendors, including Microsoft, Oracle, and others) to store data.
  • Data warehouses that aggregate business data for analytics
  • Spreadsheet systems like Microsoft Excel

All of these use fixed schemas and are typically queried with SQL or BI tools. In the Azure ecosystem, services like Azure SQL Database and Azure Synapse Analytics are designed to handle structured data workloads — enabling high-performance querying, transaction processing, and analytics.


Why Structured Data Matters for DP-900

Structured data forms the foundation of many business applications and analytical systems.

Understanding structured data is essential because:

  • It’s the foundation of relational data concepts on Azure.
  • It’s the baseline for SQL and transactional processing.
  • Many Azure services prioritize structured workloads for performance and reliability.

Understanding its features helps you:

✔ Know when to use relational databases versus non-relational stores
✔ Understand how schema affects querying and data integrity
✔ Recognize the strengths and limitations of structured formats in Azure environments

Being clear on how structured data is defined, stored, and queried will help you distinguish it from semi-structured and unstructured data — a key skill in the DP-900 exam.


Benefits of Structured Data

Because of its organization and predictability, structured data offers several advantages:

Easy querying and reporting — supported directly by SQL.
High data integrity — enforced through schemas and validation rules.
Efficient storage and processing — optimized for performance.
Readily usable by analytics tools — ideal for dashboards and BI.

These benefits make structured data ideal for many enterprise workloads where accuracy, speed, and reliability are essential.


Structured Data vs. Other Data Types

To further your understanding of structured data, it helps to contrast it with the other data types:

  • Semi-structured data has some organization but lacks a strict schema (e.g., JSON).
  • Unstructured data has no inherent structure (e.g., text documents, images).

Structured data lives on the most rigid end of this spectrum, which is why it’s easy to manage with traditional databases and analytics tools.


Summary: Exam-Relevant Takeaways

  • Structured data sits in rows and columns with a predefined schema.
    • Structured Data = predefined schema + tables + columns
  • Each column has a defined data type and validation rules.
    • Strong typing and consistency
  • Structured data can be queried efficiently with SQL.
    • Efficient querying with SQL
  • Its organization supports consistent, reliable, and fast analytics.
    • Enforced integrity via constraints and keys

Understanding these features will help you recognize when structured data is the right representation and how it compares to other data forms in Azure scenarios.


Go to the Practice Exam Questions for this topic.

Go to the DP-900 Exam Prep Hub main page.

Practice Questions: Describe features of structured data (DP-900 Exam Prep)

Practice Questions


Question 1

Which statement best describes structured data?

A. Data stored as images and videos
B. Data organized in key-value pairs without a schema
C. Data organized in rows and columns with a predefined schema
D. Data that can only be stored in files

Answer: C

Explanation:
Structured data uses a fixed schema and is typically organized into tables with rows and columns, making it easy to query and analyze.


Question 2

Which of the following is a defining characteristic of structured data?

A. Schema-on-read
B. Schema-on-write
C. No enforced data types
D. Free-form text storage

Answer: B

Explanation:
Structured data uses schema-on-write, meaning the structure (tables, columns, data types) must be defined before data is stored.


Question 3

You have a table with columns CustomerID, Name, and JoinDate. Each column has a defined data type. What feature of structured data does this demonstrate?

A. Indexing
B. Semi-structured storage
C. Strong data typing
D. Unstructured formatting

Answer: C

Explanation:
Structured data enforces strong data typing, ensuring each column only accepts valid values (e.g., dates in date columns).


Question 4

Which language is most commonly used to query structured data?

A. Python
B. JSON
C. SQL
D. XML

Answer: C

Explanation:
Structured data is designed to be queried using SQL (Structured Query Language).


Question 5

Which Azure service is primarily designed to store structured relational data?

A. Azure Blob Storage
B. Azure Data Lake Storage
C. Azure SQL Database
D. Azure File Storage

Answer: C

Explanation:
Azure SQL Database is a managed relational database service optimized for structured data.


Question 6

What does a row represent in structured data?

A. A column definition
B. A schema
C. A single record or entity instance
D. A data type

Answer: C

Explanation:
Each row represents one complete record (for example, one customer or one order).


Question 7

Which feature helps ensure that every record in a table can be uniquely identified?

A. Foreign key
B. Primary key
C. Index
D. View

Answer: B

Explanation:
A primary key uniquely identifies each row and is a core integrity feature of structured data systems.


Question 8

Why is structured data well suited for reporting and dashboards?

A. It allows free-form documents
B. It does not require validation
C. It supports predictable schemas and efficient queries
D. It stores multimedia content

Answer: C

Explanation:
Fixed schemas and SQL support make structured data ideal for analytics, reporting, and BI workloads.


Question 9

Which of the following best illustrates structured data?

A. A collection of photos
B. JSON log files
C. A spreadsheet with defined columns
D. Audio recordings

Answer: C

Explanation:
Spreadsheets with consistent columns and rows are classic examples of structured data.


Question 10

What is a major benefit of enforcing constraints such as NOT NULL and UNIQUE?

A. Faster internet connections
B. Reduced storage costs
C. Improved data integrity
D. Automatic encryption

Answer: C

Explanation:
Constraints help maintain accuracy and consistency, which is a key strength of structured data systems.


✅ Quick Exam Takeaway

For DP-900, remember:

  • Structured data uses tables (rows + columns)
  • Requires a predefined schema
  • Enforces data types and constraints
  • Is queried with SQL
  • Commonly lives in relational databases like Azure SQL Database

Go to the DP-900 Exam Prep Hub main page.