Practice Questions
Question 1
Which statement best describes semi-structured data?
A. Data stored strictly in rows and columns
B. Data with no identifiable organization
C. Data that uses a flexible structure with self-describing elements
D. Data that can only be stored in relational databases
✅ Answer: C
Explanation:
Semi-structured data does not use rigid tables but contains self-describing elements (such as key–value pairs or tags) that provide partial structure.
Question 2
Which of the following is a common format for semi-structured data?
A. CSV
B. JSON
C. JPEG
D. MP4
✅ Answer: B
Explanation:
JSON is one of the most common semi-structured formats used in APIs, applications, and document databases.
Question 3
Semi-structured data typically uses which schema approach?
A. Schema-on-write
B. Schema-on-delete
C. Schema-on-read
D. Fixed schema
✅ Answer: C
Explanation:
Semi-structured data typically uses schema-on-read: the structure is interpreted when the data is queried, not enforced when it is stored.
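To make schema-on-read concrete, here is a minimal Python sketch. The `raw_store` list, the sample records, and the `read_with_schema` helper are all hypothetical names made up for illustration; they stand in for whatever storage and query layer is actually used.

```python
import json

# Raw records are stored as-is; nothing validates their shape at write time.
# (Hypothetical sample data.)
raw_store = [
    '{"id": 1, "name": "Ana", "email": "ana@example.com"}',
    '{"id": 2, "name": "Ben", "phone": "555-0100"}',
]

def read_with_schema(raw_lines, fields):
    """Apply a schema at read time: keep the requested fields, use None where one is missing."""
    rows = []
    for line in raw_lines:
        doc = json.loads(line)
        rows.append({field: doc.get(field) for field in fields})
    return rows

# The reader decides which fields matter for this particular query.
contacts = read_with_schema(raw_store, ["id", "email"])
print(contacts)
```

The second record has no `email`, yet it was stored without complaint; the gap only becomes visible (as `None`) when a reader asks for that field.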
Question 4
Which Azure service is commonly used to store JSON-based semi-structured data?
A. Azure SQL Database
B. Azure Blob Storage only
C. Azure Cosmos DB
D. Azure Files
✅ Answer: C
Explanation:
Azure Cosmos DB is a globally distributed NoSQL service designed to store semi-structured JSON documents.
Question 5
Which characteristic differentiates semi-structured data from structured data?
A. It cannot be queried
B. It requires primary keys
C. It allows records with different fields
D. It must be stored in spreadsheets
✅ Answer: C
Explanation:
In semi-structured data, individual records can have different attributes, unlike structured data, which enforces the same columns for every row.
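As a rough illustration of records with different fields, the sketch below loads three hypothetical product documents and compares the fields they carry. The data and variable names are assumptions chosen for the example.

```python
import json

# Three documents in the same collection, each with its own set of fields.
# (Hypothetical sample data.)
documents = json.loads("""
[
  {"id": 1, "name": "Laptop", "price": 999.0},
  {"id": 2, "name": "E-book", "price": 12.5, "download_url": "https://example.com/book"},
  {"id": 3, "name": "T-shirt", "sizes": ["S", "M", "L"]}
]
""")

# Every field that appears in at least one document.
all_fields = sorted({key for doc in documents for key in doc})

# Fields that appear in every document.
shared_fields = sorted(set.intersection(*(set(doc) for doc in documents)))

print(all_fields)
print(shared_fields)
```

A relational table would need a column for every entry in `all_fields`, with NULLs where a value does not apply; here each document simply omits what it does not need.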
Question 6
What does it mean when semi-structured data is described as self-describing?
A. It automatically documents itself
B. It contains embedded field names or tags
C. It always includes metadata files
D. It uses SQL syntax
✅ Answer: B
Explanation:
Semi-structured data includes keys or tags (like JSON property names) that describe the values they contain.
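One way to see what "self-describing" means is to compare a positional row with a JSON record. This is only a sketch; the values and variable names are hypothetical.

```python
import json

# A positional row: the meaning of each value depends on a header defined elsewhere.
csv_like_row = ["42", "Ana", "Lisbon"]

# A JSON record: each value travels together with the name that describes it.
json_record = json.loads('{"customer_id": 42, "name": "Ana", "city": "Lisbon"}')

# The field names can be recovered from the record itself.
field_names = list(json_record.keys())
print(field_names)
```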
Question 7
Which scenario best represents semi-structured data?
A. A customer table with fixed columns
B. A collection of images
C. Application logs stored as JSON documents
D. Audio recordings
✅ Answer: C
Explanation:
JSON-based application logs are a classic example of semi-structured data: each entry carries named fields, but entries do not all have to share the same fields.
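For a sense of what working with such logs looks like, here is a small sketch that parses a few JSON log lines and summarizes them. The log format, field names (`ts`, `level`, `msg`), and values are assumptions for illustration, not a standard.

```python
import json
from collections import Counter

# Hypothetical JSON log lines; note that extra fields vary from entry to entry.
log_lines = [
    '{"ts": "2024-01-01T10:00:00Z", "level": "INFO", "msg": "started"}',
    '{"ts": "2024-01-01T10:00:05Z", "level": "ERROR", "msg": "db timeout", "retry": 3}',
    '{"ts": "2024-01-01T10:00:09Z", "level": "INFO", "msg": "request", "path": "/health"}',
]

entries = [json.loads(line) for line in log_lines]

# Count entries per level and collect the error messages.
level_counts = Counter(entry["level"] for entry in entries)
error_messages = [entry["msg"] for entry in entries if entry["level"] == "ERROR"]

print(dict(level_counts))
print(error_messages)
```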
Question 8
Why is semi-structured data well suited for rapidly changing applications?
A. It enforces strict schemas
B. It supports schema-on-read and flexible structures
C. It requires fewer storage resources
D. It must be normalized
✅ Answer: B
Explanation:
Semi-structured data allows flexible schemas, making it ideal when data models evolve frequently.
Question 9
Which feature allows nested objects in semi-structured data?
A. Tabular organization
B. Hierarchical structure
C. Index clustering
D. Column constraints
✅ Answer: B
Explanation:
Semi-structured data supports hierarchical and nested structures, such as JSON objects inside other objects.
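The nesting described here can be sketched with a single hypothetical order document, where one object sits inside another and an array holds further objects. All names and values are made up for the example.

```python
import json

# A hypothetical order: "customer" contains a nested "address" object,
# and "items" is an array of objects.
order = json.loads("""
{
  "order_id": "A-100",
  "customer": {"name": "Ana", "address": {"city": "Lisbon", "country": "PT"}},
  "items": [{"sku": "X1", "qty": 2}, {"sku": "Y7", "qty": 1}]
}
""")

# Walk down the hierarchy one key at a time.
city = order["customer"]["address"]["city"]

# Aggregate over the nested array.
total_qty = sum(item["qty"] for item in order["items"])

print(city, total_qty)
```

In a relational design, the customer, address, and items would usually live in separate tables joined by keys; the document keeps them together in one hierarchy.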
Question 10
Which workload most commonly produces semi-structured data?
A. Financial ledger systems
B. Payroll databases
C. Web APIs and application telemetry
D. Spreadsheet reporting
✅ Answer: C
Explanation:
Web services, application telemetry, and IoT systems frequently generate JSON or similar semi-structured formats.
✅ Quick Exam Takeaways
For DP-900, remember:
- Semi-structured data uses schema-on-read
- Records can have different fields
- Supports nested / hierarchical structures
- Common formats: JSON, XML
- Common Azure service: Azure Cosmos DB
- Ideal for applications, telemetry, logs, and IoT
