Question 1

You need to ingest data from an on-premises SQL Server database into a Microsoft Fabric Lakehouse on a daily schedule.

Which Fabric component should you use?

A. Eventstream
B. Data Pipeline
C. KQL Queryset
D. Semantic Model

Answer: B

Explanation

Data Pipelines are designed for orchestrating and scheduling data movement and transformation activities. Eventstreams are intended for streaming data rather than scheduled batch ingestion.

Question 2

You are designing a medallion architecture in a Fabric Lakehouse.

Match each layer to its primary purpose.

Layer	Purpose
1. Bronze	A. Business-ready data
2. Silver	B. Raw ingested data
3. Gold	C. Cleaned and transformed data

Answer

1 → B
2 → C
3 → A

Explanation

Bronze stores raw source data.
Silver stores cleansed and standardized data.
Gold stores curated business-ready datasets.

Question 3

Which TWO capabilities are provided by Delta Lake tables?

(Choose two.)

A. ACID transactions
B. Schema enforcement
C. XML indexing
D. Star schema generation

Answers: A, B

Explanation

Delta Lake supports ACID transactions, schema enforcement, schema evolution, and time travel. XML indexing and star schema generation are not Delta Lake features.

Question 4

A Fabric Warehouse contains a fact table with 2 billion rows.

Users frequently filter queries by OrderDate.

What should you consider to improve performance?

A. Remove statistics
B. Convert the table to CSV files
C. Use partitioning based on OrderDate
D. Disable Delta support

Answer: C

Explanation

Partitioning on commonly filtered columns reduces data scanning and improves query performance.

Question 5

Fill in the blank.

The Fabric storage layer that provides a unified logical data lake across Fabric workloads is called __________.

Answer

OneLake

Explanation

OneLake serves as the centralized storage layer across Fabric workloads.

Question 6

You need to process IoT telemetry arriving continuously from thousands of sensors.

Which Fabric component is most appropriate?

A. Warehouse
B. Notebook
C. Eventstream
D. Dataflow Gen2

Answer: C

Explanation

Eventstreams are designed specifically for real-time ingestion and routing of streaming events.

Question 7

A notebook performs the following operations:

Reads Bronze data
Removes duplicates
Standardizes formats
Writes cleaned data

Which medallion layer is being produced?

A. Bronze
B. Silver
C. Gold
D. Semantic

Answer: B

Explanation

Silver layer contains cleansed and transformed data.

Question 8

Which THREE sources can Eventstreams directly ingest from?

(Choose three.)

A. Azure Event Hubs
B. Sample data source
C. Custom application streams
D. Power BI semantic models

Answers: A, B, C

Explanation

Eventstreams support ingestion from Event Hubs, custom streams, Fabric sources, and sample data.

Question 9

You want to create a shortcut in a Lakehouse that references data stored in another Lakehouse without duplicating the data.

What should you use?

A. Mirrored Database
B. Warehouse View
C. OneLake Shortcut
D. Dataflow Gen2

Answer: C

Explanation

OneLake shortcuts provide virtual access to data without physically copying it.

Question 10

Match each workload to the most appropriate language.

Workload	Language
1. Warehouse queries	A. KQL
2. Eventhouse analytics	B. T-SQL
3. Spark notebook transformations	C. PySpark

Answer

1 → B
2 → A
3 → C

Explanation

Warehouses use T-SQL, Eventhouses use KQL, and Spark notebooks commonly use PySpark.

Question 11

Which statement about Dataflow Gen2 is correct?

A. It is used only for streaming data.
B. It provides low-code data transformation.
C. It replaces Spark entirely.
D. It requires KQL.

Answer: B

Explanation

Dataflow Gen2 provides a Power Query-based low-code transformation experience.

Question 12

You need to deduplicate customer records during ingestion.

Which Spark function is commonly used?

A. distinct() or dropDuplicates()
B. cache()
C. collect()
D. union()

Answer: A

Explanation

dropDuplicates() and distinct() are standard Spark methods for removing duplicate records.

Question 13

A Fabric Warehouse query is running slowly.

You discover many joins use columns with mismatched data types.

What should you do?

A. Disable indexing
B. Increase storage only
C. Align data types across joined columns
D. Convert all columns to VARCHAR

Answer: C

Explanation

Matching data types allows the optimizer to generate more efficient execution plans.

Question 14

Which TWO benefits are provided by OneLake shortcuts?

(Choose two.)

A. Reduced data duplication
B. Faster networking hardware
C. Centralized access to data
D. Automatic data encryption removal

Answers: A, C

Explanation

Shortcuts minimize data duplication while enabling centralized access.

Question 15

A company wants to analyze clickstream events with second-level latency.

Which Fabric workload is best suited?

A. Eventhouse
B. Warehouse
C. Dataflow Gen2
D. Semantic Model

Answer: A

Explanation

Eventhouse is optimized for real-time analytics and telemetry workloads.

Question 16

Fill in the blank.

The Spark feature used to continuously process incoming streaming data is called Structured __________.

Answer

Streaming

Explanation

Spark Structured Streaming provides scalable stream processing.

Question 17

Which KQL operator is used to aggregate data?

A. join
B. summarize
C. extend
D. where

Answer: B

Explanation

summarize performs aggregation operations.

Question 18

A notebook fails because a referenced Delta table no longer exists.

What category of issue is this?

A. Authentication issue
B. Schema drift issue
C. Missing resource dependency
D. Capacity issue

Answer: C

Explanation

The notebook references an object that no longer exists.

Question 19

You need to schedule a notebook execution every night.

Which Fabric component should orchestrate this process?

A. Eventstream
B. Pipeline
C. Eventhouse
D. Semantic Model

Answer: B

Explanation

Pipelines provide orchestration and scheduling capabilities.

Question 20

Which operation helps compact small Delta files into larger files?

A. VACUUM
B. ANALYZE
C. OPTIMIZE
D. CACHE

Answer: C

Explanation

OPTIMIZE compacts files and improves query performance.

Question 21

A data engineer wants to improve Spark query performance for repeatedly accessed datasets.

Which technique should be used?

A. VACUUM
B. Partition deletion
C. Cache DataFrames
D. Convert to CSV

Answer: C

Explanation

Caching stores frequently accessed data in memory.

Question 22

Select all valid Fabric items.

(Choose three.)

A. Lakehouse
B. Eventhouse
C. Warehouse
D. Azure VM

Answers: A, B, C

Explanation

Lakehouse, Eventhouse, and Warehouse are Fabric items. Azure VM is not.

Question 23

A streaming solution must enrich incoming events with reference data.

Which technology is most appropriate?

A. Structured Streaming
B. PowerPoint
C. Semantic Model Refresh
D. OneLake Shortcut

Answer: A

Explanation

Structured Streaming supports stream-to-static joins for enrichment.

Question 24

Match the monitoring tool to its purpose.

Tool	Purpose
1. Monitoring Hub	A. View workload execution history
2. Capacity Metrics	B. Analyze capacity utilization
3. Spark Monitoring	C. Diagnose Spark execution

Answer

1 → A
2 → B
3 → C

Explanation

Each tool focuses on a different operational aspect of Fabric.

Question 25

Which statement about Eventhouse is TRUE?

A. It only supports batch data.
B. It stores data using KQL databases.
C. It replaces OneLake.
D. It cannot query streaming data.

Answer: B

Explanation

Eventhouse uses KQL databases optimized for real-time analytics.

Question 26

A Lakehouse table experiences slow query performance due to excessive small files.

What is the best first action?

A. Delete the table
B. Run OPTIMIZE
C. Remove partitions
D. Disable Delta

Answer: B

Explanation

OPTIMIZE consolidates files and improves query efficiency.

Question 27

A company needs historical tracking of changes to Delta tables.

Which feature provides this capability?

A. Time Travel
B. Direct Lake
C. Eventstream
D. Monitoring Hub

Answer: A

Explanation

Delta Time Travel enables querying previous versions of data.

Question 28

You need to identify why a pipeline failed overnight.

Where should you start?

A. Monitoring Hub execution details
B. Power BI report settings
C. Semantic model relationships
D. Capacity scaling settings

Answer: A

Explanation

Monitoring Hub provides execution history, logs, and failure details.

Question 29

Which TWO actions commonly improve Warehouse query performance?

(Choose two.)

A. Maintain statistics
B. Use partitioning strategically
C. Convert all columns to strings
D. Use SELECT *

Answers: A, B

Explanation

Statistics and partitioning improve optimization and reduce scanning.

Question 30

A company wants Power BI reports to query Lakehouse data with minimal data duplication and high performance.

Which storage mode should be considered first?

A. Import Mode
B. DirectQuery
C. Direct Lake
D. Live Connection to Excel

Answer: C

Explanation

Direct Lake allows Power BI to access OneLake data directly while delivering near-import performance without data duplication.

Go to the DP-700 Exam Prep Hub main page.