DP-700 Practice Exam #3 (30 questions with answers)


Question 1

You need to provide data engineers access to create Lakehouses and Data Warehouses within a Fabric workspace. However, they should not be able to manage workspace permissions.

Which workspace role should you assign?

A. Viewer

B. Contributor

C. Member

D. Admin

Answer: C

Explanation

Members can create and manage content within a workspace but cannot perform all administrative functions available to Admins.

  • A is incorrect because Viewers have read-only access.
  • B is incorrect because Contributors cannot manage all workspace content ownership scenarios.
  • C is correct.
  • D is incorrect because Admin grants excessive permissions.

Question 2

You are implementing a Medallion architecture.

Match each layer to its primary purpose.

LayerPurpose
1. BronzeA. Business-ready curated data
2. SilverB. Raw ingested data
3. GoldC. Cleansed and validated data

Answer

  • 1 → B
  • 2 → C
  • 3 → A

Explanation

Bronze stores raw data, Silver contains cleansed and transformed data, and Gold provides analytics-ready datasets.


Question 3

A Fabric Eventstream receives telemetry from 50,000 IoT devices.

You need to route the incoming events to multiple destinations simultaneously.

Which feature should you use?

A. Dataflow Gen2

B. Eventstream routing

C. Notebook scheduling

D. Warehouse shortcuts

Answer: B

Explanation

Eventstreams can route a single event stream to multiple destinations.


Question 4

Select TWO answers.

Which destinations can Eventstreams write to directly?

A. Eventhouse

B. Lakehouse

C. SQL Database

D. Notebook

E. Activator

Answer: A, E

Explanation

Common Eventstream destinations include:

  • Eventhouse
  • Activator
  • Lakehouse
  • KQL Database

Notebook is not a direct destination.


Question 5

Fill in the blank.

A OneLake shortcut allows users to access data without creating a physical __________ of the data.

Answer

Copy

Explanation

Shortcuts create logical references to data rather than duplicating storage.


Question 6

A Fabric notebook processing job frequently fails due to insufficient executor resources.

What should you do first?

A. Increase notebook timeout

B. Rebuild the Lakehouse

C. Scale Spark resources

D. Delete checkpoints

Answer: C

Explanation

Resource shortages are typically addressed by increasing Spark capacity or executor allocation.


Question 7

You are designing a real-time analytics solution.

Requirements:

  • Ingest millions of events per minute
  • Sub-second querying
  • Time-series analysis

Which technology is most appropriate?

A. Dataflow Gen2

B. Eventhouse

C. Warehouse

D. Semantic Model

Answer: B

Explanation

Eventhouse is optimized for high-volume streaming analytics and KQL workloads.


Question 8

Which KQL operator is used to aggregate records?

A. project

B. where

C. extend

D. summarize

Answer: D

Explanation

The summarize operator performs aggregations such as count(), avg(), and sum().


Question 9

A Dataflow Gen2 refresh suddenly begins failing after a source schema change.

What is the most likely cause?

A. Workspace permissions

B. Network latency

C. Column mismatch

D. Spark pool scaling

Answer: C

Explanation

Schema changes commonly cause transformation failures when expected columns are missing or renamed.


Question 10

Select THREE answers.

Which actions help improve Delta table performance?

A. OPTIMIZE

B. V-Order

C. Small file generation

D. Compaction

E. Disabling partitioning

Answer: A, B, D

Explanation

OPTIMIZE, V-Order, and compaction improve query efficiency and storage organization.


Question 11

You need to monitor Fabric pipeline execution history.

Where should you start?

A. Capacity Metrics App

B. Pipeline monitoring pane

C. Purview Hub

D. Semantic model settings

Answer: B

Explanation

Pipeline monitoring provides execution status, duration, and error details.


Question 12

A streaming solution requires SQL-like processing over continuously arriving data.

Which option should you choose?

A. Data Factory

B. Eventstream

C. Warehouse stored procedures

D. Power BI Report

Answer: B

Explanation

Eventstreams support real-time stream processing and routing.


Question 13

Match the technology to its primary purpose.

TechnologyPurpose
1. EventhouseA. Batch orchestration
2. PipelineB. Real-time analytics
3. NotebookC. Spark processing

Answer

  • 1 → B
  • 2 → A
  • 3 → C

Question 14

You need to reduce query latency against large Delta tables.

Which strategy is most effective?

A. Increase report refresh frequency

B. Create more workspaces

C. Enable V-Order optimization

D. Disable partitioning

Answer: C

Explanation

V-Order reorganizes storage to improve analytical query performance.


Question 15

A warehouse query scans billions of rows unnecessarily.

What should you investigate first?

A. Query filters

B. Workspace permissions

C. Capacity region

D. Semantic model ownership

Answer: A

Explanation

Missing predicates often cause excessive data scans.


Question 16

Select TWO answers.

Which items can be sources for OneLake shortcuts?

A. ADLS Gen2

B. Amazon S3

C. Power BI Report

D. Notebook

Answer: A, B

Explanation

Shortcuts commonly connect to ADLS Gen2 and Amazon S3 data.


Question 17

Fill in the blank.

Spark Structured Streaming processes data using incremental __________.

Answer

Micro-batches

Explanation

Structured Streaming generally operates through continuous micro-batch execution.


Question 18

A Fabric capacity consistently shows high CU utilization.

What should you do?

A. Delete semantic models

B. Reduce storage

C. Review workload consumption patterns

D. Disable monitoring

Answer: C

Explanation

Investigating workload usage is the first step before scaling or optimization.


Question 19

Which KQL function is commonly used to create time windows?

A. datetime_diff()

B. bin()

C. tostring()

D. strcat()

Answer: B

Explanation

bin() groups timestamps into fixed intervals for windowed analysis.


Question 20

You need to process late-arriving streaming events.

Which concept should be configured?

A. Watermarking

B. Capacity scaling

C. Workspace role assignment

D. Notebook scheduling

Answer: A

Explanation

Watermarks help define how late data should be handled.


Question 21

A notebook execution fails with a Python module import error.

What is the most likely cause?

A. Missing package dependency

B. Lakehouse corruption

C. Capacity throttling

D. Semantic model refresh issue

Answer: A

Explanation

Import failures typically indicate missing libraries.


Question 22

Select THREE answers.

Which metrics are useful when monitoring data ingestion?

A. Throughput

B. Latency

C. Failure rate

D. Workspace color theme

E. Event volume

Answer: A, B, C

Explanation

Throughput, latency, and failures are key ingestion health indicators.


Question 23

You want to accelerate analytical queries against external data accessed through shortcuts.

Which feature should you consider?

A. Semantic Links

B. Query Acceleration

C. Spark Monitoring

D. Activator

Answer: B

Explanation

Query Acceleration improves performance when querying external shortcut data.


Question 24

A pipeline contains five independent activities.

How can you reduce overall execution time?

A. Execute activities sequentially

B. Disable retries

C. Run activities in parallel

D. Convert activities to reports

Answer: C

Explanation

Independent tasks should execute concurrently whenever possible.


Question 25

Match the optimization technique to its purpose.

TechniquePurpose
1. PartitioningA. Reduce file count
2. CompactionB. Improve data pruning
3. CachingC. Reduce repeated reads

Answer

  • 1 → B
  • 2 → A
  • 3 → C

Question 26

Which Fabric component is best suited for orchestration of complex ETL workflows?

A. Eventhouse

B. Data Pipeline

C. Semantic Model

D. Activator

Answer: B

Explanation

Pipelines coordinate and orchestrate end-to-end workflows.


Question 27

A real-time dashboard requires alerts when sensor temperature exceeds a threshold.

Which Fabric capability is most appropriate?

A. Activator

B. Warehouse

C. Notebook

D. V-Order

Answer: A

Explanation

Activator enables event-driven actions and alerts.


Question 28

Select TWO answers.

Which actions commonly improve Spark job performance?

A. Reduce shuffle operations

B. Use broadcast joins when appropriate

C. Increase small file generation

D. Disable partition pruning

Answer: A, B

Explanation

Reducing shuffles and using broadcast joins can significantly improve Spark efficiency.


Question 29

A warehouse workload contains many repeated analytical queries.

Which optimization is most likely to help?

A. Remove statistics

B. Increase file fragmentation

C. Use result caching

D. Disable parallelism

Answer: C

Explanation

Result caching can dramatically improve repeated query performance.


Question 30

A data engineering team must investigate intermittent Eventstream ingestion failures.

What should be reviewed first?

A. Report visualizations

B. Eventstream monitoring and diagnostics

C. Semantic model relationships

D. Workspace themes

Answer: B

Explanation

Eventstream diagnostics provide details about ingestion health, routing issues, throughput bottlenecks, and destination errors, making them the best starting point for troubleshooting.


Go to the DP-700 Exam Prep Hub main page.

Leave a comment