Group and Aggregate Rows (PL-300 Exam Prep)

This post is a part of the PL-300: Microsoft Power BI Data Analyst Exam Prep Hub; and this topic falls under these sections:
Prepare the data (25–30%)
   --> Transform and load the data
      --> Group and Aggregate Rows

Note that there are 10 practice questions (with answers and explanations) for each section to help you solidify your knowledge of the material. Also, there are 2 practice tests with 60 questions each available on the hub below the exam topics section.

Grouping and aggregating rows is a foundational data preparation task used to summarize detailed data into meaningful totals before it is loaded into the Power BI data model. For the PL-300: Microsoft Power BI Data Analyst exam, Microsoft evaluates your understanding of how, when, and why to group data in Power Query, and how those decisions affect the data model and reporting outcomes.

Why Group and Aggregate Rows?

Grouping and aggregation are used to:

Summarize transactional or granular data
Reduce dataset size and improve performance
Shape fact tables to the correct grain
Prepare data for simpler reporting
Offload static calculations from DAX into Power Query

Exam Focus: The exam often tests decision-making—specifically whether aggregation should occur in Power Query or later in DAX.

Where Grouping Happens in Power BI

Grouping and aggregation for this exam objective occur in Power Query Editor, using:

Home → Group By
Transform → Group By

This transformation physically reshapes the dataset before it is loaded into the model.

Key Distinction: Power Query grouping changes the stored data. DAX measures calculate results dynamically at query time.

The Group By Operation

When using Group By, you define:

1. Group By Columns

Columns that determine how rows are grouped, such as:

Customer
Product
Date
Region

Each unique combination of these columns produces one row in the output.

2. Aggregation Columns

New columns created using aggregation functions applied to grouped rows.

Common Aggregation Functions (Exam-Relevant)

Power Query supports several aggregation functions frequently referenced on the PL-300 exam:

Sum – Adds numeric values
Count Rows – Counts rows in each group
Count Distinct Rows – Counts unique values
Average – Calculates the mean
Min / Max – Returns lowest or highest values
All Rows – Produces nested tables for advanced scenarios

Exam Tip: Be clear on the difference between Count Rows and Count Distinct—this is commonly tested.

Grouping by One vs Multiple Columns

Grouping by a Single Column

Used to create high-level summaries such as:

Total sales per customer
Number of orders per product

Results in one row per unique value.

Grouping by Multiple Columns

Used when summaries must retain more detail, such as:

Sales by customer and year
Quantity by product and region

The output grain is defined by the combination of columns.

Impact on the Data Model

Grouping and aggregating rows in Power Query has a direct impact on the data model, which is an important exam consideration.

Key Impacts:

Reduced row count improves model performance
Changes the grain of fact tables
May eliminate the need for certain DAX measures
Can simplify relationships by reducing cardinality

Important Trade-Off:

Once data is aggregated in Power Query:

You cannot recover lower-level detail
You lose flexibility for drill-down analysis
Time intelligence and slicer-driven behavior may be limited

Exam Insight: Microsoft expects you to recognize when aggregation improves performance and when it limits analytical flexibility.

Group and Aggregate vs DAX Measures (Highly Tested)

Understanding where aggregation belongs is a core PL-300 skill.

Group in Power Query When:

Aggregation logic is fixed
You want to reduce data volume
Performance optimization is required
The dataset should load at a specific grain

Use DAX Measures When:

Aggregations must respond to slicers
Time intelligence is required
Users need flexible, dynamic calculations

Common Mistakes (Often Tested)

These are frequent pitfalls that appear in exam scenarios:

Grouping too early, eliminating needed detail
Aggregating data that should remain transactional
Using Sum on columns that should be counted
Confusing Count Rows with Count Distinct
Grouping in Power Query when a DAX measure is more appropriate
Forgetting to validate results after grouping
Incorrect data types causing aggregation errors

Exam Pattern: Many questions present a “wrong but plausible” grouping choice—look carefully at reporting requirements.

Best Practices for PL-300 Candidates

Understand the grain of your data before grouping
Group only when it adds clear value
Validate totals after aggregation
Prefer Power Query grouping for static summaries
Use DAX for dynamic, filter-aware calculations
Know when not to group:
- When users need drill-down capability
- When calculations must respond to slicers
- When time intelligence is required
- When future reporting needs are unknown

How This Appears on the PL-300 Exam

Expect scenario-based questions such as:

You need to reduce model size and improve performance. Where should aggregation occur?
Which aggregation produces unique counts per group?
What is the impact of grouping data before loading it into the model?
Why would grouping in Power Query be inappropriate in this scenario?

Key Takeaways

✔ Grouping is performed in Power Query, not DAX
✔ Aggregation reshapes data before modeling
✔ Grouping impacts performance, flexibility, and grain
✔ Know both when to group and when not to
✔ This topic tests data modeling judgment, not just mechanics

Practice Questions

Go to the Practice Exam Questions for this topic.

The Data Community

Group and Aggregate Rows (PL-300 Exam Prep)

Why Group and Aggregate Rows?

Where Grouping Happens in Power BI