This post is a part of the PL-300: Microsoft Power BI Data Analyst Exam Prep Hub; and this topic falls under these sections:
Prepare the data (25–30%)
--> Transform and load the data
--> Group and Aggregate Rows
Note that there are 10 practice questions (with answers and explanations) for each section to help you solidify your knowledge of the material. Also, there are 2 practice tests with 60 questions each available on the hub below the exam topics section.
Grouping and aggregating rows is a foundational data preparation task used to summarize detailed data into meaningful totals before it is loaded into the Power BI data model. For the PL-300: Microsoft Power BI Data Analyst exam, Microsoft evaluates your understanding of how, when, and why to group data in Power Query, and how those decisions affect the data model and reporting outcomes.
Why Group and Aggregate Rows?
Grouping and aggregation are used to:
- Summarize transactional or granular data
- Reduce dataset size and improve performance
- Shape fact tables to the correct grain
- Prepare data for simpler reporting
- Offload static calculations from DAX into Power Query
Exam Focus: The exam often tests decision-making—specifically whether aggregation should occur in Power Query or later in DAX.
Where Grouping Happens in Power BI
Grouping and aggregation for this exam objective occur in Power Query Editor, using:
- Home → Group By
- Transform → Group By
This transformation physically reshapes the dataset before it is loaded into the model.
Key Distinction: Power Query grouping changes the stored data. DAX measures calculate results dynamically at query time.
The Group By Operation
When using Group By, you define:
1. Group By Columns
Columns that determine how rows are grouped, such as:
- Customer
- Product
- Date
- Region
Each unique combination of these columns produces one row in the output.
2. Aggregation Columns
New columns created using aggregation functions applied to grouped rows.
Common Aggregation Functions (Exam-Relevant)
Power Query supports several aggregation functions frequently referenced on the PL-300 exam:
- Sum – Adds numeric values
- Count Rows – Counts rows in each group
- Count Distinct Rows – Counts unique values
- Average – Calculates the mean
- Min / Max – Returns lowest or highest values
- All Rows – Produces nested tables for advanced scenarios
Exam Tip: Be clear on the difference between Count Rows and Count Distinct—this is commonly tested.
Grouping by One vs Multiple Columns
Grouping by a Single Column
Used to create high-level summaries such as:
- Total sales per customer
- Number of orders per product
Results in one row per unique value.
Grouping by Multiple Columns
Used when summaries must retain more detail, such as:
- Sales by customer and year
- Quantity by product and region
The output grain is defined by the combination of columns.
Impact on the Data Model
Grouping and aggregating rows in Power Query has a direct impact on the data model, which is an important exam consideration.
Key Impacts:
- Reduced row count improves model performance
- Changes the grain of fact tables
- May eliminate the need for certain DAX measures
- Can simplify relationships by reducing cardinality
Important Trade-Off:
Once data is aggregated in Power Query:
- You cannot recover lower-level detail
- You lose flexibility for drill-down analysis
- Time intelligence and slicer-driven behavior may be limited
Exam Insight: Microsoft expects you to recognize when aggregation improves performance and when it limits analytical flexibility.
Group and Aggregate vs DAX Measures (Highly Tested)
Understanding where aggregation belongs is a core PL-300 skill.
Group in Power Query When:
- Aggregation logic is fixed
- You want to reduce data volume
- Performance optimization is required
- The dataset should load at a specific grain
Use DAX Measures When:
- Aggregations must respond to slicers
- Time intelligence is required
- Users need flexible, dynamic calculations
Common Mistakes (Often Tested)
These are frequent pitfalls that appear in exam scenarios:
- Grouping too early, eliminating needed detail
- Aggregating data that should remain transactional
- Using Sum on columns that should be counted
- Confusing Count Rows with Count Distinct
- Grouping in Power Query when a DAX measure is more appropriate
- Forgetting to validate results after grouping
- Incorrect data types causing aggregation errors
Exam Pattern: Many questions present a “wrong but plausible” grouping choice—look carefully at reporting requirements.
Best Practices for PL-300 Candidates
- Understand the grain of your data before grouping
- Group only when it adds clear value
- Validate totals after aggregation
- Prefer Power Query grouping for static summaries
- Use DAX for dynamic, filter-aware calculations
- Know when not to group:
- When users need drill-down capability
- When calculations must respond to slicers
- When time intelligence is required
- When future reporting needs are unknown
How This Appears on the PL-300 Exam
Expect scenario-based questions such as:
- You need to reduce model size and improve performance. Where should aggregation occur?
- Which aggregation produces unique counts per group?
- What is the impact of grouping data before loading it into the model?
- Why would grouping in Power Query be inappropriate in this scenario?
Key Takeaways
✔ Grouping is performed in Power Query, not DAX
✔ Aggregation reshapes data before modeling
✔ Grouping impacts performance, flexibility, and grain
✔ Know both when to group and when not to
✔ This topic tests data modeling judgment, not just mechanics
Practice Questions
Go to the Practice Exam Questions for this topic.

One thought on “Group and Aggregate Rows (PL-300 Exam Prep)”