Yes, you read the title of this post correctly, you can calculate the median and lots of other functions in Excel PivotTables besides the regular options.
If you’re a regular user of Excel PivotTables you might know you can change the summary function:
Just right-click inside of the PivotTable → Go to Summarize Values By → Select the summary function
If you click on More Options…, you can select other functions such as Standard Deviation and CountNumbers. But, what if you need something different such as the median? That’s when Power Pivot comes into play.
Power Pivot is an Excel built-in feature (for Excel 2013 and later) that allows you to significantly extend the capabilities of regular PivotTables. For example, with Power Pivot you can use information from multiple tables without having to join it into a single table. Also, you can use lots of summary functions that are not available in regular PivotTables (e.g. Median). Note: The median function is only available in Power Pivot for Excel 2016.
Check this 5 min video below to get more information about Power Pivot
To show how to calculate the median (or another measure) in PivotTables, I’ll use a sample dataset that contains shipping data. The dataset contains the following columns:
Download the raw data from this link.
The process to calculate the median (or any other function) in PivotTables is as follows:
In addition, you can specify the name of the table where the measure will be stored and a description for the measure.
For example, the image below shows a PivotTable with the column Distribution Center in the rows area and the measure Median Quantity in the values area.
Let’s say you want to calculate the median of the number of days passed from the Order Date to the Delivery Date. This date difference is called leadtime.
There are two ways of doing this:
I’ll explain how to use MEDIANX:
All Power Pivot functions ending in X are iterators. Iterator functions can perform an operation for each row of a table.
In general, the syntax for an iterator is:
=FUNCTIONNAMEX(Table, Operation)
In this case, we want to subtract the Order Date from the Delivery Date for each row of the table called Range and then we want to calculate the median of those results.
Therefore, the syntax would be:
=MEDIANX(Range, Range[Delivery Date] – Range[Order Date])
Here’s a tough challenge for you:
Let’s say there’s a target of having a leadtime (delivery date – order date) of 15 days or less. How would you calculate the percent of orders that had a leadtime greater than 15 days for each of the distribution centers?
The PivotTable should look like this:
If we pick distribution center B, 94.8% of the orders had a leadtime greater than 15 days.
HINT: Read about these Power Pivot functions: COUNTX, AVERAGEX, IF, COUNTROWS, FILTER, BLANK. You don’t need to use all of them to answer the question.
Write your solution in the comments section. By October 15, I’ll share a workbook with all the solutions posted and the names of the people who posted.
© Master Data Analysis All Rights Reserved 2024