Power Query: Group Dynamics
10 October 2018
Welcome to our Power Query blog. This week, I look at how to calculate a group share.
This week’s blog looks at how to achieve a group share calculation in Power Query, in the same way that I can perform one in Excel. This is an exercise in how to manipulate data in Power Query; it is not how I would actually attempt to do this calculation, since it is much easier to do this in Excel or PowerPivot
I have the following data. I am interested in finding out which groups make up most owned items. I want to see the percentage of owned or outsourced items for each group.
data:image/s3,"s3://crabby-images/c7e53/c7e53c657a7591d6fc93dbd812c22f944e7837de" alt=""
In Excel I can create an additional column and enter the formula
=[@[Items_in_Group]]/SUMIFS([Items_in_Group],[Owned/Outsourced],[@[Owned/Outsourced]])
which calculates the percentage of outsourced or owned items that are in the current group.
data:image/s3,"s3://crabby-images/11704/117046f5889b45ca01aea39d7fd5769856eb20e9" alt=""
I can see that ‘Floor’ items make up 40% of the owned items.
I want to perform the same calculation in Power Query, so I create a new query ‘From Table’ from the ‘Get & Transform’ section on the ‘Data’ tab.
data:image/s3,"s3://crabby-images/7b2c7/7b2c75eb9df56f38a2288d5570e039f12f0b82c9" alt=""
I start by grouping by my Owned/Outsourced column. This will allow me to make my first step - to calculate the total number of items in each group:
data:image/s3,"s3://crabby-images/1e489/1e489e943efd51e73ca95e7c02ab022c0d01c553" alt=""
I use a basic group by statement, where I sum Items_In_Group. This will simplify my data so that I can see the number of items Owned and Outsourced.
data:image/s3,"s3://crabby-images/90c53/90c5358d12c9f411902f7feb4347cac432910688" alt=""
I know that step ‘Changed Type’ shows me all the details, and ‘Grouped Rows’ shows me the total items owned and outsourced. One method I can use to get to my final calculation is to merge these steps by merging ‘Table2’ with itself (I will show another method later).
data:image/s3,"s3://crabby-images/c41f9/c41f9f17f2def86c5ee416e7eece95ff6cc03880" alt=""
I choose the first column each time, and select the left outer join.
data:image/s3,"s3://crabby-images/cf313/cf3133a3ab821a735a304c8365e6920d28f1daab" alt=""
From the M code generated for this step, I can see that the ‘Grouped Rows’ step has been merged with itself.
= Table.NestedJoin(#"Grouped Rows",{"Owned/Outsourced"},#"Grouped Rows",{"Owned/Outsourced"},"Grouped Rows",JoinKind.LeftOuter)
I can edit this step so that I merge the ‘Changed Type’ step instead.
data:image/s3,"s3://crabby-images/63517/6351745326e0aa3528732f33f26a52c2f59fd7a8" alt=""
I have now taken my ‘Changed Type’ step with all the details and merged the ‘Grouped Rows’ step. I only want Total Items Owned or Outsourced from the ‘Grouped Rows’ step, so I expand the table and pick this column.
data:image/s3,"s3://crabby-images/7ed9a/7ed9afa8d1e23aa64529d83473072cde8eb27552" alt=""
As usual, I don’t want to ‘Use original column name as prefix’!
data:image/s3,"s3://crabby-images/fe580/fe5804b3347b8e8efdd5927cb4a8791b5f88279e" alt=""
Now I have the two key values for my percentage calculation, I can create a custom column from the ‘Add Column’ tab.
data:image/s3,"s3://crabby-images/23ace/23ace45dc64fd777b31b9835bf4cbe4314813e0f" alt=""
I divide the number of items in the group by the items owned or outsourced to see which group is mostly made up of owned and outsourced items respectively. I could also do this using the ‘Divide’ option from the ‘Standard’ part of the ‘From Number’ section.
data:image/s3,"s3://crabby-images/1b7ae/1b7ae6f9d4143029cc00bec642f1b435a9422e50" alt=""
In the same way I did for the Excel method, I convert my new column to type ‘Percentage’.
data:image/s3,"s3://crabby-images/3c80d/3c80d372b999ab28cc9daf02ec0998827a158b64" alt=""
My calculations are consistent with the Excel method.
I can achieve the same result in another way. I can use M code if I don’t want to merge my queries. Instead of using the ‘Grouped Rows’ step above, I create a custom column which will carry out the grouping for me.
data:image/s3,"s3://crabby-images/60711/60711698f75fff201c02b779b3bdb9cf5070bd04" alt=""
The M code I have used is:
= Table.AddColumn(#"Sorted Rows", "Items_Owned_Outsourced", each Table.Group(Table2, "Owned/Outsourced", {"Total", each List.Sum([Items_in_Group])}))
The part of this which creates my grouped total is Table.Group(Table2, "Owned/Outsourced", {"Total", each List.Sum([Items_in_Group])})
I have grouped by Owned/Outsourced and summed Items_in_Group within this. This gives me a table very similar to the one created by the merging of the query with itself, which I can expand.
data:image/s3,"s3://crabby-images/a4ac6/a4ac63c1c577cc9a1fdbe6ad4df2571f63910a6b" alt=""
As before, I can create my custom (or divide) column and convert it to a percentage. This time, I will use the divide functionality.
data:image/s3,"s3://crabby-images/30c8e/30c8e33af9b4bccbea4015d7d7b34232fcf0f935" alt=""
This gives me a new Division column which I can rename.
data:image/s3,"s3://crabby-images/ce70d/ce70d3625c936136e13246d8c1aa3e3dab336308" alt=""
Having converted this to the right data type, my data is consistent with the other results.
data:image/s3,"s3://crabby-images/ec292/ec2929d4c5388ff50da8765598b94f29a02b0100" alt=""
Come back next time for more ways to use Power Query!