Power Query: Project Population – Part 14
26 July 2023
Welcome to our Power Query blog. This week, I complete the Project Population series.
I have found some information on population growth provided by The World Bank, which I am using as an example of how to transform real-life data.
data:image/s3,"s3://crabby-images/dc1fb/dc1fbc87c413f68fa13562cd39ad4f47763c831c" alt=""
I have been transforming the data, and last week, I created a query by merging Country and Data, using Left Anti join, to get the rows from Data which don’t match a Country Code on Country(or, by extension, Population Estimates by Country).
data:image/s3,"s3://crabby-images/2136b/2136b3a920f71331216964cfb0f8cf7e30d41d6b" alt=""
This time, I will tidy up this query. The first thing I shall do is to make the existing Source step more efficient. The M code is currently:
= Table.NestedJoin(Data, {"Country Code"}, Country,
{"Country Code"}, "Country", JoinKind.LeftAnti)
In this case, I definitely do not want to change this to use Table.Join(),since I don’t need any columns from Country. I do however want to use Table.Buffer() to speed up the merge:
= Table.NestedJoin(Table.Buffer(Data), {"Country Code"},
Table.Buffer(Country), {"Country Code"}, "Country",
JoinKind.LeftAnti)
As usual, this has no impact on the appearance of my query:
data:image/s3,"s3://crabby-images/e4db1/e4db150fab8170246312a319708d4677c39633ea" alt=""
Since merging always follows the same process, even though I have used a ‘Left Anti’ join to find those Country Code values on Data that did not have a match in Country, I still have a column where I could extract the Country columns:
data:image/s3,"s3://crabby-images/581ba/581ba8a14df447feaef3599b1e497ab240f5f99e" alt=""
Unsurprisingly, if I click on the space next to one of the ‘Table’ values, there is not much to see:
data:image/s3,"s3://crabby-images/65326/65326d0e12f3f30d50ce79f354c12d7bdf272404" alt=""
Therefore, I delete the Country column by selecting it and pressing the Delete key.
data:image/s3,"s3://crabby-images/baf9d/baf9d6e76ec054ce2b80b1bf8d642e754370500a" alt=""
Since I know that the Country Code column in this query is representing more than just one country, I change the column name to Region Code. Similarly, I rename Country Name to Region Name. I also rename the query Region Data:
data:image/s3,"s3://crabby-images/31995/3199582a99c8f8b39eece59117c1328f5dd2af04" alt=""
I could wait until I upload the query to see how many rows are uploaded, but there is another way. I select the Region Code column, and on the Transform tab, I choose ‘Count Values’.
data:image/s3,"s3://crabby-images/6f376/6f37691564cf49f7fbe54f608b86a2f90404fc3a" alt=""
This will tell me how many rows in this column have values other than blank or null.
data:image/s3,"s3://crabby-images/57803/57803fd279245bd959ef7a825cee5d8716a44d9e" alt=""
My answer is 5,940 rows. Note that this uses List.NonNullCount(), so if I wanted to include this data in my query, I could put this M code into a ‘Custom Column’ using the option on the ‘Add Column’ tab:
data:image/s3,"s3://crabby-images/0f28f/0f28f91c3e5b762c2db131416eac661e7b744fa3" alt=""
The number of rows is now shown with my data and I move it to the beginning of the query by right-clicking on the Count column and choosing ‘Move to Beginning’:
data:image/s3,"s3://crabby-images/0bc3b/0bc3bd8f692112ff499f36956aa8fc9129feeddf" alt=""
Finally, I will load the queries I have created into Excel (having changed the data type of Count to a Whole Number). I choose ‘Close & Load’, and right-click on the query I wish to load to the current sheet. This presents me with the ‘Import Data’ dialog, where I choose to load to Table, and select a cell on the existing worksheet:
data:image/s3,"s3://crabby-images/36427/3642766c932dd53893cf9d733564cd264af7a9d4" alt=""
I can repeat this process for the Region Data query:
data:image/s3,"s3://crabby-images/4fd4b/4fd4bd56af28b35cbcd92499359d1e2e1a1ff269" alt=""
I’ve proved that the count is correct!
Come back next time for more ways to use Power Query!