Power Query: Project Population – Part 6
31 May 2023
Welcome to our Power Query blog. This week, I prepare my second query from a public data source.
I have found some information on population growth provided by The World Bank, which I am using as an example of how to transform real-life data.
data:image/s3,"s3://crabby-images/4523a/4523a1aff16d814f197fc3fc34c1bae4d130db3d" alt=""
I have been transforming the data, and in Part 5, I finished tidying up the main
query: Country:
data:image/s3,"s3://crabby-images/b9204/b92049b0f0a0434dd91a841433510703698e6e08" alt=""
Now that all the columns in this query have consistent data, I will look at the data that I can add from the other queries I imported. Today, I will look at the Country-Series query to see if there is any information that could be useful.
data:image/s3,"s3://crabby-images/6abc2/6abc23f9cc372dcc77b015d1e86769c7b97d7ede" alt=""
The first thing I notice is that the headers have not been promoted. This is because the algorithms have not detected a difference between the headings and the other data in the column since everything is of data type text. I can fix this by using the ‘Use First Row as Headers’ option on the Home tab.
data:image/s3,"s3://crabby-images/1d1a4/1d1a4811ba5505d0578eb62871841f6925477705" alt=""
However, since this will also generate a ‘Changed Type’ step, I will first delete the current ‘Changed Type’ step:
data:image/s3,"s3://crabby-images/32493/324937231f03db633efdfd482ec82fe4ece55fd1" alt=""
This query gives me information about the source of the population data in the Data query. I decide that I do not need the SeriesCode column, so I select the CountryCode and DESCRIPTION, and right-click to ‘Remove Other Columns’:
data:image/s3,"s3://crabby-images/4dc5b/4dc5b18b9d61278a61b1c67757360d9a5420ea0c" alt=""
Ideally, I would like to have one description for each country. I start by selecting both columns and accessing the right-click menu, where I can ‘Remove Duplicates’:
data:image/s3,"s3://crabby-images/9476a/9476ae3e2c636db65f4ddec433cfaedc8fcef5b5" alt=""
However, there are still multiple descriptions for some countries. I can use a trick to combine the text. I can use ‘Group By’ which is available on the Home tab, and here, on the Transform tab:
data:image/s3,"s3://crabby-images/8a16a/8a16ad05886dcfc8d5d969a4d33adf42c1a87415" alt=""
I want to ‘Group By’ CountryCode, and concatenate the DESCRIPTION rows. However, since I have both columns selected, the dialog is in ‘Advanced’ mode, and prompting me to ‘Group By’ CountryCode and DESCRIPTION. I will sort this out later.
data:image/s3,"s3://crabby-images/58e08/58e0804f81725647bfdb98fc36156abad1147a86" alt=""
I want to create a new column which I call Description, where I will sum the DESCRIPTION rows:
data:image/s3,"s3://crabby-images/a3a04/a3a0423bf4ae206fcf3ea8779c7dda7ebd3dc30c" alt=""
Unsurprisingly, Power Query is not happy about summing a text value:
data:image/s3,"s3://crabby-images/74af2/74af28eff3ee66cd67fb08b40b6f558a4bc71d46" alt=""
Next time, I will convert the M code into something more acceptable…
Come back next time for more ways to use Power Query!