Power Query: Unlimited Importing of Non-Delimited Text Files
10 March 2017
Welcome to our Power Query blog. Today I look at importing data from non-delimited text files.
As a programmer, I was often asked to produce delimited versions of reports which could easily be picked up by Excel. Whilst it is possible to clean non-delimited files in Excel, it’s a laborious and repetitive process, which is why it made more sense to pay for a programmer to automate it. Power Query is a free method of cleaning up these files, and since the steps are recorded, it can be reapplied.
I begin by creating a new query from my file, as shown below:
data:image/s3,"s3://crabby-images/f3ab7/f3ab7ca7bbc2767b646be3772f0c6f08ef646032" alt=""
I browse to my file and select it. Power Query has had a valiant effort at delimiting my data as there happened to be a colon between the ‘Name’ title and the name, but most of it is in one column. My goal is to split the data into more columns.
data:image/s3,"s3://crabby-images/801fe/801fed783c00c86bb43c5968ac0a6f353bbfc891" alt=""
The first four rows don’t include any data I want, so I am going to edit and get rid of these. In the ‘Home’ tab, there is a section to do just this:
data:image/s3,"s3://crabby-images/6a415/6a4159e7e40b656964da0da3cdf94e079817e653" alt=""
I remove the first four rows and then the blank row after the Name row. I could have done this in one step by just removing the blank rows, as shown below:
data:image/s3,"s3://crabby-images/483bc/483bcba591541fb16d21b85f0f8599fc698479d1" alt=""
Now I want to populate Column2 all the way down. I described this process in detail in Getting Started where I replace the blank names with null and then fill down. I can then remove the first row and rename the column:
data:image/s3,"s3://crabby-images/b395c/b395c31dd3882c4fbed394fbcdcd1a76ed04ef6d" alt=""
In order to make sure my data in Column1 is as clean as it can be, I trim and clean it in order to remove any leading and trailing spaces, along with any annoying escape codes. Right-clicking the column reveals these options:
data:image/s3,"s3://crabby-images/9b128/9b128279ea0875d05b1af4bc2733db5619d836b2" alt=""
The next step is to break up the data in column 1, and to do this, there is a ‘Split Column’ option in the ‘Home’ section:
data:image/s3,"s3://crabby-images/e8c3d/e8c3d597e6fa0ad62c63053e87347d7886fe8869" alt=""
I make a guess at where to split my data – since I can edit the step by clicking on the gear next to the step in the ‘APPLIED STEPS’ window, I can adjust this until my data looks right.
data:image/s3,"s3://crabby-images/b2dac/b2dacd4a3ce7efbd77f3f22c8810126c9e6aed58" alt=""
My date data looks good. For my new column Column1.2, I use the ‘£’ sign as a delimiter instead of using a count because my expense codes vary in length:
data:image/s3,"s3://crabby-images/2c1eb/2c1ebdb1209cfe51855ce9d072de82b1db5364d0" alt=""
The end result looks promising.
data:image/s3,"s3://crabby-images/1d0e7/1d0e7688e74e76db43809e7c60072a2cc8ee0ccf" alt=""
There’s not much point trying to use my top row as headings, so I delete the top row and rename the columns.
data:image/s3,"s3://crabby-images/2e986/2e986979a7da0be7d37b70a04079500e58c50077" alt=""
The data looks ready to load. This query can be applied to another delimited expenses text file that comes through in a similar format, and the steps can be adjusted as required.
Want to read more about Power Query? A complete list of all our Power Query blogs can be found here. Come back next time for more ways to use Power Query!