How to create a Sankey chart.

While Sankey charts may seem daunting at first, they’re a great way to show a change of flow between more than one category or condition.

For this example we’re looking at how the sum of sales changes between Regions and Categories in the Tableau Superstore data.

Figure 1: The Sankey Chart we will be making.

Step 1|

To create the above Sankey chart you will need to copies of your data. One for the categories on the left hand side of our chart, and another for the regions on the right hand side. Union your data to itself to get these two copies.

Once unioned, Tableau will create a new field called ‘Table Name’. This field, and it’s contents, will be used in calculations later on.

Step 2|

Create a new calculated field, I’ve called mine ToPad. This calculated field will mean that one copy of our data will be labelled 1 and the other 49.

Figure 2: Calculated Field – ToPad

When this field is added to the field, we can see it has values for 1 and 49, but nothing in between. A value of 49 here will mean that when Tableau draws the curve of our sankey chart, it will be smooth and not jagged.

Step 3|

By creating bins on the ToPad field, we can simulate values between 2 and 48.

Figure 3: Create Bins for the ToPad field.

Step 4|

Create a new calculated field, I’ve called mine T. This field will ensure that are marks are spread out across the view.

Figure 4: Calculated Field -T.

Step 4|

Add the calculated field ‘T’ to the columns and add ‘Padding’ to the details shelf. Then change your mark type to a circle, this will make it easier to visualize the next steps.

Figure 5:

As you can see, only one mark is shown. Right click on the T field, and select ‘Compute Using’ and ‘padding’. This will show all of your marks.

Step 5|

Create a new calculated field, Rank 1. Duplicate this field and call the new field, Rank 2.

Figure 6: Rank Calculation

These calculations will help inform Tableau where the lines of the sankey chart should start and end.

Step 6|

The signature s-shaped curve of a sankey chart is a sigmoid curve. So create a calculated field that will show tableau how to draw that characteristic s-shaped curve.

Figure 7: Sigmoid Curve Calculation

Step 7|

Using the previous calculated fields, you will need to create one final calculated field that will build out the curve of the sankey chart.

Figure 8: Curve Calculation

This calculation will go from the starting point (Rank 1) to the finishing point (Rank2-Rank 1) drawing a sigmoid curve between them.

Step 8|

Add your new calculated field, curve, to the rows shelf.

Initially, only two marks will appear, one on the extreme edge and one on the extreme right. This is because tableau hasn’t been told how to calculate the curve.

Step 9|

Add Region and Category onto the details shelf. Right click on your curve pill and select edit table calculation. Select Compute Using Specific dimensions, using the dimensions specified below, in that order.

Step 10|

You will need to edit your axis. I’ve made the X-Axis between -5 and 5, and the Y-Axis between 0 and 1. The Y-Axis also needs to be reversed, so the points that appear first alphabetically are at the top of the axis.

Figure 11: The shape of the sankey chart is coming together.

Once this is done, hide your headers.

Step 11|

Change your mark type to a line. Initially, the chart will look strange, this is because Tableau hasn’t been told how to draw the line. Make sure that your Padding field is on the paths shelf.

Figure 12: Once the mark type has been changed to a line, this is how your sankey chart should look.

Step 12|

To size the lines of the chart based on the overall sales, we need to create one final calculated field.

Figure 13: Calculation for Sales Sizing.

Add this new calculated field onto the sizing shelf and compute using ‘Padding’.

Step 13|

Add Category and Region to your colour shelf.

Figure 14: The central curve for the sankey chart, completed.

Step 14|

In two separate sheets, create a stacked bar chart for % of total sum of sales, coloured by Category and Region.

Step 15|

The last step is straightforward. Combine the three sheets you’ve made onto one dashboard.

I hope this blog post has helped you to understand how to create a sankey chart. I know writing it definitely helped cement my understanding of the process.

Author:
Brandon Fitz-Gerald
Powered by The Information Lab
1st Floor, 25 Watling Street, London, EC4M 9BR
Subscribe
to our Newsletter
Get the lastest news about The Data School and application tips
Subscribe now
© 2024 The Information Lab