DataViz Makeover 1

This visualisation presents the growth of Singapore’s Labour Force Participation Rate (LFPR) from 2009 to 2019.

Andy WONG https://www.linkedin.com/in/andywongkh
01-28-2021

1.0 Overview

The data visualisation is published on Tableau Public.

1.1 Clarity vs aesthetics

The original chart is presented below:

Source: https://stats.mom.gov.sg/Pages/Labour-Force-In-Singapore-2019.aspx

In terms of aesthetics, the chart above looks acceptable. Font styles and colour schemes are easy on the eyes.

However, the presentation of the data itself could be clearer.

Based on this assessment, the chart falls into Quadrant IV of the matrix:

Source of clarity vs aesthetics matrix: https://dataremixed.com/2012/05/data-visualization-clarity-or-aesthetics

2.0 Makeover

2.1 Critique of the visualisation, and suggested improvements

2.1.1 Clarity

S/n Critique Suggestions for improvement
1 Unclear subtitle. It is not clear what the subtitle “Per Cent” refers to. It almost looks to be put there as an afterthought. Remove the subtitle and label the appropriate axis with “%”. This will eliminate uncessary text, yet present the percentage values in a more intuitive manner.
2 The x-axis is not explained. There are no labels to explain what the three types of values on the x-axis mean. Pivot the data such that the percentages are on the x-axis and the age group is on the y-axis, and label the axes accordingly. This will improve the clarity of the visualisation.
3 Age Group is not presented clearly. Compounded with the blue and grey lines of the line graph, it looked as though a timeline was being presented. Only upon further examination was it understood to be a presentation of the proportion of age group in percentage. Pivot the data such that age group is presented on the y-axis. This will improve the intuitiveness of the visualisation.
4 Duplicate presentation of information. The two lines of the line graph essentially present the same information as the percentage values in the x-axis. The line graph presents the trend visually, while the x-axis provides the actual percentage values. This duplication would not be needed with better visualization design. Remove the June 2009 and June 2019 percentage values from the x-axis. This will no longer be necessary once the data is presented appropriately.

2.1.2 Aesthetics

S/n Critique Suggestions for improvement
1 Good use of colour to differentiate the two time periods. A more contrasting colour scheme might be better. Use more appealing colours to enhance the contrast and clarity of the two sets of data being compared.
2 Clear labeling of what the two lines in the line graph represent, and the median ages. However, better designed aesthetics could allow for more concise labeling (using less words). Include labels as necessary to improve conciseness and clarity.
3 With thee different values, the x-axis is too cluttered. This will be solved once the data is pivoted appropriately, as mentioned in section 2.1.1.
4 A minor point: although best practice states that number ranges should start from 0, there is no need for the age group to start from 0 in this case, as this graph is meant to present the labour force which consists of only persons aged 15 years and over. The age groups presented covers the entire range of legal working age ranges. No action necessary.

2.2 Sketch of the alternative visualisation

The sketch below is the proposed improved visualisation to address the critique presented in section 2.1.

Proposed visualisation

2.3 Building the visualisation

This visualisation was built using Tableau Desktop Professional Edition 2020.4.0.

Step 1. Import data into Tableau
Import Excel file into Tableau.


Step 2. The data is not standardised and cleaned because the formatting of the spreadsheet is not friendly for data importation.


Step 3. Clean the data
Click on the “Cleaned with Data Interpreter” to provide an initial clean-up of the data to allow for easier manual cleaning.


Step 4. Hide uneccessary columns
Hide the left-most column as it is not useful for this analysis.


Step 5. Rename columns
Rename the “Age (Years) 1” column to “Age Group” for clarity.


Step 6. Check data type Age column
Check that the “Age” column is of the correct data type (String).


Step 7. Check data type of year columns
Check that the rest of the year columns are of the correct data type (Number[decimal]).


Step 8. Define Column and Row data
Drag “2009” and “2019” to the Columns pane and “Age Group” to the Rows pane.


Step 9. Exclude irrelevant columns
Exclude the “NULL” and “Total” columns from the chart as these are not useful for the analysis.

Step 10. Exclude overlapping age group
Exclude the overlapping age groups, as these will confuse the data presented in the visualisation.


Step 11. Sort Age Group
Sort “Age Group” in descending order.

Step 12. Superimpose the two charts
Select “Dual Axis” to combine the two charts into one.


Step 13. Synchronise the two y-axes
Select “Synchronise Axis” to ensure both charts’ y-axes match.


Step 14. Remove header
Remove “Show Header” to show only one y-axis.


Step 15. Change visualisation for 2019 chart
Change the 2019 Mark to Bar chart.


Step 16. Reorder 2019 and 2009 charts
Swap “SUM(2019)” to the front of the Columns pane to display the 2019 chart behind the 2009 chart.


Step 17. Show whole range of x-axis
Edit the range of the x-axis to show the whole range of 0% to 100%. Rename label as “LFPR (%)”.


Step 18. Expand chart view
Change the view to “Entire View” for visual clarity.


Step 19. Edit chart title


Step 20. Edit Legend title
Edit the Legend title to “Year”.


Step 21. Rename the Sheet


Step 22. Completed visualisation
The completed chart looks like this:

2.4 Key observations from the visualisation

S/n Observation
1 The largest increase in LFPR from 2009 to 2019 was among the 50 to 64, and 65 to 69 age groups. This kind of shift in LFPR was to be expected of an ageing population as more older workers joined the workforce than their younger counterparts.
2 The 25 to 29 age group had the highest LFPR in 2009. Ten years later in 2019, the age group with the highest LFPR was 30 to 34, which was one age group younger than the original 25 to 29 age group. The showed that from 2009 to 2019, the age group with the highest LFPR moved down by one group. The reason for this trend is outside the scope of this analysis but is a potential area of interest for further study.
3 Of all the age groups, only the 20 to 24 age group declined in LFPR from 2009 to 2019. This was the only exception among all the age groups. As this was typically the age group of university undergraduates, it may have signified that more people in this age group were attending university. However, this point warrants further study.
4 Overall, the lower the age group, the slower the increase in LFPR from 2009 to 2019, with the exception of the 15 to 19 age group, which seemed to reverse the trend compared to the next older age group (20 to 24). This seemed to contradict Singapore’s Total Fertility Rate (TFR) which has been declining consistently, but may have been bolstered by the slight up-tick in TFR in 2000.

Thanks for visiting my blog!

This post is a data visualisation assignment for the MITB programme at the Singapore Management University.

Distill is a publication format for scientific and technical writing, native to the web.
Learn more about using Distill at https://rstudio.github.io/distill.