Nominal vs. Nominal
Part 2: Visualisation (clustered bar-chart)
On the previous page we got the first impression of the data using a cross table. On this page we'll see how we can visualise those results.
Two possible diagrams could be used, depending on your situation. If you have a clear dependent and independent variable, then a so-called clustered bar-chart might be useful, if not then a spline plot could be a good choice.
An independent variable vs. dependent variable is if you think one variable might influence the other, but not the other way around. Gender is often a good example of an independent variable, since your gender will not likely change depending on something else (unless you're doing biology and are using chromosones which determine the gender of a baby). In the example used, the gender might influence the marital status, while the marital status will not influence the gender. I would therefor use a clustered bar-chart (also known as a multiple bar-chart) for the example as shown in Figure 1.
Figure 1. Results of gender vs marital status.
Click here to see how you can create a clustered bar-chart as above, with SPSS, R (Studio), Excel, or Python.
There are a two different ways to create a clustered bar chart with SPSS.
using the chart builder
using legacy dialogs
with R (Studio)
Note that in the example the column totals add up to 100% each, which makes it easy to compare the results between the two genders. Depending on your results you might prefer to set each row as 100% or even based on the grand total.
We can notice the same things as we saw with the cross table; it seems that most percentages are similar with the biggest difference between Male and Female at married and widowed.
In the report I recommend using a ‘Introduce – Show – Tell’ approach. So when reporting this graph, it could be for example like this:
Often we hear that woman are more likely to be widowed. To see if there is a relation between gender and marital status, we asked people about their marital status and gender. Figure 1 shows the results of the survey.
As can be seen Figure 1 married is still by far the modal category for both males and females. The differences between males and females seem to be small (almost none for divorced), except for widowed where there are relatively many females.
The big question is if the sample shows sufficient evidence that the differences found might also appear in the population. This will be discussed on the next page.
In case you do not have a clear independent and dependent variable, a spine plot might be preferred as the one shown below.
Click here to see how to create a spineplot with R, or with Excel (not possible with SPSS)
with SPSS (not possible)
Unfortunately I am not familiar with a way to create a spine plot in SPSS using the GUI. It might be possible by using some syntax (linking perhaps to R), but that goes beyond the scope of this course. I'd suggest using MS Excel instead.
It might take some time, but it is possible to create a spine plot with Excel, as shown in the video below.
Two nominal variables