Start learning with our library of video tutorials taught by experts. Get started

SPSS Statistics Essential Training

Creating crosstabs for categorical variables


From:

SPSS Statistics Essential Training

with Barton Poulson

Video: Creating crosstabs for categorical variables

In the last two movies we looked at ways to assess the relationships between two variables. We looked at correlations, which work for pretty much any kind of variable, and we looked at bivariate linear regression, a closely related procedure, but one that doesn't work with categorical outcome variables. If you do have a categorical outcome variable and a categorical predictor, you can still use correlations as long as those variables are coded as 01 indicator variables. But it's more common to use what's called a crosstabulation or crosstab for short.
Expand all | Collapse all
  1. 2m 58s
    1. Welcome
      1m 5s
    2. Using the exercise files
      40s
    3. Using a different version of the software
      1m 13s
  2. 19m 0s
    1. Taking a first look at the interface
      11m 49s
    2. Reading data from a spreadsheet
      7m 11s
  3. 21m 54s
    1. Creating bar charts for categorical variables
      7m 18s
    2. Creating pie charts for categorical variables
      2m 54s
    3. Creating histograms for quantitative variables
      5m 45s
    4. Creating box plots for quantitative variables
      5m 57s
  4. 33m 10s
    1. Recoding variables
      5m 33s
    2. Recoding with visual binning
      5m 33s
    3. Recoding by ranking cases
      5m 26s
    4. Computing new variables
      5m 37s
    5. Combining or excluding outliers
      5m 21s
    6. Transforming outliers
      5m 40s
  5. 28m 12s
    1. Selecting cases
      6m 44s
    2. Using the Split File command
      5m 12s
    3. Merging files
      5m 33s
    4. Using the Multiple Response command
      10m 43s
  6. 22m 14s
    1. Calculating frequencies
      8m 43s
    2. Calculating descriptives
      5m 31s
    3. Using the Explore command
      8m 0s
  7. 16m 3s
    1. Calculating inferential statistics for a single proportion
      6m 6s
    2. Calculating inferential statistics for a single mean
      5m 39s
    3. Calculating inferential statistics for a single categorical variable
      4m 18s
  8. 30m 43s
    1. Creating clustered bar charts
      7m 10s
    2. Creating scatterplots
      5m 8s
    3. Creating time series
      3m 24s
    4. Creating simple bar charts of group means
      4m 17s
    5. Creating population pyramids
      3m 0s
    6. Creating simple boxplots for groups
      3m 3s
    7. Creating side-by-side boxplots
      4m 41s
  9. 45m 28s
    1. Calculating correlations
      8m 17s
    2. Computing a bivariate regression
      6m 27s
    3. Creating crosstabs for categorical variables
      6m 34s
    4. Comparing means with the Means procedure
      6m 33s
    5. Comparing means with the t-test
      6m 4s
    6. Comparing means with a one-way ANOVA
      6m 30s
    7. Comparing paired means
      5m 3s
  10. 24m 30s
    1. Creating clustered bar charts for frequencies
      6m 34s
    2. Creating clustered bar charts for means
      3m 45s
    3. Creating scatterplots by group
      4m 13s
    4. Creating 3-D scatterplots
      4m 25s
    5. Creating scatterplot matrices
      5m 33s
  11. 30m 57s
    1. Using Automatic Linear Models
      11m 52s
    2. Calculating multiple regression
      9m 3s
    3. Comparing means with a two-factor ANOVA
      10m 2s
  12. 29m 29s
    1. Formatting descriptive statistics
      6m 1s
    2. Formatting correlations
      7m 49s
    3. Formatting regression
      10m 19s
    4. Exporting charts and tables
      5m 20s
  13. 51s
    1. What's next
      51s

Watch this entire course now—plus get access to every course in the library. Each course includes high-quality videos taught by expert instructors.

Become a member
Please wait...
SPSS Statistics Essential Training
5h 5m Beginner Aug 17, 2011

Viewers: in countries Watching now:

In this course, author Barton Poulson takes a practical, visual, and non-mathematical approach to the basics of statistical concepts and data analysis in SPSS, the statistical package for business, government, research, and academic organization. From importing spreadsheets to creating regression models to exporting presentation graphics, this course covers all the basics, with an emphasis on clarity, interpretation, communicability, and application.

Topics include:
  • Importing and entering data
  • Creating descriptive charts
  • Modifying and selecting cases
  • Calculating descriptive and inferential statistics
  • Modeling associations with correlations, contingency tables, and multiple regression
  • Formatting and exporting tables and charts
Subjects:
Business Data Analysis
Software:
SPSS
Author:
Barton Poulson

Creating crosstabs for categorical variables

In the last two movies we looked at ways to assess the relationships between two variables. We looked at correlations, which work for pretty much any kind of variable, and we looked at bivariate linear regression, a closely related procedure, but one that doesn't work with categorical outcome variables. If you do have a categorical outcome variable and a categorical predictor, you can still use correlations as long as those variables are coded as 01 indicator variables. But it's more common to use what's called a crosstabulation or crosstab for short.

This is simply a table with rows and columns that crosses, hence the name crosstabulation, the combinations of categories in the two variables. Each box or cell in the table simply indicates how many people have that particular combination of the two categories. To do this example, I'm going to use the GSS dataset and I'm going to show the relationship between marital status in this particular dataset and overall levels of happiness. To do this, I first come up to Analyze, to Descriptive Statistics.

Now this one right here, Tables, refers to Custom Tables, which is a separate add-in that you pay for in SPSS. But the one that comes standard in everything is right here under Descriptive Statistics, to Crosstabs. That's the one I'm going to use in this example. All I need to do is specify the variables that I want to depict the rows and the columns. In this particular example, I'm going to use Married to separate the rows, so those will be the ones going across. The columns, which I'll use for my outcome variable, is going to be the indicator of happiness, and that is near the bottom of the dataset.

It's this one called Self-rated Happiness. I'm going to drag that up to the columns. Now if I do this, it will simply give me the number of people who fall into each category. There are generally a couple of things I want to add. The first one is under Statistics. I want to add a measure of association for this with something called a Chi-square. I click on that. That's a statistic that shows changes in distribution to cross-categorical variables. Press Continue.

The next one is what numbers I actually want to have in the cells. Now sometimes the two groups, like for instance Married and Not Married, can be very different sizes in which case it's hard to compare the raw frequencies. Instead what I might want to do is break down the percentages so I know what percentage of people who say they're married, say they're not too happy, or pretty happy or very happy. And the easiest way to do that is with what's called a Row Percentage, because I want to get the percentage of people going across who fall into each column.

Now if I have my data organized differently, I might want column percentages, where I look at the percentage of people in each column who fall into particular rows. Either way. In this one I just want to use a row percentage. So I'm going to press Continue now and then I'll just press OK. And what I have here first is the Case Processing Summary. This tells me that we had complete data from 349 people. Now I actually have complete data on these particular variables. If any of my cases were missing a value on one or the other of these variables, they wouldn't be included.

So crosstabs only work with complete data. This next table is the crosstabulation itself and what we have on the left is that says whether people reported that they were married or not married, so it's married yes and no. Across the top we have self-rated happiness with not too happy, pretty happy, and very happy. And what we see at the end of that is the totals, so there is a 170 people who were married and 179 were not married. It's coincidental that we have very close numbers on these ones.

And what you can see as we go across is the percentage of people who were married, who said for instance they were very happy, was 44.7%. That's 76 people out of 170. On the other hand of the people in this dataset who were not married, 44 of them said that they were very happy, which is 24.6%, so it's a lower percentage. The percentages of people who said they were pretty happy are close to each other for the two groups, 51.2% for those who are married, and 55.3% for those who weren't.

And the percentage of people who are not too happy changes also. We have 4.1% of the people who are married so they weren't too happy and 20.1% of the people who weren't married and say they weren't too happy. The last table is called the Chi-Square Text. That's the inferential statistic here and we're looking at the top one that says Pearson Chi-Square. The actual value of the test statistic is 28.653. The next number is what's called the degrees of freedom and it has to go into the calculations of the probability levels.

It has 2 degrees of freedom in this case. And this third number is the asymptotic significance level of 2-sided. That's the probability level that goes into the hypothesis test. In this case, it shows up as .000. It's not actually 0 all the way through, but it's a number that is smaller than .001. And what this shows us is that the distribution of self-rated happiness is different for the two groups on the marital status variable. It's important to remember again, this is simply showing a correlation of self-reported variables.

And why there might be an apparent association between these two is a whole different issue, but that's true of any measure of association. And so a crosstabulation is a great way to show the relationship between two categorical variables. By selecting the row or column percentages, you can make it easier to compare the groups. And the chi-square inferential test lets you know whether any differences you see are large enough to become statistically significant. And again, it's worth remembering that if your categories are dichotomies with only two groups, like yes/no or male/female and if the variables are coded as 01 indicator variables, then you can also get a correlation coefficient for the association that will have the same result on the significance test.

That is, it'll have the same probability value and the same result in terms of rejecting or retaining the null hypothesis. However, the row and column percentages are a nice perk of the crosstabs procedure and in any case, if your variables have more than two categories, then you would want to do the crosstab and Chi-square anyhow. And with that in mind, the next several movies will address ways to investigate the mean scores on scale variables for different groups.

There are currently no FAQs about SPSS Statistics Essential Training.

Share a link to this course
Please wait... Please wait...
Upgrade to get access to exercise files.

Exercise files video

How to use exercise files.

Learn by watching, listening, and doing, Exercise files are the same files the author uses in the course, so you can download them and follow along Premium memberships include access to all exercise files in the library.
Upgrade now


Exercise files

Exercise files video

How to use exercise files.

For additional information on downloading and using exercise files, watch our instructional video or read the instructions in the FAQ.

This course includes free exercise files, so you can practice while you watch the course. To access all the exercise files in our library, become a Premium Member.

Upgrade now

Are you sure you want to mark all the videos in this course as unwatched?

This will not affect your course history, your reports, or your certificates of completion for this course.


Mark all as unwatched Cancel

Congratulations

You have completed SPSS Statistics Essential Training.

Return to your organization's learning portal to continue training, or close this page.


OK
Become a member to add this course to a playlist

Join today and get unlimited access to the entire library of video courses—and create as many playlists as you like.

Get started

Already a member?

Become a member to like this course.

Join today and get unlimited access to the entire library of video courses.

Get started

Already a member?

Exercise files

Learn by watching, listening, and doing! Exercise files are the same files the author uses in the course, so you can download them and follow along. Exercise files are available with all Premium memberships. Learn more

Get started

Already a Premium member?

Exercise files video

How to use exercise files.

Ask a question

Thanks for contacting us.
You’ll hear from our Customer Service team within 24 hours.

Please enter the text shown below:

The classic layout automatically defaults to the latest Flash Player.

To choose a different player, hold the cursor over your name at the top right of any lynda.com page and choose Site preferencesfrom the dropdown menu.

Continue to classic layout Stay on new layout
Welcome to the redesigned course page.

We’ve moved some things around, and now you can



Exercise files

Access exercise files from a button right under the course name.

Mark videos as unwatched

Remove icons showing you already watched videos if you want to start over.

Control your viewing experience

Make the video wide, narrow, full-screen, or pop the player out of the page into its own window.

Interactive transcripts

Click on text in the transcript to jump to that spot in the video. As the video plays, the relevant spot in the transcript will be highlighted.

Thanks for signing up.

We’ll send you a confirmation email shortly.


Sign up and receive emails about lynda.com and our online training library:

Here’s our privacy policy with more details about how we handle your information.

Keep up with news, tips, and latest courses with emails from lynda.com.

Sign up and receive emails about lynda.com and our online training library:

Here’s our privacy policy with more details about how we handle your information.

   
submit Lightbox submit clicked