# Calculating inferential statistics for a single proportion

## Video: Calculating inferential statistics for a single proportion

For many people, when they think of statistics, they think of inferential statistics, and not always fondly. Of course, there is much more to statistics and data analysis than the calculation of probability values, and this should be evident by the amount of time we spent so far on graphics and descriptive statistics. However, the ability to go beyond the data at hand and make inferences about a larger group of people--hence the name inferential statistics--is one of the great beauties of analysis. In this set of movies, I want to start with the simplest kinds of inferential statistics, those for one variable at a time.

## Calculating inferential statistics for a single proportion

For many people, when they think of statistics, they think of inferential statistics, and not always fondly. Of course, there is much more to statistics and data analysis than the calculation of probability values, and this should be evident by the amount of time we spent so far on graphics and descriptive statistics. However, the ability to go beyond the data at hand and make inferences about a larger group of people--hence the name inferential statistics--is one of the great beauties of analysis. In this set of movies, I want to start with the simplest kinds of inferential statistics, those for one variable at a time.

There are few different procedures that we'll cover, such as confidence intervals and hypothesis tests, for scale variables and proportions, as well as the distribution of a single categorical variable. But let's start with what is probably the simplest and most familiar, the confidence interval and hypothesis test for a single proportion. For this example, I'm going to be using the GSS.sav data set. That stands for General Social Survey. And it has one variable on the end here that I think is interesting. If I scroll to the end, I have a variable here that's called ReadBook, and what it means is whether the person says that they've read a novel, a poem, or a play in last year.

We might be interested in the percentage of people who say that they have read one, whether that is significantly higher then, for example 50% and what the confidence interval for that might be, like you would get from a political poll where they say 73% of respondents plus or minus 3% who are in favor of a particular candidate. To do this, I'm going to many use one of SPSS's more interesting features. It's called nonparametric tests, and I get to it by going to the Analyze menu, down to Nonparametric Tests.

It's called nonparametric because we're not using parameters like means and standard deviations. Then I come over to One Sample. And here it will do a lot of things automatically, but I'm going to be a little bit selective and customize it to actually make things simpler for right now. The first thing I'm going to do is I'm going to come here to Fields, and that really means variables. And right now it's putting in nearly every variable. It would test for equality of distribution on categorical variables, and it would also test for scale variables, whether they are normally distributed like a bell curve.

I don't want to do all of that, so what I'm going to do is I'm going to take all of these variables, I'm going to put them back into the original field. The only test variable that I want is this one: Read Novel, Poem, or Play. So I'll double-click to move that over. Then I go to the Settings tab to choose exactly what test it is that I want to do. Now I'm going to do Customized tests here, and I'm going to choose Compare the observed binary probability--binary means two answers: yes or no--to the hypothesized value with what's called the binomial tests.

And click on Options, and what it's going to do is it's going to do a hypothesis test to see if the proportion of people who say they've read a novel, poem, or play in the last year is statistically significantly different from a hypothesized proportion, which right now I'll leave at 50%. I can also get what's called the confidence interval. That's like the plus or minus 3% in a political poll. Now sometimes you can use conventional statistics, but right here SPSS is doing a very nice thing and it's letting me use what's called an exact statistic. In this case, it's called the Clopper- Pearson for the confidence interval.

We don't need to go into any details except to say this would be a good choice. So I'm just going to click on that and I'm going to come down and press OK, and then I'm going to press Run. Now the output for this looks little different from what we've had so far, because it's a table with colors and shading in it. Also, it's not showing me everything right now. This is actually what's called a model viewer. Now right now, all it's telling me is that the proportion of people who say they've read a novel, poem, or play in the last year is significantly different from 50%. It's not telling me what's the actual proportion was or how far away it is, but I can get that through going onto the Model Viewer.

I'll double-click here and it brings up the Model Viewer. I'll maximize that window. And what I have here is the output that I saw on the other page. It tells me that the proportion of people who say they've read one of these is not 50%. It's significantly different from 50%. In fact, what I can do is I can come over here and the hypothesized, that 50%, is this blue bar right here. But what I really have is an observed 71% of the people say that they've read a novel, poem, or play in the last year. That's out of 349 people, and this tells me that that is significantly different from 0.

To get the confidence interval, I need to do one other thing. I come back over to this left pane and I go down to where it says View. Right now we're looking at the Hypothesis Summary. If I click on that, I can get the Confidence Interval Summary. It's a slightly different table here, and it tells me how it calculated the confidence interval by using the Clopper-Pearson. It tells me what the Parameter was, the probability that a person read a novel, a poem, or play in the last year. It tells me that the proportion of people who said yes, because they put ones instead of zeroed, is 71%. That corresponds to what I have over here.

The yes is the 71%. The confidence interval at the 95% confidence interval, which is the most common, is from 66% to 76%. And what this means is that while in my sample of 349 people 71% may have said they've read these, in the population of those 349 people came from, the true value could be somewhere between 66% and 76%. This is like the plus or minus 5% that you would get from a political poll. So the new nonparametric tests in SPSS is actually a very flexible procedure that can perform an entire range of tests all on its own.

It's also the easiest way to get confidence intervals and hypothesis tests for a single proportion. We'll come back to this procedure in another movie on testing nominal variables with multiple categories, but for now this should give you a good start on dealing with inferential statistics for dichotomous variables in SPSS. In the next movie, we'll look at common tests for scale variables.

Show transcript

#### This video is part of

SPSS Statistics Essential Training (2011)

52 video lessons · 21709 viewers

Author

Expand all | Collapse all
1. ### Introduction

2m 58s
1. Welcome
1m 5s
2. Using the exercise files
40s
3. Using a different version of the software
1m 13s
2. ### 1. Getting Started

19m 0s
1. Taking a first look at the interface
11m 49s
7m 11s
3. ### 2. Charts for One Variable

21m 54s
1. Creating bar charts for categorical variables
7m 18s
2. Creating pie charts for categorical variables
2m 54s
3. Creating histograms for quantitative variables
5m 45s
4. Creating box plots for quantitative variables
5m 57s
4. ### 3. Modifying Data

33m 10s
1. Recoding variables
5m 33s
2. Recoding with visual binning
5m 33s
3. Recoding by ranking cases
5m 26s
4. Computing new variables
5m 37s
5. Combining or excluding outliers
5m 21s
6. Transforming outliers
5m 40s
5. ### 4. Working with the Data File

28m 12s
1. Selecting cases
6m 44s
2. Using the Split File command
5m 12s
3. Merging files
5m 33s
4. Using the Multiple Response command
10m 43s
6. ### 5. Descriptive Statistics for One Variable

22m 14s
1. Calculating frequencies
8m 43s
2. Calculating descriptives
5m 31s
3. Using the Explore command
8m 0s
7. ### 6. Inferential Statistics for One Variable

16m 3s
1. Calculating inferential statistics for a single proportion
6m 6s
2. Calculating inferential statistics for a single mean
5m 39s
3. Calculating inferential statistics for a single categorical variable
4m 18s
8. ### 7. Charts for Two Variables

30m 43s
1. Creating clustered bar charts
7m 10s
2. Creating scatterplots
5m 8s
3. Creating time series
3m 24s
4. Creating simple bar charts of group means
4m 17s
5. Creating population pyramids
3m 0s
6. Creating simple boxplots for groups
3m 3s
7. Creating side-by-side boxplots
4m 41s
9. ### 8. Descriptive and Inferential Statistics for Two Variables

45m 28s
1. Calculating correlations
8m 17s
2. Computing a bivariate regression
6m 27s
3. Creating crosstabs for categorical variables
6m 34s
4. Comparing means with the Means procedure
6m 33s
5. Comparing means with the t-test
6m 4s
6. Comparing means with a one-way ANOVA
6m 30s
7. Comparing paired means
5m 3s
10. ### 9. Charts for Three or More Variables

24m 30s
1. Creating clustered bar charts for frequencies
6m 34s
2. Creating clustered bar charts for means
3m 45s
3. Creating scatterplots by group
4m 13s
4. Creating 3-D scatterplots
4m 25s
5. Creating scatterplot matrices
5m 33s
11. ### 10. Descriptive Statistics for Three or More Variables

30m 57s
1. Using Automatic Linear Models
11m 52s
2. Calculating multiple regression
9m 3s
3. Comparing means with a two-factor ANOVA
10m 2s
12. ### 11. Formatting and Exporting Tables and Charts

29m 29s
1. Formatting descriptive statistics
6m 1s
2. Formatting correlations
7m 49s
3. Formatting regression
10m 19s
4. Exporting charts and tables
5m 20s
13. ### Conclusion

51s
1. What's next
51s

### Start learning today

Sometimes @lynda teaches me how to use a program and sometimes Lynda.com changes my life forever. @JosefShutter
@lynda lynda.com is an absolute life saver when it comes to learning todays software. Definitely recommend it! #higherlearning @Michael_Caraway
@lynda The best thing online! Your database of courses is great! To the mark and very helpful. Thanks! @ru22more
Got to create something yesterday I never thought I could do. #thanks @lynda @Ngventurella
I really do love @lynda as a learning platform. Never stop learning and developing, it’s probably our greatest gift as a species! @soundslikedavid
@lynda just subscribed to lynda.com all I can say its brilliant join now trust me @ButchSamurai
@lynda is an awesome resource. The membership is priceless if you take advantage of it. @diabetic_techie
One of the best decision I made this year. Buy a 1yr subscription to @lynda @cybercaptive
guys lynda.com (@lynda) is the best. So far I’ve learned Java, principles of OO programming, and now learning about MS project @lucasmitchell
Signed back up to @lynda dot com. I’ve missed it!! Proper geeking out right now! #timetolearn #geek @JayGodbold
Share a link to this course

### What are exercise files?

Exercise files are the same files the author uses in the course. Save time by downloading the author's files instead of setting up your own files, and learn by following along with the instructor.

### Can I take this course without the exercise files?

Yes! If you decide you would like the exercise files later, you can upgrade to a premium account any time.

How to use exercise files.

Learn by watching, listening, and doing, Exercise files are the same files the author uses in the course, so you can download them and follow along Premium memberships include access to all exercise files in the library.

Exercise files

How to use exercise files.

This course includes free exercise files, so you can practice while you watch the course. To access all the exercise files in our library, become a Premium Member.

Are you sure you want to mark all the videos in this course as unwatched?

This will not affect your course history, your reports, or your certificates of completion for this course.

Congratulations

You have completed SPSS Statistics Essential Training (2011).

Course retiring soon

SPSS Statistics Essential Training (2011) will be retired from the lynda.com library on December 17, 2014. Training videos and exercise files will no longer be available, but the course will still appear in your course history and certificates of completion. For updated training, check out the all new SPSS Statistics Essential Training in the lynda.com Online Training Library.

Become a member to add this course to a playlist

Join today and get unlimited access to the entire library of video courses—and create as many playlists as you like.

Become a member to like this course.

Join today and get unlimited access to the entire library of video courses.

Exercise files

Learn by watching, listening, and doing! Exercise files are the same files the author uses in the course, so you can download them and follow along. Exercise files are available with all Premium memberships. Learn more

How to use exercise files.

Thanks for contacting us.
You’ll hear from our Customer Service team within 24 hours.

Please enter the text shown below:

The classic layout automatically defaults to the latest Flash Player.

To choose a different player, hold the cursor over your name at the top right of any lynda.com page and choose Site preferences from the dropdown menu.

• Mark video as unwatched
• Mark all as unwatched
Exercise files

Access exercise files from a button right under the course name.

Mark videos as unwatched

Remove icons showing you already watched videos if you want to start over.

Make the video wide, narrow, full-screen, or pop the player out of the page into its own window.

Interactive transcripts

Click on text in the transcript to jump to that spot in the video. As the video plays, the relevant spot in the transcript will be highlighted.

#### Get our Annual Premium Membership at our best savings yet.

Thanks for signing up.

We’ll send you a confirmation email shortly.

• new course releases
• general communications
• special notices

Keep up with news, tips, and latest courses with emails from lynda.com.

• new course releases