Easy-to-follow video tutorials help you learn software, creative, and business skills.Become a member

Using the Explore command

From: SPSS Statistics Essential Training (2011)

Video: Using the Explore command

SPSS has a number of really wonderful tools for helping you to get an in-depth understanding of your data. We've already looked at the Frequencies and Descriptive commands, which can give you nearly everything you need under normal circumstances. However, there are times when you need to look at things even more closely and this is where SPSS's Explore command comes in, with more ways to look at univariate statistics than you can shake a stick at, and let's look at some of those possibilities. To get to the Explore command you go up to the Analyze menu, to Descriptives, to Explore.

Using the Explore command

SPSS has a number of really wonderful tools for helping you to get an in-depth understanding of your data. We've already looked at the Frequencies and Descriptive commands, which can give you nearly everything you need under normal circumstances. However, there are times when you need to look at things even more closely and this is where SPSS's Explore command comes in, with more ways to look at univariate statistics than you can shake a stick at, and let's look at some of those possibilities. To get to the Explore command you go up to the Analyze menu, to Descriptives, to Explore.

What you have here is a list of all the variables, both categorical and scale on the side, and a number of options here. What we are going to do is take the variables that we want and put them in the Dependent list. Now the term Dependent here means dependent variable, or an outcome variable, or the variables that you want a statistics on. In this case, I'll use the same ones that I used in the last ones. I'll use LastSale and I will use MarketCap.

Now Factor List is in case I want to break down the list. For instance, if I wanted to do LastSale and MarketCap by different sectors. I could do that, but there are 12 different sectors and at the moment I don't feel a need for it. I can also label the cases, and this can be handy because this will give me some charts that show outliers, and in fact I'm going to do that by coming up and getting a stock symbol and putting that down there. Then I want to go through some of the options over here.

I can choose what statistics Explore gives to me. I click on Statistics and by default it's going to give me the mean and a confidence interval for the mean. That's an indication of how spread out things are in mean and also given our sample what we think a true population value might be. We also have what are called M-estimators. That's a whole family of advanced what are called robust estimators that work well when things are skewed or they're outliers, but it's rather advanced.

We are not going to deal with that. I can also get information about outliers, which might label them individually. I could do that. I don't think we need to. I could also get percentiles, where for instance it gives me the values for the 5th, 10th, 25th, 50th, 75th, 90th, and 95th percentiles. You can do it manually in the Frequencies command, but it's nice to have it as a one-click option. However, I usually don't need that, so I am going to skip it right here. I'm just going to click Continue. So I am leaving the statistics at the default.

It has given me a ton. Next, I am going to look at Plots or the graphs. Now the first thing you can do is give me box plots, and we've done those separately in the univariate charts. And it's going to factor the levels together, which is fine, because I'm not splitting up the factors. It can also give me something called a stem-and-leaf plot, which is something that's normally drawn by hand, but I will show you that in a moment. I can get a histogram if I wanted. I've done those before, but I can get them additionally here. The next one is normality plots with tests.

This is a series of plots that are designed to see how well your data fit a symmetrical normal distribution-- that's a mathematical definition of a bell curve. Normality is the term for it, and that's important for a lot of statistics, but the normality plots can be a little tricky to read, and usually you can eyeball and see if your data seems to be behaving well, the way they would work well with a lot of other statistics. So I am going to skip both of those. I'll just click Continue, and let's take a quick look at options.

Now this is one where it asks what to do with missing values in case I'm looking at more than one variable in my Dependent List, which I am. The question is whether I want to exclude cases listwise or pairwise. And this is something that comes up in a number of other procedures, and it's worth pointing out. When you exclude cases listwise, what that means is you only include the case if it has information on every variable that you're including. So let's say I had ten variables in the Dependent list. If a case was missing information on one of those, it would not be included.

On the other hand, pairwise says include them whenever they have variables with some information. So it makes maximum use of the information, but you can end up with very different sample sizes, and there are procedures where it's very important to keep the sample sizes consistent going across. For Explore, that's a judgment call. You can do it either way. You can do it both if you want, one after the other. But I am just going to keep it listwise for now, the way it is. Click Continue and then down here it gives me the option to display just the statistics, just the plots, or both.

I will leave it at both, which is the default. I click OK and I get a lot of output. The first one tells me how many cases there are and whether they have valid data, how many are missing. There are 2,816 cases with missing data, and in each case I have four that are missing information on LastSale and MarketCap. That's just 1/10th of 1%. Then I have a table called Descriptives. I scroll down and I have the mean. The mean for LastSale is $18.7, and I've seen these statistics elsewhere, but this one gives me a confidence interval for the mean, which is an inferential statistic, and we will see more about those in the next section.

We also have something called a 5% Trimmed Mean. It shows us a way the highest and lowest few percentage points of the data and gives a slightly more stable estimate. We have the median and the indicators that spread with the variance and the standard deviation, and then we have several other statistics: the quartile and the skewness and kurtosis. So this is a lot of statistics that it gives all at once. You don't need all of them, but the nice thing is that they are available there. The second column, by the way, gives what are called standard error estimates for a few of the statistics, for the mean, the skewness, and the kurtosis.

These are sometimes used as inferential statistics, but we don't need to worry about them right now. Then it repeats the table for the second variable, market capitalization. Then we have what are called the stem-and-leaf plots. These are ones that are usually drawn by hand, and what is it is it takes the values and splits them up into two-digit numbers, where the first digit is what's called the stem, and it forms the line here on the side. The second number is the leaf, and the neat thing about this is this can be read as a histogram.

It's sort of a sideways histogram. But it also maintains the actual numerical values. So it's both a literal display of the data and a chart of a histogram, and then it marks some extreme cases separately at the bottom. Then here's a box plot. This is labeling the cases by their stock prices, and then we do a similar thing for market capitalization. So the biggest impression you might get might be that the Explore procedure is good for producing enormous amounts of output. It can be overwhelming, but if you really want to get the best picture or meaning the most comprehensive, not necessarily the most interpretable or useful picture, then the Explore command is the procedure of choice.

It can give you stem-and-leaf plots. It can give you confidence intervals and trimmed means. It can give you robust estimators. It can give you normality plots, among other things, if you ask for them, all of which recommend its use in particular circumstances. On the other hand, the slightly simpler procedures of Frequencies and Descriptives can still give you nearly all of what you need without deluging you with output. Nevertheless, if there's one thing SPSS is good at, it's providing you with options, and the Explore command is one with especially rich options and analytical value.

Show transcript

This video is part of

Image for SPSS Statistics Essential Training (2011)
SPSS Statistics Essential Training (2011)

52 video lessons · 20054 viewers

Barton Poulson
Author

 
Expand all | Collapse all
  1. 2m 58s
    1. Welcome
      1m 5s
    2. Using the exercise files
      40s
    3. Using a different version of the software
      1m 13s
  2. 19m 0s
    1. Taking a first look at the interface
      11m 49s
    2. Reading data from a spreadsheet
      7m 11s
  3. 21m 54s
    1. Creating bar charts for categorical variables
      7m 18s
    2. Creating pie charts for categorical variables
      2m 54s
    3. Creating histograms for quantitative variables
      5m 45s
    4. Creating box plots for quantitative variables
      5m 57s
  4. 33m 10s
    1. Recoding variables
      5m 33s
    2. Recoding with visual binning
      5m 33s
    3. Recoding by ranking cases
      5m 26s
    4. Computing new variables
      5m 37s
    5. Combining or excluding outliers
      5m 21s
    6. Transforming outliers
      5m 40s
  5. 28m 12s
    1. Selecting cases
      6m 44s
    2. Using the Split File command
      5m 12s
    3. Merging files
      5m 33s
    4. Using the Multiple Response command
      10m 43s
  6. 22m 14s
    1. Calculating frequencies
      8m 43s
    2. Calculating descriptives
      5m 31s
    3. Using the Explore command
      8m 0s
  7. 16m 3s
    1. Calculating inferential statistics for a single proportion
      6m 6s
    2. Calculating inferential statistics for a single mean
      5m 39s
    3. Calculating inferential statistics for a single categorical variable
      4m 18s
  8. 30m 43s
    1. Creating clustered bar charts
      7m 10s
    2. Creating scatterplots
      5m 8s
    3. Creating time series
      3m 24s
    4. Creating simple bar charts of group means
      4m 17s
    5. Creating population pyramids
      3m 0s
    6. Creating simple boxplots for groups
      3m 3s
    7. Creating side-by-side boxplots
      4m 41s
  9. 45m 28s
    1. Calculating correlations
      8m 17s
    2. Computing a bivariate regression
      6m 27s
    3. Creating crosstabs for categorical variables
      6m 34s
    4. Comparing means with the Means procedure
      6m 33s
    5. Comparing means with the t-test
      6m 4s
    6. Comparing means with a one-way ANOVA
      6m 30s
    7. Comparing paired means
      5m 3s
  10. 24m 30s
    1. Creating clustered bar charts for frequencies
      6m 34s
    2. Creating clustered bar charts for means
      3m 45s
    3. Creating scatterplots by group
      4m 13s
    4. Creating 3-D scatterplots
      4m 25s
    5. Creating scatterplot matrices
      5m 33s
  11. 30m 57s
    1. Using Automatic Linear Models
      11m 52s
    2. Calculating multiple regression
      9m 3s
    3. Comparing means with a two-factor ANOVA
      10m 2s
  12. 29m 29s
    1. Formatting descriptive statistics
      6m 1s
    2. Formatting correlations
      7m 49s
    3. Formatting regression
      10m 19s
    4. Exporting charts and tables
      5m 20s
  13. 51s
    1. What's next
      51s

Start learning today

Get unlimited access to all courses for just $25/month.

Become a member
Sometimes @lynda teaches me how to use a program and sometimes Lynda.com changes my life forever. @JosefShutter
@lynda lynda.com is an absolute life saver when it comes to learning todays software. Definitely recommend it! #higherlearning @Michael_Caraway
@lynda The best thing online! Your database of courses is great! To the mark and very helpful. Thanks! @ru22more
Got to create something yesterday I never thought I could do. #thanks @lynda @Ngventurella
I really do love @lynda as a learning platform. Never stop learning and developing, it’s probably our greatest gift as a species! @soundslikedavid
@lynda just subscribed to lynda.com all I can say its brilliant join now trust me @ButchSamurai
@lynda is an awesome resource. The membership is priceless if you take advantage of it. @diabetic_techie
One of the best decision I made this year. Buy a 1yr subscription to @lynda @cybercaptive
guys lynda.com (@lynda) is the best. So far I’ve learned Java, principles of OO programming, and now learning about MS project @lucasmitchell
Signed back up to @lynda dot com. I’ve missed it!! Proper geeking out right now! #timetolearn #geek @JayGodbold
Share a link to this course

What are exercise files?

Exercise files are the same files the author uses in the course. Save time by downloading the author's files instead of setting up your own files, and learn by following along with the instructor.

Can I take this course without the exercise files?

Yes! If you decide you would like the exercise files later, you can upgrade to a premium account any time.

Become a member Download sample files See plans and pricing

Please wait... please wait ...
Upgrade to get access to exercise files.

Exercise files video

How to use exercise files.

Learn by watching, listening, and doing, Exercise files are the same files the author uses in the course, so you can download them and follow along Premium memberships include access to all exercise files in the library.


Exercise files

Exercise files video

How to use exercise files.

For additional information on downloading and using exercise files, watch our instructional video or read the instructions in the FAQ.

This course includes free exercise files, so you can practice while you watch the course. To access all the exercise files in our library, become a Premium Member.

Join now "Already a member? Log in

Are you sure you want to mark all the videos in this course as unwatched?

This will not affect your course history, your reports, or your certificates of completion for this course.


Mark all as unwatched Cancel

Congratulations

You have completed SPSS Statistics Essential Training (2011).

Return to your organization's learning portal to continue training, or close this page.


OK
Become a member to add this course to a playlist

Join today and get unlimited access to the entire library of video courses—and create as many playlists as you like.

Get started

Already a member?

Become a member to like this course.

Join today and get unlimited access to the entire library of video courses.

Get started

Already a member?

Exercise files

Learn by watching, listening, and doing! Exercise files are the same files the author uses in the course, so you can download them and follow along. Exercise files are available with all Premium memberships. Learn more

Get started

Already a Premium member?

Exercise files video

How to use exercise files.

Ask a question

Thanks for contacting us.
You’ll hear from our Customer Service team within 24 hours.

Please enter the text shown below:

The classic layout automatically defaults to the latest Flash Player.

To choose a different player, hold the cursor over your name at the top right of any lynda.com page and choose Site preferencesfrom the dropdown menu.

Continue to classic layout Stay on new layout
Exercise files

Access exercise files from a button right under the course name.

Mark videos as unwatched

Remove icons showing you already watched videos if you want to start over.

Control your viewing experience

Make the video wide, narrow, full-screen, or pop the player out of the page into its own window.

Interactive transcripts

Click on text in the transcript to jump to that spot in the video. As the video plays, the relevant spot in the transcript will be highlighted.

Are you sure you want to delete this note?

No

Your file was successfully uploaded.

Thanks for signing up.

We’ll send you a confirmation email shortly.


Sign up and receive emails about lynda.com and our online training library:

Here’s our privacy policy with more details about how we handle your information.

Keep up with news, tips, and latest courses with emails from lynda.com.

Sign up and receive emails about lynda.com and our online training library:

Here’s our privacy policy with more details about how we handle your information.

   
submit Lightbox submit clicked
Terms and conditions of use

We've updated our terms and conditions (now called terms of service).Go
Review and accept our updated terms of service.