Join Conrad Carlberg for an in-depth discussion in this video Multivariate nature of clustering, part of Business Analytics: Data Reduction Techniques Using Excel and R.
- [Instructor] In previous lessons,…you've had a very quick look at a pair of…statistical techniques called the analysis of variance,…or ANOVA, and multivariate analysis of variants, or MANOVA.…With ANOVA, we use one outcome variable,…and with MANOVA, we use multiple outcome variables.…You saw one way to decide whether two or more groups…are reliably different from one another…is to calculate the mean value for each group…on some outcome variable and see whether…the variability within the group is greater or less than…the variability between the groups.…
If the differences between groups are greater than…the differences within groups, you might conclude…that the differences between groups are reliable…and that if you ran the same experiment again…you would get a similar result.…We've also seen that even if our reliable difference…exists between the groups, that doesn't necessarily mean…that the groups, such as sex or political party,…are the cause of the difference in the outcome variable.…Without a solid experimental design,…
In this course, Conrad Carlberg explains how to carry out cluster analysis and principal components analysis using Microsoft Excel, which tends to show more clearly what's going on in the analysis. Then he explains how to carry out the same analysis using R, the open-source statistical computing software, which is faster and richer in analysis options than Excel. Plus, he walks through how to merge the results of cluster analysis and factor analysis to help you break down a few underlying factors according to individuals' membership in just a few clusters.
- Reviewing the problems created by an overabundance of data
- Understanding the rationale for clustering and principal components analysis
- Using Excel to extract principal components
- Using R to extract principal components
- Using R for cluster analysis
- Using Excel for cluster analysis
- Setting up confusion tables in Excel
- Using cluster analysis and factor analysis in concert