See that cluster analysis maximizes Wilks' Lambda by repetitive reassignment of subjects to clusters.
- [Instructor] The prior lesson mentioned…how the analysis of variants or ANOVA, enables you…to judge whether two or more groups are reliably different…on some quantitative measure such as revenue,…cholesterol level, or miles per gallon,…but there's nothing about ANOVA or about…its multivariate counterpart, MANOVA,…that implies causation.…That's a matter for the design of an experiment,…not for the statistical test.…Suppose that you were studying the effect…of exercise versus diet on cholesterol levels.…You're working with two hospitals that each…have 30 patients who want to lower their LDL levels,…LDL is one type of cholesterol.…
You have the patients at one hospital…go on a vegetarian diet for two months…while the patients at the other hospital…participate in a program of cardiovascular exercise…for the same two months.…At the end of the experiment, you find…that the average LDL level in the exercise group…is 20 points higher than in the diet group.…An analysis of variance indicates that a difference…that large would come about through random chance…
In this course, Conrad Carlberg explains how to carry out cluster analysis and principal components analysis using Microsoft Excel, which tends to show more clearly what's going on in the analysis. Then he explains how to carry out the same analysis using R, the open-source statistical computing software, which is faster and richer in analysis options than Excel. Plus, he walks through how to merge the results of cluster analysis and factor analysis to help you break down a few underlying factors according to individuals' membership in just a few clusters.
- Reviewing the problems created by an overabundance of data
- Understanding the rationale for clustering and principal components analysis
- Using Excel to extract principal components
- Using R to extract principal components
- Using R for cluster analysis
- Using Excel for cluster analysis
- Setting up confusion tables in Excel
- Using cluster analysis and factor analysis in concert