From the course: Data Science Foundations: Data Assessment for Predictive Modeling
Unlock the full course today
Join today to access over 22,600 courses taught by industry experts or purchase this course individually.
Anscombe's quartet
From the course: Data Science Foundations: Data Assessment for Predictive Modeling
Anscombe's quartet
- [Instructor] Okay, let's talk about Anscombe's quartet. This data, which you can find in the Originals folder, was developed by a statistician named Francis Anscombe way back in 1973. It has become justifiably famous as a cautionary tale. What he's trying to caution us about is don't pay attention only to descriptive statistics, and the absence of a graphical representation. So here's the trick. To understand what this demonstration is all about, you want to treat these various pairs almost as if they're separate datasets. We have an X1, Y1 pair, X2, Y2, X3, Y3 and so on. So let's first do some basic descriptive statistics. We can keep it very basic indeed. I'll go ahead and calculate an average here and then I'll close off the parens and we see that we get an average. And I'm just going to drag that across so that we can get an average for everything. And what do we have? We've got an average of nine on all of…
Practice while you learn with exercise files
Download the files the instructor uses to teach the course. Follow along and learn by watching, listening and practicing.
Contents
-
-
-
-
-
-
-
-
-
(Locked)
The explore data task1m 1s
-
(Locked)
How to be effective doing univariate analysis and data visualization3m 18s
-
(Locked)
Anscombe's quartet6m 26s
-
(Locked)
The Data Explorer node feature in KNIME5m 14s
-
(Locked)
How to navigate borderline cases of variable type5m 11s
-
(Locked)
How to be effective in doing bivariate data visualization8m 34s
-
(Locked)
Challenge: Producing bivariate visualizations for case study 11m 18s
-
(Locked)
Solution: Producing bivariate visualizations for case study 15m 40s
-
(Locked)
-
-
-
-
-
-