From the course: Data Science Foundations: Data Assessment for Predictive Modeling
Unlock the full course today
Join today to access over 22,600 courses taught by industry experts or purchase this course individually.
The explore data task
From the course: Data Science Foundations: Data Assessment for Predictive Modeling
The explore data task
- [Instructor] The explore data task is the bulk of the effort that you will spend on the entire phase. And it's easy to get distracted because you'll make some interesting discoveries along the way. It will feel almost like routine reporting and data visualization at times. But remember that we have somewhat different priorities. Try to remember that you're especially focused on things that look out of place. The reason that outliers, quirks, and various oddities deserve your attention is that you'll need to chat with a subject matter expert about them, so you'll want to discover them early. Basically anything that makes you wonder if it's the real value or might possibly be a mistake. So how do you go about it? Based upon the level of measurement run basic stats in simple graphics, you're looking for anything interesting that either indicates that a variable might be promising, that you have a question for an SME,…
Practice while you learn with exercise files
Download the files the instructor uses to teach the course. Follow along and learn by watching, listening and practicing.
Contents
-
-
-
-
-
-
-
-
-
(Locked)
The explore data task1m 1s
-
(Locked)
How to be effective doing univariate analysis and data visualization3m 18s
-
(Locked)
Anscombe's quartet6m 26s
-
(Locked)
The Data Explorer node feature in KNIME5m 14s
-
(Locked)
How to navigate borderline cases of variable type5m 11s
-
(Locked)
How to be effective in doing bivariate data visualization8m 34s
-
(Locked)
Challenge: Producing bivariate visualizations for case study 11m 18s
-
(Locked)
Solution: Producing bivariate visualizations for case study 15m 40s
-
(Locked)
-
-
-
-
-
-