From the course: Data Science Foundations: Data Assessment for Predictive Modeling

Unlock the full course today

Join today to access over 22,600 courses taught by industry experts or purchase this course individually.

The explore data task

The explore data task

From the course: Data Science Foundations: Data Assessment for Predictive Modeling

Start my 1-month free trial

The explore data task

- [Instructor] The explore data task is the bulk of the effort that you will spend on the entire phase. And it's easy to get distracted because you'll make some interesting discoveries along the way. It will feel almost like routine reporting and data visualization at times. But remember that we have somewhat different priorities. Try to remember that you're especially focused on things that look out of place. The reason that outliers, quirks, and various oddities deserve your attention is that you'll need to chat with a subject matter expert about them, so you'll want to discover them early. Basically anything that makes you wonder if it's the real value or might possibly be a mistake. So how do you go about it? Based upon the level of measurement run basic stats in simple graphics, you're looking for anything interesting that either indicates that a variable might be promising, that you have a question for an SME,…

Contents