From the course: Mistakes to Avoid in Machine Learning
Unlock the full course today
Join today to access over 22,600 courses taught by industry experts or purchase this course individually.
Not treating for imbalanced sampling
From the course: Mistakes to Avoid in Machine Learning
Not treating for imbalanced sampling
- [Brett] In machine learning classification, you'll often encounter problems where the target variable you're trying to predict occurs very infrequently. And as a result, it can be very difficult to develop a good prediction. This is a problem of an imbalanced distribution and many machine learning techniques struggled to produce meaningful results. So imbalanced data is something you encounter in a classification problem in which the number of observations per class are disproportionately distributed. These problems can be difficult to solve, but luckily, there are several sampling techniques that you can leverage to improve your results. These problems can be difficult to solve, but luckily, there are several sampling techniques that you can leverage to improve your results. Introducing the imbalanced learn, imb-learn package. This is an excellent package which offers a variety of sampling techniques for dealing with…
Practice while you learn with exercise files
Download the files the instructor uses to teach the course. Follow along and learn by watching, listening and practicing.
Contents
-
-
-
Assuming data is good to go2m 2s
-
(Locked)
Neglecting to consult subject matter experts1m 48s
-
(Locked)
Overfitting your models3m 25s
-
(Locked)
Not standardizing your data2m 57s
-
(Locked)
Focusing on the wrong factors2m 11s
-
(Locked)
Data leakage2m 40s
-
(Locked)
Forgetting traditional statistics tools1m 57s
-
(Locked)
Assuming deployment is a breeze1m 47s
-
(Locked)
Assuming machine learning is the answer1m 35s
-
(Locked)
Developing in a silo2m 16s
-
(Locked)
Not treating for imbalanced sampling3m 29s
-
(Locked)
Interpreting your coefficients without properly treating for multicollinearity3m 19s
-
(Locked)
Evaluating by accuracy alone6m 8s
-
(Locked)
Giving overly technical presentations1m 56s
-
-