From the course: Mistakes to Avoid in Machine Learning

Unlock the full course today

Join today to access over 22,600 courses taught by industry experts or purchase this course individually.

Not treating for imbalanced sampling

Not treating for imbalanced sampling

From the course: Mistakes to Avoid in Machine Learning

Start my 1-month free trial

Not treating for imbalanced sampling

- [Brett] In machine learning classification, you'll often encounter problems where the target variable you're trying to predict occurs very infrequently. And as a result, it can be very difficult to develop a good prediction. This is a problem of an imbalanced distribution and many machine learning techniques struggled to produce meaningful results. So imbalanced data is something you encounter in a classification problem in which the number of observations per class are disproportionately distributed. These problems can be difficult to solve, but luckily, there are several sampling techniques that you can leverage to improve your results. These problems can be difficult to solve, but luckily, there are several sampling techniques that you can leverage to improve your results. Introducing the imbalanced learn, imb-learn package. This is an excellent package which offers a variety of sampling techniques for dealing with…

Contents