From the course: Building Recommender Systems with Machine Learning and AI

Identify and eliminate outlier users

- [Instructor] Let's pause for another hands-on exercise. Revisit our EvaluateUserCF script again, and this time, modify the recommender so that users whose rating count is more than three standard deviations from the mean are excluded from consideration. This will eliminate so-called super users, who have an outsized impact on your results. Then measure the effect of filtering them out. To do this, you'll probably want to focus your attention on the MovieLens module again. The best place to filter out these outliers is in the function that actually loads the MovieLens dataset itself. Doing this easily will require some familiarity with the pandas module. So if you're new to pandas, you might want to just skip to my solution and learn from it. But if you are starting to feel comfortable with pandas, give it a shot yourself and compare your results to mine up next.
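If you want a starting point before looking at the solution, the filtering step itself might look something like the sketch below. This is not the course's solution, only one possible approach: the function name `filter_outlier_users` and its parameters are hypothetical, and it assumes the ratings are already in a pandas DataFrame with a `userId` column, as in the MovieLens `ratings.csv` file. You would still need to wire it into the function that loads the dataset.

```python
import pandas as pd


def filter_outlier_users(ratings: pd.DataFrame,
                         user_col: str = "userId",
                         num_std: float = 3.0) -> pd.DataFrame:
    """Drop every rating made by users whose rating count is an outlier.

    Hypothetical helper: the name, parameters, and column name are
    assumptions for illustration, not part of the course's MovieLens module.
    """
    # Number of ratings submitted by each user.
    counts = ratings.groupby(user_col).size()

    mean = counts.mean()
    std = counts.std()  # sample standard deviation (ddof=1)

    # With a single user the standard deviation is undefined; keep everything.
    if pd.isna(std):
        return ratings.copy()

    # Keep users whose count lies within num_std standard deviations of the mean.
    within_range = (counts - mean).abs() <= num_std * std
    keep_users = counts[within_range].index

    return ratings[ratings[user_col].isin(keep_users)]


# Tiny made-up example: 20 ordinary users with 2 ratings each,
# plus one heavy rater (user 99) with 200 ratings.
rows = [(user, movie, 3.0) for user in range(1, 21) for movie in (1, 2)]
rows += [(99, movie, 4.0) for movie in range(200)]
demo = pd.DataFrame(rows, columns=["userId", "movieId", "rating"])

filtered = filter_outlier_users(demo)
print(filtered["userId"].nunique())  # user 99 is gone, 20 users remain
```

Note that the mean and standard deviation are computed over per-user rating counts, not over the rating values themselves. Running your evaluation once on the full data and once on the filtered data lets you measure the effect of removing the super users.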
