From the course: Machine Learning and AI Foundations: Value Estimations

Unlock the full course today

Join today to access over 22,500 courses taught by industry experts or purchase this course individually.

Decide how much data you need

Decide how much data you need

From the course: Machine Learning and AI Foundations: Value Estimations

Start my 1-month free trial

Decide how much data you need

- [Instructor] In our dataset, we have house sale records for over 40,000 houses. For each house, we capture 18 separate features. The year it was built, how many bedrooms it has, and so on. When you're using machine learning in your own programs, you might not have access to tens of thousands of records. So how much data do you actually need to use machine learning successfully? Machine learning algorithms work best when your dataset covers all possible combinations of features in your model. For example, we want our home price dataset to include prices for big houses with lots of bathrooms and no garage and no pool, but also big houses with lots of bathrooms and no garage but with a pool. The more combinations that are represented, the better the model can do at capturing how each of these attributes affects the house's final price. If your dataset doesn't have a data point for a certain combination of features, the machine learning model won't be able to make a very good…

Contents